Filter
Conference contribution

Search results

  • 2021

    Understanding Effectiveness of Multi-Error-Bounded Lossy Compression for Preserving Ranges of Interest in Scientific Analysis

    Lin, Y., Di, S., Zhao, K., Jin, S., Wang, C., Chard, K., Tao, D., Foster, I. & Cappello, F., 2021, Proceedings of DRBSD-7 2021: 7th International Workshop on Data Analysis and Reduction for Big Scientific Data, Held in conjunction with SC 2021: The International Conference for High Performance Computing, Networking, Storage and Analysis. Institute of Electrical and Electronics Engineers Inc., p. 40-46 7 p. (Proceedings of DRBSD-7 2021: 7th International Workshop on Data Analysis and Reduction for Big Scientific Data, Held in conjunction with SC 2021: The International Conference for High Performance Computing, Networking, Storage and Analysis).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2020

    CuSZ: An efficient GPU-based error-bounded lossy compression framework for scientific data

    Tian, J., Di, S., Zhao, K., Rivera, C., Fulp, M. H., Underwood, R., Jin, S., Liang, X., Calhoun, J., Tao, D. & Cappello, F., Sep 30 2020, PACT 2020 - Proceedings of the ACM International Conference on Parallel Architectures and Compilation Techniques. Institute of Electrical and Electronics Engineers Inc., p. 3-15 13 p. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • DeepClone: Lightweight State Replication of Deep Learning Models for Data Parallel Training

    Nicolae, B., Wozniak, J. M., Dorier, M. & Cappello, F., Sep 2020, Proceedings - 2020 IEEE International Conference on Cluster Computing, CLUSTER 2020. Institute of Electrical and Electronics Engineers Inc., p. 226-236 11 p. 9229626. (Proceedings - IEEE International Conference on Cluster Computing, ICCC; vol. 2020-September).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • DeepFreeze: Towards Scalable Asynchronous Checkpointing of Deep Learning Models

    Nicolae, B., Li, J., Wozniak, J. M., Bosilca, G., Dorier, M. & Cappello, F., May 2020, Proceedings - 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGRID 2020. Lefevre, L., Varela, C. A., Pallis, G., Toosi, A. N., Rana, O. & Buyya, R. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 172-181 10 p. 9139666. (Proceedings - 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGRID 2020).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • FRaZ: A Generic High-Fidelity Fixed-Ratio Lossy Compression Framework for Scientific Floating-point Data

    Underwood, R., Di, S., Calhoun, J. C. & Cappello, F., May 2020, Proceedings - 2020 IEEE 34th International Parallel and Distributed Processing Symposium, IPDPS 2020. Institute of Electrical and Electronics Engineers Inc., p. 567-577 11 p. 9139812. (Proceedings - 2020 IEEE 34th International Parallel and Distributed Processing Symposium, IPDPS 2020).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Optimizing asynchronous multi-level checkpoint/restart configurations with machine learning

    Dey, T., Sato, K., Nicolae, B., Guo, J., Domke, J., Yu, W., Cappello, F. & Mohror, K., May 2020, Proceedings - 2020 IEEE 34th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2020. Institute of Electrical and Electronics Engineers Inc., p. 1036-1043 8 p. 9150452. (Proceedings - 2020 IEEE 34th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2020).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • SDRBench: Scientific Data Reduction Benchmark for Lossy Compressors

    Zhao, K., Di, S., Lian, X., Li, S., Tao, D., Bessac, J., Chen, Z. & Cappello, F., Dec 10 2020, Proceedings - 2020 IEEE International Conference on Big Data, Big Data 2020. Wu, X., Jermaine, C., Xiong, L., Hu, X. T., Kotevska, O., Lu, S., Xu, W., Aluru, S., Zhai, C., Al-Masri, E., Chen, Z. & Saltz, J. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 2716-2724 9 p. 9378449. (Proceedings - 2020 IEEE International Conference on Big Data, Big Data 2020).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Significantly Improving Lossy Compression for HPC Datasets with Second-Order Prediction and Parameter Optimization

    Zhao, K., Di, S., Liang, X., Li, S., Tao, D., Chen, Z. & Cappello, F., Jun 23 2020, HPDC 2020 - Proceedings of the 29th International Symposium on High-Performance Parallel and Distributed Computing. Association for Computing Machinery, p. 89-100 12 p. (HPDC 2020 - Proceedings of the 29th International Symposium on High-Performance Parallel and Distributed Computing).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • Toward Feature-Preserving 2D and 3D Vector Field Compression

    Liang, X., Guo, H., Di, S., Cappello, F., Raj, M., Liu, C., Ono, K., Chen, Z. & Peterka, T., Jun 2020, 2020 IEEE Pacific Visualization Symposium, PacificVis 2020 - Proceedings. Beck, F., Seo, J. & Wang, C. (eds.). IEEE Computer Society, p. 81-90 10 p. 9086223. (IEEE Pacific Visualization Symposium; vol. 2020-June).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression

    Li, S., Di, S., Zhao, K., Liang, X., Chen, Z. & Cappello, F., Sep 2020, Proceedings - 2020 IEEE International Conference on Cluster Computing, CLUSTER 2020. Institute of Electrical and Electronics Engineers Inc., p. 326-336 11 p. 9229628. (Proceedings - IEEE International Conference on Cluster Computing, ICCC; vol. 2020-September).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Wavesz: A hardware-algorithm co-design of efficient lossy compression for scientific data

    Tian, J., Di, S., Zhang, C., Liang, X., Jin, S., Cheng, D., Tao, D. & Cappello, F., Feb 19 2020, PPoPP 2020 - Proceedings of the 2020 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. Association for Computing Machinery, p. 74-88 15 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2019

    Accelerating Relative-error Bounded Lossy Compression for HPC datasets with Precomputation-Based Mechanisms

    Zou, X., Lu, T., Xia, W., Wang, X., Zhang, W., Di, S., Tao, D. & Cappello, F., May 2019, Proceedings - 2019 35th Symposium on Mass Storage Systems and Technologies, MSST 2019. IEEE Computer Society, p. 65-78 14 p. 8890088. (IEEE Symposium on Mass Storage Systems and Technologies; vol. 2019-May).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Analyzing the performance and accuracy of lossy checkpointing on sub-iteration of NWChem

    Reza, T., Keipert, K., Di, S., Liang, X., Calhoun, J. & Cappello, F., Nov 2019, Proceedings of DRBSD-5 2019: 5th International Workshop on Data Analysis and Reduction for Big Scientific Data - Held in conjunction with SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis. Institute of Electrical and Electronics Engineers Inc., p. 23-27 5 p. 8955115. (Proceedings of DRBSD-5 2019: 5th International Workshop on Data Analysis and Reduction for Big Scientific Data - Held in conjunction with SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Characterizing and Understanding HPC Job Failures over the 2K-Day Life of IBM BlueGene/Q System

    Di, S., Guo, H., Pershey, E., Snir, M. & Cappello, F., Jun 2019, Proceedings - 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2019. Institute of Electrical and Electronics Engineers Inc., p. 473-484 12 p. 8809553. (Proceedings - 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2019).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • DeepSZ: A novel framework to compress deep neural networks by using error-bounded lossy compression

    Jin, S., Di, S., Liang, X., Tian, J., Tao, D. & Cappello, F., Jun 17 2019, HPDC 2019- Proceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing. Association for Computing Machinery, p. 159-170 12 p. (HPDC 2019- Proceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Error-Controlled Lossy Compression Optimized for High Compression Ratios of Scientific Datasets

    Liang, X., Di, S., Tao, D., Li, S., Li, S., Guo, H., Chen, Z. & Cappello, F., Jan 22 2019, Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018. Song, Y., Liu, B., Lee, K., Abe, N., Pu, C., Qiao, M., Ahmed, N., Kossmann, D., Saltz, J., Tang, J., He, J., Liu, H. & Hu, X. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 438-447 10 p. 8622520. (Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • FT-iSort: Efficient fault tolerance for introsort

    Li, S., Li, H., Liang, X., Chen, J., Giem, E., Ouyang, K., Zhao, K., Di, S., Cappello, F. & Chen, Z., Nov 17 2019, Proceedings of SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Computer Society, a71. (International Conference for High Performance Computing, Networking, Storage and Analysis, SC).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • Full-state quantum circuit simulation by using data compression

    Wu, X. C., Di, S., Dasgupta, E. M., Cappello, F., Finkel, H., Alexeev, Y. & Chong, F. T., Nov 17 2019, Proceedings of SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Computer Society, a80. (International Conference for High Performance Computing, Networking, Storage and Analysis, SC).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • Improving Performance of Data Dumping with Lossy Compression for Scientific Simulation

    Liang, X., Di, S., Tao, D., Li, S., Nicolae, B., Chen, Z. & Cappello, F., Sep 2019, Proceedings - 2019 IEEE International Conference on Cluster Computing, CLUSTER 2019. Institute of Electrical and Electronics Engineers Inc., 8891037. (Proceedings - IEEE International Conference on Cluster Computing, ICCC; vol. 2019-September).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Optimizing Lossy Compression with Adjacent Snapshots for N-body Simulation Data

    Li, S., Di, S., Liang, X., Chen, Z. & Cappello, F., Jan 22 2019, Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018. Song, Y., Liu, B., Lee, K., Abe, N., Pu, C., Qiao, M., Ahmed, N., Kossmann, D., Saltz, J., Tang, J., He, J., Liu, H. & Hu, X. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 428-437 10 p. 8622101. (Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Significantly improving lossy compression quality based on an optimized hybrid prediction model

    Liang, X., Di, S., Li, S., Tao, D., Nicolae, B., Chen, Z. & Cappello, F., Nov 17 2019, Proceedings of SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Computer Society, a33. (International Conference for High Performance Computing, Networking, Storage and Analysis, SC).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • Towards Portable Online Prediction of Network Utilization Using MPI-Level Monitoring

    Tseng, S. M., Nicolae, B., Bosilca, G., Jeannot, E., Chandramowlishwaran, A. & Cappello, F., 2019, Euro-Par 2019: Parallel Processing - 25th International Conference on Parallel and Distributed Computing, Proceedings. Yahyapour, R. (ed.). Springer, p. 47-60 14 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 11725 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Veloc: Towards high performance adaptive asynchronous checkpointing at large scale

    Nicolae, B., Moody, A., Gonsiorowski, E., Mohror, K. & Cappello, F., May 2019, Proceedings - 2019 IEEE 33rd International Parallel and Distributed Processing Symposium, IPDPS 2019. Institute of Electrical and Electronics Engineers Inc., p. 911-920 10 p. 8821049. (Proceedings - 2019 IEEE 33rd International Parallel and Distributed Processing Symposium, IPDPS 2019).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2018

    An Efficient Transformation Scheme for Lossy Data Compression with Point-Wise Relative Error Bound

    Liang, X., Di, S., Tao, D., Chen, Z. & Cappello, F., Oct 29 2018, Proceedings - 2018 IEEE International Conference on Cluster Computing, CLUSTER 2018. Institute of Electrical and Electronics Engineers Inc., p. 179-189 11 p. 8514879. (Proceedings - IEEE International Conference on Cluster Computing, ICCC; vol. 2018-September).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Coupling exascale multiphysics applications: Methods and lessons learned

    Choi, J. Y., Chang, C. S., Dominski, J., Klasky, S., Merlo, G., Suchyta, E., Ainsworth, M., Allen, B., Cappello, F., Churchill, M., Davis, P., Di, S., Eisenhauer, G., Ethier, S., Foster, I., Geveci, B., Guo, H., Huck, K., Jenko, F. & Kim, M. & 17 others, Kress, J., Ku, S. H., Liu, Q., Logan, J., Malony, A., Mehta, K., Moreland, K., Munson, T., Parashar, M., Peterka, T., Podhorszki, N., Pugmire, D., Tugluk, O., Wang, R., Whitney, B., Wolf, M. & Wood, C., Dec 24 2018, Proceedings - IEEE 14th International Conference on eScience, e-Science 2018. Institute of Electrical and Electronics Engineers Inc., p. 442-452 11 p. 8588752. (Proceedings - IEEE 14th International Conference on eScience, e-Science 2018).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Evaluation of a floating-point intensive kernel on FPGA: A case study of geodesic distance kernel

    Jin, Z., Finkel, H., Yoshii, K. & Cappello, F., 2018, Euro-Par 2017: Parallel Processing Workshops - Euro-Par 2017 International Workshops. Heras, D. B. & Bouge, L. (eds.). Springer, p. 664-675 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 10659 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Fixed-PSNR Lossy Compression for Scientific Data

    Tao, D., Di, S., Liang, X., Chen, Z. & Cappello, F., Oct 29 2018, Proceedings - 2018 IEEE International Conference on Cluster Computing, CLUSTER 2018. Institute of Electrical and Electronics Engineers Inc., p. 314-318 5 p. 8514891. (Proceedings - IEEE International Conference on Cluster Computing, ICCC; vol. 2018-September).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Improving performance of iterative methods by lossy checkponting

    Tao, D., Di, S., Liang, X., Chen, Z. & Cappello, F., Jun 11 2018, HPDC 2018 - Proceedings of the 2018 International Symposium on High-Performance Parallel and Distributed Computing. Association for Computing Machinery, p. 52-65 14 p. (HPDC 2018 - Proceedings of the 2018 International Symposium on High-Performance Parallel and Distributed Computing).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • Neural Network Based Silent Error Detector

    Wang, C., Dryden, N., Cappello, F. & Snir, M., Oct 29 2018, Proceedings - 2018 IEEE International Conference on Cluster Computing, CLUSTER 2018. Institute of Electrical and Electronics Engineers Inc., p. 168-178 11 p. 8514878. (Proceedings - IEEE International Conference on Cluster Computing, ICCC; vol. 2018-September).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Parallel Partial Reduction for Large-Scale Data Analysis and Visualization

    He, W., Guo, H., Peterka, T., Di, S., Cappello, F. & Shen, H. W., Oct 2018, 2018 IEEE 8th Symposium on Large Data Analysis and Visualization, LDAV 2018. Institute of Electrical and Electronics Engineers Inc., p. 45-55 11 p. 8739165. (2018 IEEE 8th Symposium on Large Data Analysis and Visualization, LDAV 2018).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • PaSTRI: Error-Bounded Lossy Compression for Two-Electron Integrals in Quantum Chemistry

    Gok, A. M., Di, S., Alexeev, Y., Tao, D., Mironov, V., Liang, X. & Cappello, F., Oct 29 2018, Proceedings - 2018 IEEE International Conference on Cluster Computing, CLUSTER 2018. Institute of Electrical and Electronics Engineers Inc., p. 1-11 11 p. 8514854. (Proceedings - IEEE International Conference on Cluster Computing, ICCC; vol. 2018-September).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Understanding and improving the trust in results of numerical simulations and scientific data analytics

    Cappello, F., Gupta, R., Di, S., Constantinescu, E., Peterka, T. & Wild, S. M., 2018, Euro-Par 2017: Parallel Processing Workshops - Euro-Par 2017 International Workshops. Heras, D. B. & Bouge, L. (eds.). Springer, p. 545-556 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 10659 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2017

    Computing Just What You Need: Online Data Analysis and Reduction at Extreme Scales

    Foster, I., Ainsworth, M., Allen, B., Bessac, J., Cappello, F., Choi, J. Y., Constantinescu, E., Davis, P. E., Di, S., Di, W., Guo, H., Klasky, S., Van Dam, K. K., Kurc, T., Liu, Q., Malik, A., Mehta, K., Mueller, K., Munson, T. & Ostouchov, G. & 10 others, Parashar, M., Peterka, T., Pouchard, L., Tao, D., Tugluk, O., Wild, S., Wolf, M., Wozniak, J. M., Xu, W. & Yoo, S., 2017, Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Proceedings. Rivera, F. F., Pena, T. F. & Cabaleiro, J. C. (eds.). Springer, p. 3-19 17 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 10417 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Detection of Silent Data Corruption in Adaptive Numerical Integration Solvers

    Guhur, P. L., Constantinescu, E., Ghosh, D., Peterka, T. & Cappello, F., Sep 22 2017, Proceedings - 2017 IEEE International Conference on Cluster Computing, CLUSTER 2017. Institute of Electrical and Electronics Engineers Inc., p. 592-602 11 p. 8048974. (Proceedings - IEEE International Conference on Cluster Computing, ICCC; vol. 2017-September).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Evaluating irregular memory access on OpenCL FPGA platforms: A case study with XSBench

    Luo, Y., Wen, X., Yoshii, K., Ogrenci-Memik, S., Memik, G., Finkel, H. & Cappello, F., Oct 2 2017, 2017 27th International Conference on Field Programmable Logic and Applications, FPL 2017. Gohringer, D., Stroobandt, D., Mentens, N., Santambrogio, M. & Nurmi, J. (eds.). Institute of Electrical and Electronics Engineers Inc., 8056827. (2017 27th International Conference on Field Programmable Logic and Applications, FPL 2017).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Exploration of pattern-matching techniques for lossy compression on cosmology simulation data sets

    Tao, D., Di, S., Chen, Z. & Cappello, F., 2017, High Performance Computing - ISC High Performance 2017 International Workshops, DRBSD, ExaComm, HCPM, HPC-IODC, IWOPH, IXPUG, P^3MA, VHPC, Visualization at Scale, WOPSSS, Revised Selected Papers. Yokota, R., Kunkel, J. M., Taufer, M. & Shalf, J. (eds.). Springer, p. 43-54 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 10524 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Identifying the right replication level to detect and correct silent errors at scale

    Benoit, A., Raghavan, P., Cavelan, A., Robert, Y., Cappello, F. & Sun, H., Jun 26 2017, FTXS 2017 - Proceedings of the 2017 Workshop on Fault-Tolerance for HPC at Extreme Scale, co-located with HPDC 2017. Association for Computing Machinery, p. 31-38 8 p. (FTXS 2017 - Proceedings of the 2017 Workshop on Fault-Tolerance for HPC at Extreme Scale, co-located with HPDC 2017).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • In-depth exploration of single-snapshot lossy compression techniques for N-body simulations

    Tao, D., Di, S., Chen, Z. & Cappello, F., Jul 1 2017, Proceedings - 2017 IEEE International Conference on Big Data, Big Data 2017. Nie, J.-Y., Obradovic, Z., Suzumura, T., Ghosh, R., Nambiar, R., Wang, C., Zang, H., Baeza-Yates, R., Baeza-Yates, R., Hu, X., Kepner, J., Cuzzocrea, A., Tang, J. & Toyoda, M. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 486-493 8 p. (Proceedings - 2017 IEEE International Conference on Big Data, Big Data 2017; vol. 2018-January).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • LOGAIDER: A Tool for Mining Potential Correlations of HPC Log Events

    Di, S., Gupta, R., Snir, M., Pershey, E. & Cappello, F., Jul 10 2017, Proceedings - 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017. Institute of Electrical and Electronics Engineers Inc., p. 442-451 10 p. 7973730. (Proceedings - 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • MACORD: Online adaptive machine learning framework for silent error detection

    Subasi, O., Di, S., Balaprakash, P., Unsal, O., Labarta, J., Cristal, A., Krishnamoorthy, S. & Cappello, F., Sep 22 2017, Proceedings - 2017 IEEE International Conference on Cluster Computing, CLUSTER 2017. Institute of Electrical and Electronics Engineers Inc., p. 717-724 8 p. 8049008. (Proceedings - IEEE International Conference on Cluster Computing, ICCC; vol. 2017-September).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Significantly Improving Lossy Compression for Scientific Data Sets Based on Multidimensional Prediction and Error-Controlled Quantization

    Tao, D., Di, S., Chen, Z. & Cappello, F., Jun 30 2017, Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium, IPDPS 2017. Institute of Electrical and Electronics Engineers Inc., p. 1129-1139 11 p. 7967203. (Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium, IPDPS 2017).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Welcome message from the program chairs

    Garcia-Blas, J., Fox, G. C. & Cappello, F., Jul 10 2017, 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID). IEEE Computer Society, p. xviii-xviii

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2016

    DSN 2016 Tutorial: Resilience for Scientific Computing: From Theory to Practice

    Cappello, F. & Bosilca, G., Sep 22 2016, Proceedings - 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN-W 2016. Institute of Electrical and Electronics Engineers Inc., p. 267 1 p. 7575396. (Proceedings - 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN-W 2016).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Exploring partial replication to improve lightweight silent data corruption detection for HPC applications

    Berrocal, E., Bautista-Gomez, L., Di, S., Lan, Z. & Cappello, F., 2016, Parallel Processing - 22nd International Conference on Parallel and Distributed Computing, Euro-Par 2016, Proceedings. Dutot, P.-F. & Trystram, D. (eds.). Springer, p. 419-430 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 9833 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Fast Error-Bounded Lossy HPC Data Compression with SZ

    Di, S. & Cappello, F., Jul 18 2016, Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016. Institute of Electrical and Electronics Engineers Inc., p. 730-739 10 p. 7516069. (Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Lightweight and accurate silent data corruption detection in ordinary differential equation solvers

    Guhur, P. L., Zhang, H., Peterka, T., Constantinescu, E. & Cappello, F., 2016, Parallel Processing - 22nd International Conference on Parallel and Distributed Computing, Euro-Par 2016, Proceedings. Dutot, P.-F. & Trystram, D. (eds.). Springer, p. 644-656 13 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 9833 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Reducing Waste in Extreme Scale Systems through Introspective Analysis

    Bautista-Gomez, L., Gainaru, A., Perarnau, S., Tiwari, D., Gupta, S., Engelmann, C., Cappello, F. & Snir, M., Jul 18 2016, Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016. Institute of Electrical and Electronics Engineers Inc., p. 212-221 10 p. 7516017. (Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Spatial Support Vector Regression to Detect Silent Errors in the Exascale Era

    Subasi, O., Di, S., Bautista-Gomez, L., Balaprakash, P., Unsal, O., Labarta, J., Cristal, A. & Cappello, F., Jul 18 2016, Proceedings - 2016 16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2016. Institute of Electrical and Electronics Engineers Inc., p. 413-424 12 p. 7515717. (Proceedings - 2016 16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2016).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2015

    Addressing the last roadblock for message logging in HPC: Alleviating the memory requirement using dedicated resources

    Martsinkevich, T., Ropars, T. & Cappello, F., 2015, Euro-Par 2015: Parallel Processing Workshops - Euro-Par 2015 International Workshops, Revised Selected Papers. Hunold, S., Weidendorfer, J., Gimenez, D., Ricci, L., Lankes, S., Costan, A., Varbanescu, A. L., Scott, S. L., Requena, M. E. G., Scarano, V., Iosup, A. & Alexander, M. (eds.). Springer, p. 644-655 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 9523).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Analysis of the tradeoffs between energy and run time for multilevel checkpointing

    Balaprakash, P., Gomez, L. A. B., Bouguerra, M. S., Wild, S. M., Cappello, F. & Hovland, P. D., 2015, High Performance Computing Systems: Performance Modeling, Benchmarking, and Simulation - 5th International Workshop, PMBS 2014, Revised Selected Papers. Hammond, S. D., Jarvis, S. A. & Wright, S. A. (eds.). Springer, p. 249-263 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 8966).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution