Find Research Outputs

Search in all content

Filters for Research & Scholarship

Search concepts
Selected Filters

Publication Year

  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011

Author

  • Laxmikant V Kale
2013

Acceleration of an asynchronous message driven programming paradigm on IBM Blue Gene/Q

Kumar, S., Sun, Y. & Kalé, L. V., Oct 7 2013, p. 689-699. 11 p.

Research output: Contribution to conferencePaper

2019

Fine-Grained Energy Efficiency Using Per-Core DVFS with an Adaptive Runtime System

Acun, B., Chandrasekar, K. & Kale, L. V., Oct 2019, 2019 10th International Green and Sustainable Computing Conference, IGSC 2019. Institute of Electrical and Electronics Engineers Inc., 8957174. (2019 10th International Green and Sustainable Computing Conference, IGSC 2019).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2014

Towards realizing the potential of malleable jobs

Gupta, A., Acun, B., Sarood, O. & Kale, L. V., Jan 1 2014, 2014 21st International Conference on High Performance Computing, HiPC 2014. Institute of Electrical and Electronics Engineers Inc., 7116905. (2014 21st International Conference on High Performance Computing, HiPC 2014).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2019

Visualizing, Measuring, and Tuning Adaptive MPI Parameters

Diener, M., White, S. & Kale, L. V., Jan 1 2019, Programming and Performance Visualization Tools - International Workshops, ESPT 2017 and VPA 2017, Revised Selected Papers. Bhatele, A., Boehme, D., Levine, J. A., Malony, A. D. & Schulz, M. (eds.). Springer-Verlag Berlin Heidelberg, p. 219-230 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 11027 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2017

Improving the memory access locality of hybrid MPI applications

Diener, M., White, S., Kale, L. V., Campbell, M., Bodony, D. J. & Freund, J. B., Sep 25 2017, EuroMPI 2017 - Proceedings of the 24th European MPI Users� Group Meeting. Association for Computing Machinery, a11. (ACM International Conference Proceeding Series).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2016

Towards PDES in a message-driven paradigm: A preliminary case study using Charm++

Mikida, E., Jain, N., Kale, L., Gonsiorowski, E., Carothers, C. D., Barnes, P. D. & Jefferson, D., May 15 2016, SIGSIM-PADS 2016 - Proceedings of the 2016 Annual ACM Conference on Principles of Advanced Discrete Simulation. Association for Computing Machinery, Inc, p. 99-110 12 p. (SIGSIM-PADS 2016 - Proceedings of the 2016 Annual ACM Conference on Principles of Advanced Discrete Simulation).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2012

"Cool" Load balancing for high performance computing data centers

Sarood, O., Miller, P., Totoni, E. & Kalé, L. V., Nov 26 2012, In : IEEE Transactions on Computers. 61, 12, p. 1752-1764 13 p., 6226358.

Research output: Contribution to journalArticle

2017

A memory heterogeneity-aware runtime system for bandwidth-sensitive hpc applications

Chandrasekar, K., Ni, X. & Kale, L. V., Jun 30 2017, Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017. Institute of Electrical and Electronics Engineers Inc., p. 1293-1300 8 p. 7965187. (Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2019

Accelerating Scientific Applications on Heterogeneous Systems with HybridOMP

Diener, M., Bodony, D. J. & Kale, L., Jan 1 2019, High Performance Computing for Computational Science – VECPAR 2018 - 13th International Conference, Revised Selected Papers. Senger, H., Marques, O., de Brito, T. P., Iope, R., Stanzani, S., Gil-Costa, V. & Garcia, R. (eds.). Springer-Verlag Berlin Heidelberg, p. 174-187 14 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 11333 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2011

Debugging large scale applications in a virtualized environment

Gioachin, F., Zheng, G. & Kalé, L. V., Mar 18 2011, Languages and Compilers for Parallel Computing - 23rd International Workshop, LCPC 2010, Revised Selected Papers. p. 199-214 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6548 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2014

TRAM: Optimizing fine-grained communication with topological routing and aggregation of messages

Wesolowski, L., Venkataraman, R., Gupta, A., Yeom, J. S., Bisset, K., Sun, Y., Jetley, P., Quinn, T. R. & Kalé, L. V., Nov 13 2014, In : Proceedings of the International Conference on Parallel Processing. 2014-November, November, p. 211-220 10 p., 6957230.

Research output: Contribution to journalConference article

2020

Optimizing point-to-point communication between adaptive MPI endpoints in shared memory

White, S. & Kale, L. V., Feb 10 2020, In : Concurrency Computation. 32, 3, e4467.

Research output: Contribution to journalArticle

2011

ACM SRC Poster: Optimizing all-to-all algorithm for percs network using simulation

Totoni, E. & Kale, L. V., Dec 1 2011, SC'11 - Proceedings of the 2011 High Performance Computing Networking, Storage and Analysis Companion, Co-located with SC'11. p. 123-124 2 p. (SC'11 - Proceedings of the 2011 High Performance Computing Networking, Storage and Analysis Companion, Co-located with SC'11).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2012

Collectives on two-tier direct networks

Jain, N., Lau, J. & Kale, L., Oct 24 2012, Recent Advances in the Message Passing Interface - 19th European MPI Users' Group Meeting, EuroMPI 2012, Proceedings. p. 67-77 11 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 7490 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2018

CharmPy: A Python Parallel Programming Model

Galvez, J. J., Senthil, K. & Kale, L., Oct 29 2018, Proceedings - 2018 IEEE International Conference on Cluster Computing, CLUSTER 2018. Institute of Electrical and Electronics Engineers Inc., p. 423-433 11 p. 8514902. (Proceedings - IEEE International Conference on Cluster Computing, ICCC; vol. 2018-September).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2011

Automatic handling of global variables for multi-threaded MPI programs

Zheng, G., Negara, S., Mendes, C. L., Kale, L. V. & Rodrigues, E. R., Dec 1 2011, Proceedings - 2011 17th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2011. p. 220-227 8 p. 6121281. (Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2013

The who, what, why and how of high performance computing applications in the cloud

Gupta, A., Kale, L. V., Gioachin, F., March, V., Suen, C. H., Lee, B. S., Faraboschi, P., Kaufmann, R. & Milojicic, D., Aug 9 2013, In : HP Laboratories Technical Report. 49

Research output: Contribution to journalArticle

2012

Scalable algorithms for distributed-memory adaptive mesh refinement

Langer, A., Lifflander, J., Miller, P., Pan, K. C., Kale, L. V. & Ricker, P. M., Dec 1 2012, In : Proceedings - Symposium on Computer Architecture and High Performance Computing. p. 100-107 8 p., 6374777.

Research output: Contribution to journalConference article

Incorporating dynamic communication patterns in a static dataflow notation

Jetley, P., Keshan, A. & Kale, L. V., Jan 1 2012, Proceedings - 2012 2nd Workshop on Data-Flow Execution Models for Extreme Scale Computing, DFM 2012. IEEE Computer Society, p. 27-35 9 p. 6612857. (Proceedings - 2012 2nd Workshop on Data-Flow Execution Models for Extreme Scale Computing, DFM 2012).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2014

Energy profile of rollback-recovery strategies in high performance computing

Meneses, E., Sarood, O. & Kalé, L. V., Oct 1 2014, In : Parallel Computing. 40, 9, p. 536-547 12 p.

Research output: Contribution to journalArticle

2017

Automated Load Balancer Selection Based on Application Characteristics

Menon, H., Chandrasekar, K. & Kale, L. V., Jan 26 2017, In : ACM SIGPLAN Notices. 52, 8, p. 447-448 2 p.

Research output: Contribution to journalArticle

Open Access
2014

Optimizing the performance of parallel applications on a 5D torus via task mapping

Bhatele, A., Jain, N., Isaacs, K. E., Buch, R., Gamblin, T., Langer, S. H. & Kale, L. V., Jan 1 2014, 2014 21st International Conference on High Performance Computing, HiPC 2014. Institute of Electrical and Electronics Engineers Inc., 7116706. (2014 21st International Conference on High Performance Computing, HiPC 2014).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2015

Scalable Asynchronous Contact Mechanics Using Charm++

Ni, X., Kale, L. V. & Tamstorf, R., Jul 17 2015, Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015. Institute of Electrical and Electronics Engineers Inc., p. 677-686 10 p. 7161555. (Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2016

Evaluating and Improving the Performance and Scheduling of HPC Applications in Cloud

Gupta, A., Faraboschi, P., Gioachin, F., Kale, L. V., Kaufmann, R., Lee, B. S., March, V., Milojicic, D. & Suen, C. H., Jul 1 2016, In : IEEE Transactions on Cloud Computing. 4, 3, p. 307-321 15 p., 6858018.

Research output: Contribution to journalArticle

2019

Scalable GW software for quasiparticle properties using OpenAtom

Kim, M., Mandal, S., Mikida, E., Chandrasekar, K., Bohm, E., Jain, N., Li, Q., Kanakagiri, R., Martyna, G. J., Kale, L. & Ismail-Beigi, S., Nov 2019, In : Computer Physics Communications. 244, p. 427-441 15 p.

Research output: Contribution to journalArticle

2015

Using migratable objects to enhance fault tolerance schemes in supercomputers

Meneses, E., Ni, X., Zheng, G., Mendes, C. L. & Kalé, L. V., Jul 1 2015, In : IEEE Transactions on Parallel and Distributed Systems. 26, 7, p. 2061-2074 14 p., 6862914.

Research output: Contribution to journalArticle

2018

Adaptive methods for irregular parallel discrete event simulation workloads

Mikida, E. & Kale, L., May 14 2018, SIGSIM-PADS 2018 - Proceedings of the 2018 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation. Association for Computing Machinery, Inc, p. 189-200 12 p. (SIGSIM-PADS 2018 - Proceedings of the 2018 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2012

A scalable double in-memory checkpoint and restart scheme towards exascale

Zheng, G., Xiang, N. & Kale, L. V., Dec 1 2012, 2012 IEEE/IFIP 42nd International Conference on Dependable Systems and Networks Workshops, DSN-W 2012. 6264677. (Proceedings of the International Conference on Dependable Systems and Networks).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2014

Maximizing Throughput on a Dragonfly Network

Jain, N., Bhatele, A., Ni, X., Wright, N. J. & Kale, L. V., Jan 16 2014, In : International Conference for High Performance Computing, Networking, Storage and Analysis, SC. 2015-January, January, p. 336-347 12 p., 7013015.

Research output: Contribution to journalConference article

2015

Charm++ and MPI: Combining the Best of Both Worlds

Jain, N., Bhatele, A., Yeom, J. S., Adams, M. F., Miniati, F., Mei, C. & Kale, L. V., Jul 17 2015, Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015. Institute of Electrical and Electronics Engineers Inc., p. 655-664 10 p. 7161553. (Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2012

Hiding checkpoint overhead in HPC applications with a semi-blocking algorithm

Ni, X., Meneses, E. & Kalé, L. V., Jan 1 2012, Proceedings - 2012 IEEE International Conference on Cluster Computing, CLUSTER 2012. IEEE Computer Society, p. 364-372 9 p. 6337799. (Proceedings - 2012 IEEE International Conference on Cluster Computing, CLUSTER 2012).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2013

ACR: Automatic checkpoint/restart for soft and hard error protection

Ni, X., Meneses, E., Jain, N. & Kale, L. V., Jan 1 2013, Proceedings of SC 2013: The International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Computer Society, 7. (International Conference for High Performance Computing, Networking, Storage and Analysis, SC).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2012

Using shared arrays in message-driven parallel programs

Miller, P., Becker, A. & Kalé, L., Jan 1 2012, In : Parallel Computing. 38, 1-2, p. 66-74 9 p.

Research output: Contribution to journalArticle

2011

Enabling massive parallelism for stochastic optimization problems

Langer, A., Venkataraman, R., Gupta, G., Kale, L., Palekar, U. & Baker, S., Dec 1 2011, SC'11 - Proceedings of the 2011 High Performance Computing Networking, Storage and Analysis Companion, Co-located with SC'11. p. 89-90 2 p. (SC'11 - Proceedings of the 2011 High Performance Computing Networking, Storage and Analysis Companion, Co-located with SC'11).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2017

NAMD: Scalable molecular dynamics based on the charm++ parallel runtime system

Acun, B., Buch, R., Kale, L. V. & Phillips, J. C., Jan 1 2017, Exascale Scientific Applications: Scalability and Performance Portability. Straatsma, T. P., Antypas, K. B. & Williams, T. J. (eds.). CRC Press, p. 119-143 25 p.

Research output: Chapter in Book/Report/Conference proceedingChapter

2016

Power, Reliability, and Performance: One System to Rule them All

Acun, B., Langer, A., Meneses, E., Menon, H., Sarood, O., Totoni, E. & Kalé, L. V., Oct 2016, Computer, 49, 10, p. 30-37 8 p.

Research output: Contribution to specialist publicationArticle

2013

Adoption protocols for fanout-optimal fault-tolerant termination detection

Lifflander, J., Miller, P. & Kale, L., 2013, PPoPP 2013 - Proceedings of the 2013 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 13-22 10 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2014

Parallel Programming with Migratable Objects: Charm++ in Practice

Acun, B., Gupta, A., Jain, N., Langer, A., Menon, H., Mikida, E., Ni, X., Robson, M., Sun, Y., Totoni, E., Wesolowski, L. & Kale, L., Jan 16 2014, In : International Conference for High Performance Computing, Networking, Storage and Analysis, SC. 2015-January, January, p. 647-658 12 p., 7013040.

Research output: Contribution to journalConference article

2012

Exploring the performance and mapping of HPC applications to platforms in the cloud

Gupta, A., Kalé, L. V., Gioachin, F., March, V., Suen, C. H., Lee, B. S., Faraboschi, P., Kaufmann, R. & Milojicic, D., Jul 23 2012, HPDC '12 - Proceedings of the 21st ACM Symposium on High-Performance Parallel and Distributed Computing. p. 121-122 2 p. (HPDC '12 - Proceedings of the 21st ACM Symposium on High-Performance Parallel and Distributed Computing).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2014

Overcoming the scalability challenges of epidemic simulations on blue waters

Yeom, J. S., Bhatele, A., Bisset, K., Bohm, E., Gupta, A., Kale, L. V., Marathe, M., Nikolopoulos, D. S., Schulz, M. & Wesolowski, L., 2014, Proceedings - IEEE 28th International Parallel and Distributed Processing Symposium, IPDPS 2014. IEEE Computer Society, p. 755-764 10 p. 6877307. (Proceedings of the International Parallel and Distributed Processing Symposium, IPDPS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2012

Optimizing VM placement for HPC in the cloud

Gupta, A., Milojicic, D. & Kalé, L. V., Oct 30 2012, FederatedClouds'12 - Proceedings of the 2012 Workshop on Cloud Services, Federation, and the 8th Open Cirrus Summit, Co-located with ICAC'12. p. 1-6 6 p. (FederatedClouds'12 - Proceedings of the 2012 Workshop on Cloud Services, Federation, and the 8th Open Cirrus Summit, Co-located with ICAC'12).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2011

Automatic MPI to AMPI program transformation using photran

Negara, S., Zheng, G., Pan, K. C., Negara, N., Johnson, R. E., Kalé, L. V. & Ricker, P. M., Aug 19 2011, Euro-Par 2010 - Parallel Processing Workshops: HeteroPar, HPPC, HiBB, CoreGrid, UCHPC, HPCF, PROPER, CCPI, VHPC, Revised Selected Papers. p. 531-539 9 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6586 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2012

Hybrid static/dynamic scheduling for already optimized dense matrix factorization

Donfack, S., Grigori, L., Gropp, W. D. & Kale, V., 2012, Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS 2012. p. 496-507 12 p. 6267853. (Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS 2012).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2016

A fault-tolerance protocol for parallel applications with communication imbalance

Meneses, E. & Kale, L. V., Jan 12 2016, Proceedings - IEEE 27th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2015. IEEE Computer Society, p. 162-169 8 p. 7379847. (Proceedings - Symposium on Computer Architecture and High Performance Computing; vol. 2016-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Preface

Kale, L. V. & Bhatele, A., Apr 19 2016, Parallel Science and Engineering Applications: The Charm++ Approach. CRC Press, p. xxiii-xxxi

Research output: Chapter in Book/Report/Conference proceedingForeword/postscript

FlipBack: Automatic Targeted Protection against Silent Data Corruption

Ni, X. & Kale, L. V., Jul 2 2016, Proceedings of SC 2016: The International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Computer Society, p. 335-346 12 p. 7877107. (International Conference for High Performance Computing, Networking, Storage and Analysis, SC; vol. 0).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2012

Simulating the spread of infectious disease over large realistic social networks using Charm++

Bisset, K. R., Aji, A. M., Bohm, E., Kale, L. V., Kamal, T., Marathe, M. V. & Yeom, J. S., Oct 18 2012, Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2012. p. 507-518 12 p. 6270685. (Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2012).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Optimizing fine-grained communication in a biomolecular simulation application on Cray XK6

Sun, Y., Zheng, G., Mei, C., Bohm, E. J., Phillips, J. C., Kalé, L. V. & Jones, T. R., Dec 1 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2012. 6468525. (International Conference for High Performance Computing, Networking, Storage and Analysis, SC).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2020

Unified data movement for offloading charm++ applications

Diener, M. & Kale, L., May 2020, Proceedings - 2020 IEEE 34th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2020. Institute of Electrical and Electronics Engineers Inc., p. 471-474 4 p. 9150456. (Proceedings - 2020 IEEE 34th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2020).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2011

Evaluation of simple causal message logging for large-scale fault tolerant HPC systems

Meneses, E., Bronevetsky, G. & Kalé, L. V., 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing, Workshops and Phd Forum, IPDPSW 2011. p. 1533-1540 8 p. 6009012. (IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum).

Research output: Chapter in Book/Report/Conference proceedingConference contribution