Search results

  • 2011

    Performance modeling for systematic performance tuning

    Hoefler, T., Gropp, W., Snir, M. & Kramer, W., 2011, State of the Practice Reports, SC'11. 6. (State of the Practice Reports, SC'11).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Scalable memory use in MPI: A case study with MPICH2

    Goodell, D., Gropp, W., Zhao, X. & Thakur, R., 2011, Recent Advances in the Message Passing Interface - 18th European MPI Users' Group Meeting, EuroMPI 2011, Proceedings. p. 140-149 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6960 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • The international exascale software project roadmap

    Dongarra, J., Beckman, P., Moore, T., Aerts, P., Aloisio, G., Andre, J. C., Barkai, D., Berthou, J. Y., Boku, T., Braunschweig, B., Cappello, F., Chapman, B., Xuebin Chi, C., Choudhary, A., Dosanjh, S., Dunning, T., Fiore, S., Geist, A., Gropp, B., Harrison, R., Hereld, M., Heroux, M., Hoisie, A., Hotta, K., Zhong Jin, J., Ishikawa, Y., Johnson, F., Kale, S., Kenway, R., Keyes, D., Kramer, B., Labarta, J., Lichnewsky, A., Lippert, T., Lucas, B., MacCabe, B., Matsuoka, S., Messina, P., Michielse, P., Mohr, B., Mueller, M. S., Nagel, W. E., Nakashima, H., Papka, M. E., Reed, D., Sato, M., Seidel, E., Shalf, J., Skinner, D., Snir, M., Sterling, T., Stevens, R., Streitz, F., Sugar, B., Sumimoto, S., Tang, W., Taylor, J., Thakur, R., Trefethen, A., Valero, M., Van Der Steen, A., Vetter, J., Williams, P., Wisniewski, R. & Yelick, K., Feb 2011, In: International Journal of High Performance Computing Applications. 25, 1, p. 3-60 58 p.

    Research output: Contribution to journal › Article › peer-review

    Open Access
  • Weighted locality-sensitive scheduling for mitigating noise on multi-core clusters

    Kale, V., Bhatele, A. & Gropp, W. D., 2011, 18th International Conference on High Performance Computing, HiPC 2011. IEEE Computer Society, 6152722. (18th International Conference on High Performance Computing, HiPC 2011).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • 2010

    An adaptive performance modeling tool for GPU architectures

    Baghsorkhi, S. S., Delahaye, M., Patel, S. J., Gropp, W. D. & Hwu, W. M. W., May 2010, In: ACM SIGPLAN Notices. 45, 5, p. 105-114 10 p.

    Research output: Contribution to journal › Article › peer-review

  • An adaptive performance modeling tool for GPU architectures

    Baghsorkhi, S. S., Delahaye, M., Patel, S. J., Gropp, W. D. & Hwu, W. M. W., 2010, PPoPP'10 - Proceedings of the 2010 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 105-114 10 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • An introductory exascale feasibility study for FFTs and multigrid

    Gahvari, H. & Gropp, W. D., 2010, Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010. 5470417. (Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • A pipelined algorithm for large, irregular all-gather problems

    Träff, J. L., Ripke, A., Siebert, C., Balaji, P., Thakur, R. & Gropp, W., 2010, In: International Journal of High Performance Computing Applications. 24, 1, p. 58-68 11 p.

    Research output: Contribution to journal › Article › peer-review

  • A scalable MPI_Comm_split algorithm for exascale computing

    Sack, P. & Gropp, W. D., 2010, Recent Advances in the Message Passing Interface - 17th European MPI Users' Group Meeting, EuroMPI 2010, Proceedings. p. 1-10 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6305 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Enabling concurrent multithreaded MPI communication on multicore petascale systems

    Dózsa, G., Kumar, S., Balaji, P., Buntinas, D., Goodell, D., Gropp, W., Ratterman, J. & Thakur, R., 2010, Recent Advances in the Message Passing Interface - 17th European MPI Users' Group Meeting, EuroMPI 2010, Proceedings. p. 11-20 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6305 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Fine-grained multithreading support for hybrid threaded MPI programming

    Balaji, P., Buntinas, D., Goodell, D., Gropp, W. & Thakur, R., 2010, In: International Journal of High Performance Computing Applications. 24, 1, p. 49-57 9 p.

    Research output: Contribution to journal › Article › peer-review

  • Formal methods applied to high-performance computing software design: A case study of MPI one-sided communication-based locking

    Pervez, S., Gopalakrishnan, G., Kirby, R. M., Thakur, R. & Gropp, W., Jan 2010, In: Software - Practice and Experience. 40, 1, p. 23-43 21 p.

    Research output: Contribution to journal › Article › peer-review

    Open Access
  • Load balancing for regular meshes on SMPs with MPI

    Kale, V. & Gropp, W., 2010, Recent Advances in the Message Passing Interface - 17th European MPI Users' Group Meeting, EuroMPI 2010, Proceedings. p. 229-238 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6305 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Minimizing MPI resource contention in multithreaded multicore environments

    Goodell, D., Balaji, P., Buntinas, D., Dózsa, G., Gropp, W., Kumar, S., De Supinski, B. R. & Thakur, R., 2010, Proceedings - 2010 IEEE International Conference on Cluster Computing, Cluster 2010. p. 1-8 8 p. 5600326. (Proceedings - IEEE International Conference on Cluster Computing, ICCC).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • PMI: A scalable parallel process-management interface for extreme-scale systems

    Balaji, P., Buntinas, D., Goodell, D., Gropp, W., Krishna, J., Lusk, E. & Thakur, R., 2010, Recent Advances in the Message Passing Interface - 17th European MPI Users' Group Meeting, EuroMPI 2010, Proceedings. p. 31-41 11 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6305 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Proceedings of the International Conference on Parallel Processing Workshops: Welcome Message

    Balaji, P., Vishnu, A., Panda, D. K., Gropp, W. & Saraswat, V., 2010, In: Proceedings of the International Conference on Parallel Processing Workshops. p. xxviii 5599136.

    Research output: Contribution to journal › Editorial › peer-review

  • Self-consistent MPI performance guidelines

    Larsson Träff, J., Gropp, W. D. & Thakur, R., 2010, In: IEEE Transactions on Parallel and Distributed Systems. 21, 5, p. 698-709 12 p., 5184825.

    Research output: Contribution to journal › Article › peer-review

  • The importance of non-data-communication overheads in MPI

    Balaji, P., Chan, A., Gropp, W., Thakur, R. & Lusk, E., 2010, In: International Journal of High Performance Computing Applications. 24, 1, p. 5-15 11 p.

    Research output: Contribution to journal › Article › peer-review

  • Toward performance models of MPI implementations for understanding application scaling issues

    Hoefler, T., Gropp, W., Thakur, R. & Träff, J. L., 2010, Recent Advances in the Message Passing Interface - 17th European MPI Users' Group Meeting, EuroMPI 2010, Proceedings. p. 21-30 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6305 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2009

    Hierarchical collectives in MPICH2

    Zhu, H., Goodell, D., Gropp, W. & Thakur, R., 2009, Recent Advances in Parallel Virtual Machine and Message Passing Interface - 16th European PVM/MPI Users' Group Meeting, Proceedings. Springer, p. 325-326 2 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5759 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Investigating high performance RMA interfaces for the MPI-3 standard

    Tipparaju, V., Gropp, W., Ritzdorf, H., Thakur, R. & Träff, J. L., 2009, ICPP-2009 - The 38th International Conference on Parallel Processing. p. 293-300 8 p. 5362364. (Proceedings of the International Conference on Parallel Processing).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • MPI on a million processors

    Balaji, P., Buntinas, D., Goodell, D., Gropp, W., Kumar, S., Lusk, E., Thakur, R. & Träff, J. L., 2009, Recent Advances in Parallel Virtual Machine and Message Passing Interface - 16th European PVM/MPI Users' Group Meeting, Proceedings. Springer, p. 20-30 11 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5759 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Natively supporting true one-sided communication in MPI on multi-core systems with InfiniBand

    Santhanaraman, G., Balaji, P., Gopalakrishnan, K., Thakur, R., Gropp, W. & Panda, D. K., 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGRID 2009. IEEE Computer Society, p. 380-387 8 p. 5071895. (2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGRID 2009).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • On the need for a consortium of capability centers

    Gropp, W. & Snir, M., 2009, In: International Journal of High Performance Computing Applications. 23, 4, p. 413-420 8 p.

    Research output: Contribution to journal › Article › peer-review

  • Processing MPI datatypes outside MPI

    Ross, R., Latham, R., Gropp, W. D., Lusk, E. & Thakur, R., 2009, Recent Advances in Parallel Virtual Machine and Message Passing Interface - 16th European PVM/MPI Users' Group Meeting, Proceedings. Springer, p. 42-53 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5759 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Software for petascale computing systems

    Gropp, W. D., Sep 2009, In: Computing in Science and Engineering. 11, 5, p. 17-21 5 p., 5228711.

    Research output: Contribution to journal › Article › peer-review

  • Test suite for evaluating performance of multithreaded MPI communication

    Thakur, R. & Gropp, W., Dec 2009, In: Parallel Computing. 35, 12, p. 608-617 10 p.

    Research output: Contribution to journal › Article › peer-review

  • Toward exascale resilience

    Cappello, F., Geist, A., Gropp, B., Kale, L., Kramer, B. & Snir, M., 2009, In: International Journal of High Performance Computing Applications. 23, 4, p. 374-388 15 p.

    Research output: Contribution to journal › Article › peer-review

  • Toward message passing for a million processes: Characterizing MPI on a massive scale Blue Gene/P

    Balaji, P., Chan, A., Thakur, R., Gropp, W. & Lusk, E., Sep 2009, In: Computer Science - Research and Development. 24, 1-2, p. 11-19 9 p.

    Research output: Contribution to journal › Article › peer-review

  • 2008

    A formal approach to detect functionally irrelevant barriers in MPI programs

    Sharma, S., Vakkalanka, S., Gopalakrishnan, G., Kirby, R. M., Thakur, R. & Gropp, W., 2008, Recent Advances in Parallel Virtual Machine and Message Passing Interface - 15th European PVM/MPI Users' Group Meeting, Proceedings. p. 265-273 9 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5205 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • An efficient format for nearly constant-time access to arbitrary time intervals in large trace files

    Chan, A., Gropp, W. & Lusk, E., 2008, In: Scientific Programming. 16, 2-3, p. 155-165 11 p.

    Research output: Contribution to journal › Article › peer-review

  • A simple, pipelined algorithm for large, irregular all-gather problems

    Träff, J. L., Ripke, A., Siebert, C., Balaji, P., Thakur, R. & Gropp, W., 2008, Recent Advances in Parallel Virtual Machine and Message Passing Interface - 15th European PVM/MPI Users' Group Meeting, Proceedings. p. 84-93 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5205 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Communication analysis of parallel 3D FFT for flat Cartesian meshes on large Blue Gene systems

    Chan, A., Balaji, P., Gropp, W. & Thakur, R., 2008, High Performance Computing - HiPC 2008 - 15th International Conference, Proceedings. Springer, p. 350-364 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5374 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Exploring parallel I/O concurrency with speculative prefetching

    Chen, Y., Byna, S., Sun, X. H., Thakur, R. & Gropp, W., 2008, Proceedings - 37th International Conference on Parallel Processing, ICPP 2008. p. 422-429 8 p. 4625877. (Proceedings of the International Conference on Parallel Processing).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Hiding I/O latency with pre-execution prefetching for parallel applications

    Chen, Y., Byna, S., Sun, X. H., Thakur, R. & Gropp, W., 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008. 5213209. (2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Implementing efficient dynamic formal verification methods for MPI programs

    Vakkalanka, S., Delisi, M., Gopalakrishnan, G., Kirby, R. M., Thakur, R. & Gropp, W., 2008, Recent Advances in Parallel Virtual Machine and Message Passing Interface - 15th European PVM/MPI Users' Group Meeting, Proceedings. p. 248-256 9 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5205 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Improving the performance of tensor matrix vector multiplication in cumulative reaction probability based quantum chemistry codes

    Kaushik, D., Gropp, W., Minkoff, M. & Smith, B., 2008, High Performance Computing - HiPC 2008 - 15th International Conference, Proceedings. Springer, p. 120-130 11 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5374 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Non-data-communication overheads in MPI: Analysis on Blue Gene/P

    Balaji, P., Chan, A., Gropp, W., Thakur, R. & Lusk, E., 2008, Recent Advances in Parallel Virtual Machine and Message Passing Interface - 15th European PVM/MPI Users' Group Meeting, Proceedings. p. 13-22 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5205 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Parallel I/O prefetching using MPI file caching and I/O signatures

    Byna, S., Chen, Y., Sun, X. H., Thakur, R. & Gropp, W., 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008. 5213604. (2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2008).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Scaling science applications on Blue Gene

    Gropp, W. D., Frings, W., Hermanns, M. A., Jedlicka, E., Jordan, K. E., Mintzer, F. & Orth, B., 2008, Parallel Computing: Architectures, Algorithms and Applications. IOS Press BV, p. 583-584 2 p. (Advances in Parallel Computing; vol. 15).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Self-consistent MPI-IO performance requirements and expectations

    Gropp, W. D., Kimpe, D., Ross, R., Thakur, R. & Träff, J. L., 2008, Recent Advances in Parallel Virtual Machine and Message Passing Interface - 15th European PVM/MPI Users' Group Meeting, Proceedings. p. 167-176 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5205 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Toward efficient support for multithreaded MPI communication

    Balaji, P., Buntinas, D., Goodell, D., Gropp, W. & Thakur, R., 2008, Recent Advances in Parallel Virtual Machine and Message Passing Interface - 15th European PVM/MPI Users' Group Meeting, Proceedings. p. 120-129 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5205 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2007

    Advanced flow-control mechanisms for the sockets direct protocol over InfiniBand

    Balaji, P., Bhagvat, S., Panda, D. K., Thakur, R. & Gropp, W., 2007, 2007 International Conference on Parallel Processing, ICPP. 4343880. (Proceedings of the International Conference on Parallel Processing).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Analyzing the impact of supporting out-of-order communication on in-order performance with iWARP

    Balaji, P., Feng, W., Bhagvat, S., Panda, D. K., Thakur, R. & Gropp, W., 2007, Proceedings of the 2007 ACM/IEEE Conference on Supercomputing, SC'07. 35. (Proceedings of the 2007 ACM/IEEE Conference on Supercomputing, SC'07).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • A portable method for finding user errors in the usage of MPI collective operations

    Falzone, C., Chan, A., Lusk, E. & Gropp, W., May 2007, In: International Journal of High Performance Computing Applications. 21, 2, p. 155-165 11 p.

    Research output: Contribution to journal › Article › peer-review

    Open Access
  • Electron injection by a nanowire in the bubble regime

    Shen, B., Li, Y., Nemeth, K., Shang, H., Chae, Y. C., Soliday, R., Crowell, R., Frank, E., Gropp, W. & Cary, J., 2007, In: Physics of Plasmas. 14, 5, 053115.

    Research output: Contribution to journal › Article › peer-review

  • Extending the MPI-2 generalized request interface

    Latham, R., Gropp, W., Ross, R. & Thakur, R., 2007, Recent Advances in Parallel Virtual Machine and Message Passing Interface - 14th European PVM/MPI Users' Group Meeting, Proceedings. Springer, p. 223-232 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4757 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Grid-based image registration

    Gropp, W., Haber, E., Heldmann, S., Keyes, D., Miller, N., Schopf, J. & Yang, T., 2007, Grid-Based Problem Solving Environments: IFIP TC2/ WG 2.5 Working Conference on Grid-Based Problem Solving Environments: Implications for Development and Deployment of Numerical Software. Gaffney, P. W. & Pool, J. C. T. (eds.). p. 435-448 14 p. (IFIP International Federation for Information Processing; vol. 239).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Implementation and evaluation of shared-memory communication and synchronization operations in MPICH2 using the Nemesis communication subsystem

    Buntinas, D., Mercier, G. & Gropp, W., Sep 2007, In: Parallel Computing. 33, 9, p. 634-644 11 p.

    Research output: Contribution to journal › Article › peer-review

    Open Access
  • Nonuniformly communicating noncontiguous data: A case study with PETSc and MPI

    Balaji, P., Buntinas, D., Balay, S., Smith, B., Thakur, R. & Gropp, W., 2007, Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM. 4227951. (Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution