  2020

  • HAL: Computer System for Scalable Deep Learning

    Kindratenko, V., Mu, D., Zhan, Y., Maloney, J., Hashemi, S. H., Rabe, B., Xu, K., Campbell, R., Peng, J. & Gropp, W., Jul 26 2020, PEARC 2020 - Practice and Experience in Advanced Research Computing 2020: Catch the Wave. Association for Computing Machinery, p. 41-48 8 p. (ACM International Conference Proceeding Series).

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  2019

  • Learning with analytical models

    Ibeid, H., Meng, S., Dobon, O., Olson, L. & Gropp, W., May 2019, Proceedings - 2019 IEEE 33rd International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2019. Institute of Electrical and Electronics Engineers Inc., p. 778-786 9 p. 8778229. (Proceedings - 2019 IEEE 33rd International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2019).

  • Locus: A System and a Language for Program Optimization

    Teixeira, T. S. F. X., Ancourt, C., Padua, D. & Gropp, W., Mar 5 2019, CGO 2019 - Proceedings of the 2019 IEEE/ACM International Symposium on Code Generation and Optimization. Moseley, T., Jimborean, A. & Kandemir, M. T. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 217-228 12 p. 8661203. (CGO 2019 - Proceedings of the 2019 IEEE/ACM International Symposium on Code Generation and Optimization).

  • Moya—A JIT Compiler for HPC

    Prabhu, T. & Gropp, W., 2019, Programming and Performance Visualization Tools - International Workshops, ESPT 2017 and VPA 2017, Revised Selected Papers. Bhatele, A., Boehme, D., Levine, J. A., Malony, A. D. & Schulz, M. (eds.). Springer-Verlag Berlin Heidelberg, p. 56-73 18 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 11027 LNCS).

  • Node-Aware Improvements to Allreduce

    Bienz, A., Olson, L. & Gropp, W., Nov 2019, Proceedings of ExaMPI 2019: Workshop on Exascale MPI - Held in conjunction with SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis. Institute of Electrical and Electronics Engineers Inc., p. 19-28 10 p. 8955452. (Proceedings of ExaMPI 2019: Workshop on Exascale MPI - Held in conjunction with SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis).

  • Using performance models to understand scalable Krylov solver performance at scale for structured grid problems

    Eller, P. R., Hoefler, T. & Gropp, W., Jun 26 2019, ICS 2019 - International Conference on Supercomputing. Association for Computing Machinery, p. 138-149 12 p. (Proceedings of the International Conference on Supercomputing).

  2018

  • Improving performance models for irregular point-to-point communication

    Bienz, A., Gropp, W. D. & Olson, L. N., Sep 23 2018, EuroMPI 2018 - Proceedings of the 25th European MPI Users' Group Meeting. Association for Computing Machinery, a7. (ACM International Conference Proceeding Series).

  • Using node information to implement MPI cartesian topologies

    Gropp, W. D., Sep 23 2018, EuroMPI 2018 - Proceedings of the 25th European MPI Users' Group Meeting. Association for Computing Machinery, a18. (ACM International Conference Proceeding Series).

  2017

  • A DSL for Performance Orchestration

    Teixeira, T. S. F. X., Padua, D. & Gropp, W., Oct 31 2017, Proceedings - 26th International Conference on Parallel Architectures and Compilation Techniques, PACT 2017. Institute of Electrical and Electronics Engineers Inc., 1 p. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT; vol. 2017-September).

  • Scalability challenges in current MPI one-sided implementations

    Zhao, X., Balaji, P. & Gropp, W., Apr 18 2017, Proceedings - 15th International Symposium on Parallel and Distributed Computing, ISPDC 2016. Chen, R., Grigoras, D. & Rong, C. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 38-47 10 p. 7904267. (Proceedings - 15th International Symposium on Parallel and Distributed Computing, ISPDC 2016).

  • Towards a more complete understanding of SDC propagation

    Calhoun, J., Snir, M., Olson, L. N. & Gropp, W. D., Jun 26 2017, HPDC 2017 - Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing. Association for Computing Machinery, Inc, p. 131-142 12 p. (HPDC 2017 - Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing).

  2016

  • Modeling MPI communication performance on SMP nodes: Is it time to retire the ping pong test

    Gropp, W., Olson, L. N. & Samfass, P., Sep 25 2016, Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016. Association for Computing Machinery, p. 41-50 10 p. (ACM International Conference Proceeding Series; vol. 25-28-September-2016).

  • Rethinking high performance computing system architecture for scientific big data applications

    Chen, Y., Chen, C., Yin, Y., Sun, X. H., Thakur, R. & Gropp, W., Jan 1 2016, Proceedings - 15th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, 10th IEEE International Conference on Big Data Science and Engineering and 14th IEEE International Symposium on Parallel and Distributed Processing with Applications, IEEE TrustCom/BigDataSE/ISPA 2016. Institute of Electrical and Electronics Engineers Inc., p. 1605-1612 8 p. 7847131. (Proceedings - 15th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, 10th IEEE International Conference on Big Data Science and Engineering and 14th IEEE International Symposium on Parallel and Distributed Processing with Applications, IEEE TrustCom/BigDataSE/ISPA 2016).

  • Scalable Non-blocking Preconditioned Conjugate Gradient Methods

    Eller, P. R. & Gropp, W., Jul 2 2016, Proceedings of SC 2016: The International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Computer Society, p. 204-215 12 p. 7877096. (International Conference for High Performance Computing, Networking, Storage and Analysis, SC; vol. 0).

  • Towards millions of communicating threads

    Dang, H. V., Snir, M. & Gropp, W., Sep 25 2016, Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016. Association for Computing Machinery, p. 1-14 14 p. (ACM International Conference Proceeding Series; vol. 25-28-September-2016).

  2015

  • Algebraic multigrid on a dragonfly network: First experiences on a Cray XC30

    Gahvari, H., Gropp, W., Jordan, K. E., Schulz, M. & Yang, U. M., Jan 1 2015, High Performance Computing Systems: Performance Modeling, Benchmarking, and Simulation - 5th International Workshop, PMBS 2014, Revised Selected Papers. Hammond, S. D., Jarvis, S. A. & Wright, S. A. (eds.). Springer-Verlag, p. 3-23 21 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 8966).

  • A multiplatform study of I/O behavior on petascale supercomputers

    Luu, H., Winslett, M., Gropp, W., Ross, R., Carns, P., Harms, K., Prabhat, M., Byna, S. & Yao, Y., Jun 15 2015, HPDC 2015 - Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing. Association for Computing Machinery, Inc, p. 33-44 12 p. (HPDC 2015 - Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing).

  • Composing low-overhead scheduling strategies for improving performance of scientific applications

    Kale, V. & Gropp, W. D., 2015, OpenMP: Heterogenous Execution and Data Movements - 11th International Workshop on OpenMP, IWOMP 2015, Proceedings. Terboven, C., Reble, P., Müller, M. S., Chapman, B. M. & de Supinski, B. R. (eds.). Springer-Verlag Berlin Heidelberg, p. 18-29 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 9342).

  • DAME: A Runtime-Compiled engine for derived datatypes

    Prabhu, T. & Gropp, W., Sep 21 2015, Proceedings of the 22nd European MPI Users' Group Meeting, EuroMPI 2015. Association for Computing Machinery, a4. (ACM International Conference Proceeding Series; vol. 21-23-September-2015).

  • Decoupled I/O for Data-Intensive High Performance Computing

    Chen, C., Chen, Y., Feng, K., Yin, Y., Eslami, H., Thakur, R., Sun, X. H. & Gropp, W. D., May 7 2015, Proceedings - 43rd International Conference on Parallel Processing Workshops, ICPPW 2014. Institute of Electrical and Electronics Engineers Inc., p. 312-320 9 p. 7103466. (Proceedings of the International Conference on Parallel Processing Workshops; vol. 2015-May).

  • Efficient disk-to-disk sorting: A case study in the Decoupled Execution Paradigm

    Eslami, H., Kougkas, A., Kotsifakou, M., Kasampalis, T., Feng, K., Lu, Y., Gropp, W., Sun, X. H., Chen, Y. & Thakur, R., Nov 15 2015, Proceedings of DISCS 2015: The 2015 International Workshop on Data-Intensive Scalable Computing Systems - Held in conjunction with SC 2015: The International Conference for High Performance Computing, Networking, Storage and Analysis. Association for Computing Machinery, Inc, 2. (Proceedings of DISCS 2015: The 2015 International Workshop on Data-Intensive Scalable Computing Systems - Held in conjunction with SC 2015: The International Conference for High Performance Computing, Networking, Storage and Analysis).

  • Runtime support for irregular computation in MPI-based applications

    Zhao, X., Balaji, P. & Gropp, W., Jul 7 2015, Proceedings - 2015 IEEE/ACM 15th International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2015. Institute of Electrical and Electronics Engineers Inc., p. 701-704 4 p. 7152536. (Proceedings - 2015 IEEE/ACM 15th International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2015).

  2014

  • Locality-optimized mixed static/dynamic scheduling for improving load balancing on SMPs

    Kale, V., Randles, A. & Gropp, W. D., Sep 9 2014, Proceedings of the 21st European MPI Users' Group Meeting, EuroMPI/ASIA 2014. Association for Computing Machinery, p. 115-116 2 p. (ACM International Conference Proceeding Series; vol. 09-12-September-2014).

  • Rethinking key-value store for parallel I/O optimization

    Yin, Y., Kougkas, A., Feng, K., Eslami, H., Lu, Y., Sun, X. H., Thakur, R. & Gropp, W., Apr 2 2014, Proceedings of DISCS 2014: The 2014 International Workshop on Data-Intensive Scalable Computing Systems - Held in Conjuction with SC 2014: The International Conference for High Performance Computing, Networking, Storage and Analysis. Institute of Electrical and Electronics Engineers Inc., p. 33-40 8 p. 7079024. (Proceedings of DISCS 2014: The 2014 International Workshop on Data-Intensive Scalable Computing Systems - Held in Conjuction with SC 2014: The International Conference for High Performance Computing, Networking, Storage and Analysis).

  2013

  • Analysis of topology-dependent MPI performance on Gemini networks

    Peña, A. J., Carvalho, R. G. C., Dinan, J., Balaji, P., Thakur, R. & Gropp, W., 2013, Proceedings of the 20th European MPI Users' Group Meeting, EuroMPI 2013. Association for Computing Machinery, p. 61-66 6 p. (ACM International Conference Proceeding Series).

  • MPI-interoperable generalized active messages

    Zhao, X., Balaji, P., Gropp, W. D. & Thakur, R., Jan 1 2013, Proceedings - 2013 19th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2013. IEEE Computer Society, p. 200-207 8 p. 6808175. (Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS).

  • Optimization strategies for MPI-interoperable active messages

    Zhao, X., Balaji, P., Gropp, W. D. & Thakur, R., Jan 1 2013, Proceedings - 2013 IEEE 11th International Conference on Dependable, Autonomic and Secure Computing, DASC 2013. IEEE Computer Society, p. 508-515 8 p. 6844416. (Proceedings - 2013 IEEE 11th International Conference on Dependable, Autonomic and Secure Computing, DASC 2013).

  • Runtime system design of decoupled execution paradigm for data-intensive high-end computing

    Feng, K., Yin, Y., Chen, C., Eslami, H., Sun, X. H., Chen, Y., Thakur, R. & Gropp, W., Dec 1 2013, 2013 IEEE International Conference on Cluster Computing, CLUSTER 2013. 6702642. (Proceedings - IEEE International Conference on Cluster Computing, ICCC).

  • Systematic reduction of data movement in algebraic multigrid solvers

    Gahvari, H., Gropp, W., Jordan, K. E., Schulz, M. & Yang, U. M., 2013, Proceedings - IEEE 27th International Parallel and Distributed Processing Symposium Workshops and PhD Forum, IPDPSW 2013. IEEE Computer Society, p. 1675-1682 8 p. 6651065. (Proceedings - IEEE 27th International Parallel and Distributed Processing Symposium Workshops and PhD Forum, IPDPSW 2013).

  2012

  • A case for optimistic coordination in HPC storage systems

    Carns, P., Harms, K., Kimpe, D., Ross, R., Wozniak, J., Ward, L., Curry, M., Klundt, R., Danielson, G., Karakoyunlu, C., Chandy, J., Settlemeyer, B. & Gropp, W., 2012, Proceedings - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012. p. 48-53 6 p. 6495801. (Proceedings - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012).

  • Adaptive strategy for one-sided communication in MPICH2

    Zhao, X., Santhanaraman, G. & Gropp, W., 2012, Recent Advances in the Message Passing Interface - 19th European MPI Users' Group Meeting, EuroMPI 2012, Proceedings. p. 16-26 11 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 7490 LNCS).

  • A decoupled execution paradigm for data-intensive high-end computing

    Chen, Y., Chen, C., Sun, X. H., Gropp, W. D. & Thakur, R., Jan 1 2012, Proceedings - 2012 IEEE International Conference on Cluster Computing, CLUSTER 2012. IEEE Computer Society, p. 200-208 9 p. 6337781. (Proceedings - 2012 IEEE International Conference on Cluster Computing, CLUSTER 2012).

  • Efficient multithreaded context ID allocation in MPI

    Dinan, J., Goodell, D., Gropp, W., Thakur, R. & Balaji, P., Oct 24 2012, Recent Advances in the Message Passing Interface - 19th European MPI Users' Group Meeting, EuroMPI 2012, Proceedings. p. 57-66 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 7490 LNCS).

  • Faster topology-aware collective algorithms through non-minimal communication

    Sack, P. & Gropp, W., Mar 22 2012, PPoPP'12 - Proceedings of the 2012 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 45-54 10 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

  • Hybrid static/dynamic scheduling for already optimized dense matrix factorization

    Donfack, S., Grigori, L., Gropp, W. D. & Kale, V., 2012, Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS 2012. p. 496-507 12 p. 6267853. (Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS 2012).

  • Leveraging MPI's one-sided communication interface for shared-memory programming

    Hoefler, T., Dinan, J., Buntinas, D., Balaji, P., Barrett, B. W., Brightwell, R., Gropp, W., Kale, V. & Thakur, R., 2012, Recent Advances in the Message Passing Interface - 19th European MPI Users' Group Meeting, EuroMPI 2012, Proceedings. p. 132-141 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 7490 LNCS).

  • Modeling the performance of an algebraic multigrid cycle using hybrid MPI/OpenMP

    Gahvari, H., Gropp, W., Jordan, K. E., Schulz, M. & Yang, U. M., 2012, Proceedings - 41st International Conference on Parallel Processing, ICPP 2012. p. 128-137 10 p. 6337574. (Proceedings of the International Conference on Parallel Processing).

  • MPI 3 and beyond: Why MPI is successful and what challenges it faces

    Gropp, W., Oct 24 2012, Recent Advances in the Message Passing Interface - 19th European MPI Users' Group Meeting, EuroMPI 2012, Proceedings. p. 1-9 9 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 7490 LNCS).

  • Performance modeling of algebraic multigrid on Blue Gene/Q: Lessons learned

    Gahvari, H., Gropp, W., Jordan, K. E., Schulz, M. & Yang, U. M., 2012, Proceedings - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012. p. 377-385 9 p. 6495839. (Proceedings - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012).

  2011

  • Architectural constraints to attain 1 exaflop/s for three scientific application classes

    Bhatele, A., Jetley, P., Gahvari, H., Wesolowski, L., Gropp, W. D. & Kale, L. V., Oct 3 2011, Proceedings - 25th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2011. p. 80-91 12 p. 6012827. (Proceedings - 25th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2011).

  • Avoiding hot-spots on two-level direct networks

    Bhatele, A., Jain, N., Gropp, W. D. & Kale, L. V., Dec 14 2011, Proceedings of 2011 SC - International Conference for High Performance Computing, Networking, Storage and Analysis. 76. (Proceedings of 2011 SC - International Conference for High Performance Computing, Networking, Storage and Analysis).

  • LACIO: A new collective I/O strategy for parallel I/O systems

    Chen, Y., Sun, X. H., Thakur, R., Roth, P. C. & Gropp, W. D., 2011, Proceedings - 25th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2011. p. 794-804 11 p. 6012889. (Proceedings - 25th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2011).

  • Modeling the performance of an algebraic multigrid cycle on HPC platforms

    Gahvari, H., Baker, A. H., Schulz, M., Yang, U. M., Jordan, K. E. & Gropp, W. D., Jun 30 2011, ICS'11 - Proceedings of the 2011 ACM International Conference on Supercomputing. p. 172-181 10 p. (Proceedings of the International Conference on Supercomputing).

  • Multi-core and network aware MPI topology functions

    Rashti, M. J., Green, J., Balaji, P., Afsahi, A. & Gropp, W., 2011, Recent Advances in the Message Passing Interface - 18th European MPI Users' Group Meeting, EuroMPI 2011, Proceedings. p. 50-60 11 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6960 LNCS).

  • Performance expectations and guidelines for MPI derived datatypes

    Gropp, W., Hoefler, T., Thakur, R. & Träff, J. L., 2011, Recent Advances in the Message Passing Interface - 18th European MPI Users' Group Meeting, EuroMPI 2011, Proceedings. p. 150-159 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6960 LNCS).

  • Performance modeling for systematic performance tuning

    Hoefler, T., Gropp, W. D., Snir, M. & Kramer, W. T., Dec 13 2011, State of the Practice Reports, SC'11. 6. (State of the Practice Reports, SC'11).

  • Scalable memory use in MPI: A case study with MPICH2

    Goodell, D., Gropp, W., Zhao, X. & Thakur, R., 2011, Recent Advances in the Message Passing Interface - 18th European MPI Users' Group Meeting, EuroMPI 2011, Proceedings. p. 140-149 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6960 LNCS).

  • Weighted locality-sensitive scheduling for mitigating noise on multi-core clusters

    Kale, V., Bhatele, A. & Gropp, W. D., 2011, 18th International Conference on High Performance Computing, HiPC 2011. 6152722. (18th International Conference on High Performance Computing, HiPC 2011).

  2010

  • An adaptive performance modeling tool for GPU architectures

    Baghsorkhi, S. S., Delahaye, M., Patel, S. J., Gropp, W. D. & Hwu, W. M. W., Mar 15 2010, PPoPP'10 - Proceedings of the 2010 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 105-114 10 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

  • An introductory exascale feasibility study for FFTs and multigrid

    Gahvari, H. & Gropp, W. D., Jul 1 2010, Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010. 5470417. (Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010).
