  • 2023

    Enabling Multi-level Network Modeling in Structural Simulation Toolkit for Next-Generation HPC Network Design Space Exploration

    Chenna, S. P., Kumar, N., Borges, L., Steyer, M., Thierry, P. & Garzaran, M., 2023, High Performance Computing - ISC High Performance 2023 International Workshops, Revised Selected Papers. Bienz, A., Weiland, M., Baboulin, M. & Kruse, C. (eds.). Springer, p. 366-377 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 13999 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2019

    Minimizing the usage of hardware counters for collective communication using triggered operations

    Islam, N. S., Zheng, G., Sur, S., Langer, A. & Garzaran, M., Sep 11 2019, Proceedings of the 26th European MPI Users' Group Meeting, EuroMPI 2019. Hoefler, T. & Traff, J. L. (eds.). Association for Computing Machinery, a11. (ACM International Conference Proceeding Series).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • NoMap: Speeding-up javascript using hardware transactional memory

    Shull, T., Choi, J., Garzaran, M. J. & Torrellas, J., Mar 26 2019, Proceedings - 25th IEEE International Symposium on High Performance Computer Architecture, HPCA 2019. Institute of Electrical and Electronics Engineers Inc., p. 412-425 14 p. 8675185. (Proceedings - 25th IEEE International Symposium on High Performance Computer Architecture, HPCA 2019).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Software combining to mitigate multithreaded MPI contention

    Amer, A., Archer, C., Blocksome, M., Cao, C., Chuvelev, M., Fujita, H., Garzaran, M., Guo, Y., Hammond, J. R., Iwasaki, S., Raffenetti, K. J., Shiryaev, M., Si, M., Taura, K., Thapaliya, S. & Balaji, P., Jun 26 2019, ICS 2019 - International Conference on Supercomputing. Association for Computing Machinery, p. 367-379 13 p. (Proceedings of the International Conference on Supercomputing).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2018

    Framework for scalable intra-node collective operations using shared memory

    Jain, S., Kaleem, R., Balmana, M. G., Langer, A., Durnov, D., Sannikov, A. & Garzaran, M., Jul 2 2018, Proceedings - International Conference for High Performance Computing, Networking, Storage, and Analysis, SC 2018. Institute of Electrical and Electronics Engineers Inc., p. 374-385 12 p. 8665755. (Proceedings - International Conference for High Performance Computing, Networking, Storage, and Analysis, SC 2018).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Parallelizing MPI using tasks for hybrid programming models

    Jain, S., Zheng, G., Garzaran, M., Cownie, J. H., Doodi, T. & Wilmarth, T. L., Aug 3 2018, Proceedings - 2018 IEEE 32nd International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018. Institute of Electrical and Electronics Engineers Inc., p. 1303-1312 10 p. 8425570. (Proceedings - 2018 IEEE 32nd International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2017

    OpenMP® runtime instrumentation for optimization

    Doodi, T., Peyton, J., Cownie, J., Garzaran, M., Kalidas, R., Kim, J., Mathuriya, A., Wilmarth, T. & Zheng, G., 2017, Scaling OpenMP for Exascale Performance and Portability - 13th International Workshop on OpenMP, IWOMP 2017, Proceedings. de Supinski, B. R., Chapman, B. M., Terboven, C., Muller, M. S. & Olivier, S. L. (eds.). Springer, p. 281-295 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 10468 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • ShortCut: Architectural support for fast object access in scripting languages

    Choi, J., Shull, T., Garzaran, M. J. & Torrellas, J., Jun 24 2017, ISCA 2017 - 44th Annual International Symposium on Computer Architecture - Conference Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 494-506 13 p. (Proceedings - International Symposium on Computer Architecture; vol. Part F128643).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • 2016

    Breadth-First Search on Heterogeneous Platforms: A Case of Study on Social Networks

    Remis, L., Garzaran, M. J., Asenjo, R. & Navarro, A., Dec 16 2016, Proceedings - 28th IEEE International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2016. IEEE Computer Society, p. 118-125 8 p. 7789331. (Proceedings - Symposium on Computer Architecture and High Performance Computing).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • DSMR: A parallel algorithm for Single-Source Shortest Path problem

    Maleki, S., Nguyen, D., Lenharth, A., Garzarán, M., Padua, D. & Pingali, K., Jun 1 2016, Proceedings of the 2016 International Conference on Supercomputing, ICS 2016. Association for Computing Machinery, a32. (Proceedings of the International Conference on Supercomputing; vol. 01-03-June-2016).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • DSMR: A shared and distributed memory algorithm for single-source shortest path problem

    Maleki, S., Nguyen, D., Lenharth, A., Garzarán, M., Padua, D. & Pingali, K., Feb 27 2016, 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2016 - Proceedings. Association for Computing Machinery, 39. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP; vol. 12-16-March-2016).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Pipeline template for streaming applications on heterogeneous chips

    Rodríguez, A., Navarro, A., Asenjo, R., Corbera, F., Vilches, A. & Garzarán, M., 2016, Parallel Computing: On the Road to Exascale. Peters, F., Parsons, M., Sawyer, M., Leather, H. & Joubert, G. R. (eds.). Elsevier B.V., p. 327-336 10 p. (Advances in Parallel Computing; vol. 27).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2015

    Directive-based compilers for GPUs

    Ghike, S., Gran, R., Garzarán, M. J. & Padua, D. A., 2015, Languages and Compilers for Parallel Computing - 27th International Workshop, LCPC 2014, Revised Selected Papers. Brodman, J. & Tu, P. (eds.). Springer, p. 19-35 17 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 8967).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Parallel Pipeline on Heterogeneous Multi-processing Architectures

    Rodriguez, A., Navarro, A., Asenjo, R., Vilches, A., Corbera, F. & Garzaran, M., Dec 2 2015, Proceedings - 13th IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2015. Institute of Electrical and Electronics Engineers Inc., p. 166-171 6 p. 7345643. (Proceedings - 14th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2015; vol. 3).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Understanding the propagation of error due to a silent data corruption in a sparse matrix vector multiply

    Calhoun, J., Snir, M., Olson, L. & Garzaran, M., Oct 26 2015, Proceedings - 2015 IEEE International Conference on Cluster Computing, CLUSTER 2015. Institute of Electrical and Electronics Engineers Inc., p. 541-542 2 p. 7307650. (Proceedings - IEEE International Conference on Cluster Computing, ICCC; vol. 2015-October).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2014

    Evaluation of a feature tracking vision application on a heterogeneous chip

    Gran, R., Shi, A., Totoni, E. & Garzarán, M. J., Dec 1 2014, Proceedings - IEEE 26th International Symposium. IEEE Computer Society, p. 246-253 8 p. 6970671. (Proceedings - Symposium on Computer Architecture and High Performance Computing).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Improving javascript performance by deconstructing the type system

    Ahn, W., Choi, J., Shull, T., Garzarán, M. J. & Torrellas, J., 2014, PLDI 2014 - Proceedings of the 2014 ACM SIGPLAN Conference on Programming Language Design and Implementation. Association for Computing Machinery, p. 496-507 12 p. (Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Optimization by runtime specialization for sparse matrix-vector multiplication

    Kamin, S., Garzarán, M. J., Aktemur, B., Xu, D., Yilmaz, B. & Chen, Z., Sep 15 2014, 13th International Conference on Generative Programming: Concepts and Experiences, GPCE 2014 - Proceedings. Association for Computing Machinery, p. 93-102 10 p. (13th International Conference on Generative Programming: Concepts and Experiences, GPCE 2014 - Proceedings).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2012

    Hierarchical overlapped tiling

    Zhou, X., Giacalone, J. P., Garzarán, M. J., Kuhn, R. H., Ni, Y. & Padua, D., 2012, Proceedings - International Symposium on Code Generation and Optimization, CGO 2012. p. 207-218 12 p. (Proceedings - International Symposium on Code Generation and Optimization, CGO 2012).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Performance portability with the Chapel language

    Sidelnik, A., Maleki, S., Chamberlain, B. L., Garzarán, M. J. & Padua, D., 2012, Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS 2012. p. 582-594 13 p. 6267860. (Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS 2012).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • 2011

    An evaluation of vectorizing compilers

    Maleki, S., Gao, Y., Garzarán, M. J., Wong, T. & Padua, D. A., 2011, Proceedings - 2011 International Conference on Parallel Architectures and Compilation Techniques, PACT 2011. p. 372-382 11 p. 6113845. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • A parallel numerical solver using hierarchically tiled arrays

    Brodman, J. C., Evans, G. C., Manguoglu, M., Sameh, A., Garzarán, M. J. & Padua, D., 2011, Languages and Compilers for Parallel Computing - 23rd International Workshop, LCPC 2010, Revised Selected Papers. p. 46-61 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6548 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Scheduling of stream-based real-time applications for heterogeneous systems

    Virlet, B., Zhou, X., Giacalone, J. P., Kuhn, B., Garzarán, M. J. & Padua, D., 2011, LCTES'11 - Proceedings of the ACM SIGPLAN/SIGBED 2011 Conference on Languages, Compilers, Tools and Theory for Embedded Systems. p. 1-10 10 p. (Proceedings of the ACM SIGPLAN Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES)).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • 2009

    ESoftCheck: Removal of non-vital checks for fault tolerance

    Yu, J., Garzarán, M. J. & Snir, M., 2009, Proceedings of the 2009 CGO - 7th International Symposium on Code Generation and Optimization. p. 35-46 12 p. 4907649. (Proceedings of the 2009 CGO - 7th International Symposium on Code Generation and Optimization).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Optimization of tele-immersion codes

    Sidelnik, A., Sung, I. J., Wu, W., Garzarán, M. J., Hwu, W. M., Nahrstedt, K., Padua, D. & Patel, S. J., 2009, Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-2. p. 85 1 p. (Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-2).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2008

    Automatic generation of a parallel sorting algorithm

    Garber, B. A., Hoeflinger, D., Li, X., Garzarán, M. J. & Padua, D., 2008, IPDPS Miami 2008 - Proceedings of the 22nd IEEE International Parallel and Distributed Processing Symposium, Program and CD-ROM. 4536400. (IPDPS Miami 2008 - Proceedings of the 22nd IEEE International Parallel and Distributed Processing Symposium, Program and CD-ROM).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Design issues in parallel array languages for shared memory

    Brodman, J., Fraguela, B. B., Garzarán, M. J. & Padua, D., 2008, Embedded Computer Systems: Architectures, Modeling, and Simulation - 8th International Workshop, SAMOS 2008, Proceedings. p. 208-217 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5114 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Efficient software checking for fault tolerance

    Yu, J., Garzarán, M. J. & Snir, M., 2008, IPDPS Miami 2008 - Proceedings of the 22nd IEEE International Parallel and Distributed Processing Symposium, Program and CD-ROM. 4536435. (IPDPS Miami 2008 - Proceedings of the 22nd IEEE International Parallel and Distributed Processing Symposium, Program and CD-ROM).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • P-ray: A software suite for multi-core architecture characterization

    Duchateau, A. X., Sidelnik, A., Garzarán, M. J. & Padua, D., 2008, Languages and Compilers for Parallel Computing - 21st International Workshop, LCPC 2008, Revised Selected Papers. p. 187-201 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5335 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Programming with tiles

    Guo, J., Bikshandi, G., Fraguela, B. B., Garzarán, M. J. & Padua, D., 2008, PPoPP'08 - Proceedings of the 2008 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 1-10 10 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Techniques for efficient software checking

    Yu, J., Garzarán, M. J. & Snir, M., 2008, Languages and Compilers for Parallel Computing - 20th International Workshop, LCPC 2007, Revised Selected Papers. p. 16-31 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5234 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2007

    Compiler optimizations for fault tolerance software checking

    Yu, J. & Garzarán, M. J., 2007, 16th International Conference on Parallel Architecture and Compilation Techniques, PACT 2007. p. 433 1 p. 4336261. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Design and use of htalib - A library for hierarchically tiled arrays

    Bikshandi, G., Guo, J., Von Praun, C., Tanase, G., Fraguela, B. B., Garzarán, M. J., Padua, D. & Rauchwerger, L., 2007, Languages and Compilers for Parallel Computing - 19th International Workshop, LCPC 2006, Revised Papers. Springer, p. 17-32 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4382 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Optimizing sorting with machine learning algorithms

    Li, X., Garzarán, M. J. & Padua, D., 2007, Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM. 4228227. (Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • 2006

    A language for the compact representation of multiple program versions

    Donadio, S., Brodman, J., Roeder, T., Yotov, K., Barthou, D., Cohen, A., Garzarán, M. J., Padua, D. & Pingali, K., 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. Springer, p. 136-151 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Analytic models and empirical search: A hybrid approach to code optimization

    Epshteyn, A., Garzaran, M. J., DeJong, G., Padua, D., Ren, G., Li, X., Yotov, K. & Pingali, K., 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. p. 259-273 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Hierarchically tiled arrays for parallelism and locality

    Jia, G., Bikshandi, G., Hoeflinger, D., Almasi, G., Fraguela, B., Garzarán, M. J., Padua, D. & Von Praun, C., 2006, 20th International Parallel and Distributed Processing Symposium, IPDPS 2006. IEEE Computer Society, 1639573. (20th International Parallel and Distributed Processing Symposium, IPDPS 2006; vol. 2006).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • Optimizing matrix multiplication with a classifier learning system

    Li, X. & Garzarán, M. J., 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. p. 121-135 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Programming for parallelism, and locality with hierarchically tiled arrays

    Bikshandi, G., Jia, G., Hoeflinger, D., Almasi, G., Fraguela, B. B., Garzarán, M. J., Padua, D. & Von Praun, C., 2006, Proceedings of the 2006 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP'06. Association for Computing Machinery, p. 48-57 10 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP; vol. 2006).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • 2005

    Optimizing sorting with genetic algorithms

    Li, X., Garzarán, M. J. & Padua, D., 2005, Proceedings of the 2005 International Symposium on Code Generation and Optimization, CGO 2005. p. 99-110 12 p. 1402080. (Proceedings of the 2005 International Symposium on Code Generation and Optimization, CGO 2005; vol. 2005).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2004

    A dynamically tuned sorting library

    Li, X., Garzarán, M. J. & Padua, D., 2004, International Symposium on Code Generation and Optimization, CGO 2004. p. 111-122 12 p. (International Symposium on Code Generation and Optimization, CGO).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2003

    Tradeoffs in buffering memory state for thread-level speculation in multiprocessors

    Garzaran, M. J., Prvulovic, M., Llaberia, J. M., Vinals, V., Rauchwerger, L. & Torrellas, J., 2003, Proceedings - 9th International Symposium on High-Performance Computer Architecture, HPCA 2003. IEEE Computer Society, p. 191-202 12 p. 1183537. (Proceedings - International Symposium on High-Performance Computer Architecture; vol. 12).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • Using software logging to support multiversion buffering in thread-level speculation

    Garzarán, M. J., Prvulovic, M., Viñals, V., Llabería, J. M., Rauchwerger, L. & Torrellas, J., 2003, Proceedings - 12th International Conference on Parallel Architectures and Compilation Techniques, PACT 2003. Institute of Electrical and Electronics Engineers Inc., p. 170-181 12 p. 1238013. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT; vol. 2003-January).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access
  • 2002

    Smartapps, an application centric approach to high performance computing: Compiler-assisted software and hardware support for reduction operations

    Dang, F., Jesús Garzarán, M., Prvulovic, M., Zhang, Y., Jula, A., Yu, H., Amato, N., Rauchwerger, L. & Torrellas, J., 2002, Proceedings - International Parallel and Distributed Processing Symposium, IPDPS 2002. Institute of Electrical and Electronics Engineers Inc., p. 172-181 10 p. 1016572. (Proceedings - International Parallel and Distributed Processing Symposium, IPDPS 2002).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  • 2001

    Hardware prefetching in bus-based multiprocessors: Pattern characterization and cost-effective hardware

    Garzarán, M. J., Briz, J. L., Ibáñez, P. E. & Viñals, V., 2001, Proceedings - 9th Euromicro Workshop on Parallel and Distributed Processing, PDP 2001. Klockner, K. (ed.). Institute of Electrical and Electronics Engineers Inc., p. 345-354 10 p. 905061. (Proceedings - 9th Euromicro Workshop on Parallel and Distributed Processing, PDP 2001).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Open Access