David A Padua

If you made any changes in Pure these will be visible here soon.

Research Output

2019

Dataflow Execution of Hierarchically Tiled Arrays

Yang, C. C., Pichel, J. C. & Padua, D. A., Jan 1 2019, Euro-Par 2019: Parallel Processing - 25th International Conference on Parallel and Distributed Computing, Proceedings. Yahyapour, R. (ed.). Springer, p. 304-316 13 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 11725 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Locus: A System and a Language for Program Optimization

Teixeira, S. F. X. T., Ancourt, C., Padua, D. & Gropp, W., Mar 5 2019, CGO 2019 - Proceedings of the 2019 IEEE/ACM International Symposium on Code Generation and Optimization. Moseley, T., Jimborean, A. & Kandemir, M. T. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 217-228 12 p. 8661203. (CGO 2019 - Proceedings of the 2019 IEEE/ACM International Symposium on Code Generation and Optimization).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Managing code transformations for better performance portability

Teixeira, T. S. F. X., Gropp, W. & Padua, D., Nov 1 2019, In : International Journal of High Performance Computing Applications. 33, 6, p. 1290-1306 17 p.

Research output: Contribution to journalArticle

Towards an achievable performance for the loop nests

Shivam, A., Watkinson, N., Nicolau, A., Padua, D. & Veidenbaum, A. V., Jan 1 2019, Languages and Compilers for Parallel Computing - 31st International Workshop, LCPC 2018, Revised Selected Papers. Hall, M. & Sundar, H. (eds.). Springer, p. 70-77 8 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 11882 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2018

From High-Level Specification to High-Performance Code

Franchetti, F., Moura, J. M. F., Padua, D. A. & Dongarra, J., Nov 2018, In : Proceedings of the IEEE. 106, 11, p. 1875-1878 4 p., 8510981.

Research output: Contribution to journalReview article

2017

A DSL for Performance Orchestration

Teixeira, T. S. F. X., Padua, D. & Gropp, W., Oct 31 2017, Proceedings - 26th International Conference on Parallel Architectures and Compilation Techniques, PACT 2017. Institute of Electrical and Electronics Engineers Inc., 1 p. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT; vol. 2017-September).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

ANALYSIS OF MESSAGE SWITCHING WITH SHUFFLE-EXCHANGES IN MULTIPROCESSORS.

Lawrie, D. H. & Padua, D. A., Jan 1 2017, p. 116-123. 8 p.

Research output: Contribution to conferencePaper

High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control

Franchetti, F., Low, T. M., Mitsch, S., Mendoza, J. P., Gui, L., Phaosawasdi, A., Padua, D., Kar, S., Moura, J. M. F., Franusich, M., Johnson, J., Platzer, A. & Veloso, M. M., Apr 2017, In : IEEE Control Systems. 37, 2, p. 82-103 22 p., 7879901.

Research output: Contribution to journalArticle

LORE: A loop repository for the evaluation of compilers

Chen, Z., Gong, Z., Szaday, J. J., Wong, D. C., Padua, D., Nicolau, A., Veidenbaum, A. V., Watkinson, N., Sura, Z., Maleki, S., Torrellas, J. & DeJong, G., Dec 5 2017, Proceedings of the 2017 IEEE International Symposium on Workload Characterization, IISWC 2017. Institute of Electrical and Electronics Engineers Inc., p. 219-228 10 p. (Proceedings of the 2017 IEEE International Symposium on Workload Characterization, IISWC 2017; vol. 2017-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Welcome to PACT 2017

Iyer, R. & Padua, D., Oct 31 2017, In : Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT. 2017-September, p. x-xi

Research output: Contribution to journalEditorial

2016

DSMR: A shared and distributed memory algorithm for single-source shortest path problem

Maleki, S., Nguyen, D., Lenharth, A., Garzarán, M., Padua, D. & Pingali, K., Feb 27 2016, 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2016 - Proceedings. Association for Computing Machinery, 39. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP; vol. 12-16-March-2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

DSMR: A parallel algorithm for Single-Source Shortest Path problem

Maleki, S., Nguyen, D., Lenharth, A., Garzarán, M., Padua, D. & Pingali, K., Jun 1 2016, Proceedings of the 2016 International Conference on Supercomputing, ICS 2016. Association for Computing Machinery, a32. (Proceedings of the International Conference on Supercomputing; vol. 01-03-June-2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2015

Directive-based compilers for GPUs

Ghike, S., Gran, R., Garzarán, M. J. & Padua, D. A., Jan 1 2015, Languages and Compilers for Parallel Computing - 27th International Workshop, LCPC 2014, Revised Selected Papers. Brodman, J. & Tu, P. (eds.). Springer-Verlag Berlin Heidelberg, p. 19-35 17 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 8967).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tiled linear algebra a system for parallel graph algorithms

Maleki, S., Evans, G. C. & Padua, D. A., Jan 1 2015, Languages and Compilers for Parallel Computing - 27th International Workshop, LCPC 2014, Revised Selected Papers. Brodman, J. & Tu, P. (eds.). Springer-Verlag, p. 116-130 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 8967).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Vectorization of apply to reduce interpretation overhead of R

Wang, H., Padua, D. A. & Wu, P., Oct 23 2015, OOPSLA 2015 - Proceedings of the 2015 ACM SIGPLAN International Conference on Object-Oriented Programming Systems, Languages, and Applications. Eugster, P. & Aldrich, J. (eds.). Association for Computing Machinery, p. 400-415 16 p. (Proceedings of the Conference on Object-Oriented Programming Systems, Languages, and Applications, OOPSLA; vol. 25-30-Oct-2015).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2014

Hierarchically tiled array as a high-level abstraction for codelets

Yang, C. C., Pichel, J. C., Smith, A. R. & Padua, D. A., Apr 17 2014, Proceedings - 2014 4th Workshop on Data-Flow Execution Models for Extreme Scale Computing, DFM 2014. Institute of Electrical and Electronics Engineers Inc., p. 58-65 8 p. 7089032. (Proceedings - 2014 4th Workshop on Data-Flow Execution Models for Extreme Scale Computing, DFM 2014).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Optimal parallelogram selection for hierarchical tiling

Zhou, X., Garzaran, M. J. & Padua, D. A., Dec 1 2014, In : ACM Transactions on Architecture and Code Optimization. 11, 4, 58.

Research output: Contribution to journalArticle

Optimizing R VM: Allocation removal and path length reduction via interpreter-level specialization

Wang, H., Wu, P. & Padua, D., Jan 1 2014, Proceedings of the 12th ACM/IEEE International Symposium on Code Generation and Optimization, CGO 2014. Association for Computing Machinery, p. 295-305 11 p. (Proceedings of the 12th ACM/IEEE International Symposium on Code Generation and Optimization, CGO 2014).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Practices of PLDI

Boehm, H., Davidson, J., Fisher, K., Flanagan, C., Gibbons, J., Hall, M., Hutton, G., Padua, D., Tip, F., Vitek, J. & Wadler, P., Apr 2014, In : ACM SIGPLAN Notices. 49, 4, p. 33-38 6 p.

Research output: Contribution to journalArticle

Vector seeker: A tool for finding vector potential

Evans, G. C., Abraham, S., Kuhn, B. & Padua, D. A., Jan 1 2014, p. 41-48. 8 p.

Research output: Contribution to conferencePaper

2013

Hydra: Automatic algorithm exploration from linear algebra equations

Duchateau, A. X., Padua, D. & Barthou, D., May 6 2013, Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2013. 6494999. (Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2013).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Standards for graph algorithm primitives

Mattson, T., Bader, D., Berry, J., Buluc, A., Dongarra, J., Faloutsos, C., Feo, J., Gilbert, J., Gonzalez, J., Hendrickson, B., Kepner, J., Leiserson, C., Lumsdaine, A., Padua, D., Poole, S., Reinhardt, S., Stonebraker, M., Wallach, S. & Yoo, A., Jan 1 2013, 2013 IEEE High Performance Extreme Computing Conference, HPEC 2013. IEEE Computer Society, 6670338. (2013 IEEE High Performance Extreme Computing Conference, HPEC 2013).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2012

Hierarchical overlapped tiling

Zhou, X., Giacalone, J. P., Garzarán, M. J., Kuhn, R. H., Ni, Y. & Padua, D., Jul 9 2012, Proceedings - International Symposium on Code Generation and Optimization, CGO 2012. p. 207-218 12 p. (Proceedings - International Symposium on Code Generation and Optimization, CGO 2012).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Message from the program chairs

Padua, D., De Azevedo, R. J., Cascaval, C., Navaux, P. O. A., Van De Geijn, R. & Zhang, Y., Dec 1 2012, In : Proceedings - Symposium on Computer Architecture and High Performance Computing. p. x 6374807.

Research output: Contribution to journalEditorial

Optimization techniques for efficient HTA programs

Fraguela, B. B., Bikshandi, G., Guo, J., Garzarán, M. J., Padua, D. & Von Praun, C., Sep 1 2012, In : Parallel Computing. 38, 9, p. 465-484 20 p.

Research output: Contribution to journalArticle

Performance portability with the Chapel language

Sidelnik, A., Maleki, S., Chamberlain, B. L., Garzarán, M. J. & Padua, D., Oct 4 2012, Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS 2012. p. 582-594 13 p. 6267860. (Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS 2012).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Preface

D'Hollander, E., De Bosschere, K., Joubert, G. R., Padua, D. & Peters, F., 2012, Applications, Tools and Techniques on the Road to Exascale Computing. IOS Press BV, p. v-vi (Advances in Parallel Computing; vol. 22).

Research output: Chapter in Book/Report/Conference proceedingForeword/postscript

2011

An evaluation of vectorizing compilers

Maleki, S., Gao, Y., Garzarán, M. J., Wong, T. & Padua, D. A., Dec 1 2011, Proceedings - 2011 International Conference on Parallel Architectures and Compilation Techniques, PACT 2011. p. 372-382 11 p. 6113845. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

A parallel numerical solver using hierarchically tiled arrays

Brodman, J. C., Evans, G. C., Manguoglu, M., Sameh, A., Garzarán, M. J. & Padua, D., Mar 18 2011, Languages and Compilers for Parallel Computing - 23rd International Workshop, LCPC 2010, Revised Selected Papers. p. 46-61 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6548 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

NSF/IEEE-TCPP curriculum initiative on parallel and distributed computing - Core topics for undergraduates

Prasad, S. K., Kant, K., Robert, Y., Chtchelkanova, A., Das, S., La Salle, A., Le Blanc, R., Rosenberg, A., Dehne, F., Lumsdaine, M., Sahni, S., Gouda, M., Padua, D., Shirazi, B., Gupta, A., Parashar, M., Sussman, A., Jaja, J., Prasanna, V., Weems, C. & 1 others, Wu, J., Apr 19 2011, SIGCSE'11 - Proceedings of the 42nd ACM Technical Symposium on Computer Science Education. p. 617-618 2 p. (SIGCSE'11 - Proceedings of the 42nd ACM Technical Symposium on Computer Science Education).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Scheduling of stream-based real-time applications for heterogeneous systems

Virlet, B., Zhou, X., Giacalone, J. P., Kuhn, B., Garzarán, M. J. & Padua, D., May 10 2011, LCTES'11 - Proceedings of the ACM SIGPLAN/SIGBED 2011 Conference on Languages, Compilers, Tools and Theory for Embedded Systems. p. 1-10 10 p. (Proceedings of the ACM SIGPLAN Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES)).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2010

ACM SIGPLAN Notices: Message from the general chairs

Govindarajan, R. & Padua, D., May 1 2010, In : ACM SIGPLAN Notices. 45, 5

Research output: Contribution to journalEditorial

2009

Authors' Response

Hall, M., Padua, D. & Pingali, K., May 1 2009, In : Communications of the ACM. 52, 5, 1 p.

Research output: Contribution to journalLetter

Communication contention in APN list scheduling algorithm

Tang, X., Li, K. & Padua, D., Jan 1 2009, In : Science in China, Series F: Information Sciences. 52, 1, p. 59-69 11 p.

Research output: Contribution to journalArticle

Compiler research: The next 50 years

Hall, M., Padua, D. & Pingali, K., Feb 1 2009, In : Communications of the ACM. 52, 2, p. 60-67 8 p.

Research output: Contribution to journalArticle

Optimization of tele-immersion codes

Sidelnik, A., Sung, I. J., Wu, W., Garzarán, M. J., Hwu, W. M., Nahrstedt, K., Padua, D. & Patel, S. J., Jul 23 2009, Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-2. 1 p. (Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-2).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Task-parallel versus data-parallel library-based programming in multicore systems

Andrade, D., Fraguela, B. B., Brodman, J. & Padua, D., Sep 21 2009, Proceedings of the 17th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2009. p. 101-110 10 p. 4912921. (Proceedings of the 17th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2009).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Writing productive stencil codes with overlapped tiling

Guo, J., Bikshandi, G., Fraguela, B. B. & Padua, D., Jan 1 2009, In : Concurrency Computation Practice and Experience. 21, 1, p. 25-39 15 p.

Research output: Contribution to journalArticle

2008

Automatic generation of a parallel sorting algorithm

Garber, B. A., Hoeflinger, D., Li, X., Garzarán, M. J. & Padua, D., Sep 10 2008, IPDPS Miami 2008 - Proceedings of the 22nd IEEE International Parallel and Distributed Processing Symposium, Program and CD-ROM. 4536400. (IPDPS Miami 2008 - Proceedings of the 22nd IEEE International Parallel and Distributed Processing Symposium, Program and CD-ROM).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Design issues in parallel array languages for shared memory

Brodman, J., Fraguela, B. B., Garzarán, M. J. & Padua, D., Sep 5 2008, Embedded Computer Systems: Architectures, Modeling, and Simulation - 8th International Workshop, SAMOS 2008, Proceedings. p. 208-217 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5114 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

P-ray: A software suite for multi-core architecture characterization

Duchateau, A. X., Sidelnik, A., Garzarán, M. J. & Padua, D., Dec 1 2008, Languages and Compilers for Parallel Computing - 21st International Workshop, LCPC 2008, Revised Selected Papers. p. 187-201 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5335 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Programming with tiles

Guo, J., Bikshandit, G., Fraguela, B. B., Garzarán, M. J. & Padua, D., Dec 1 2008, PPoPP'08 - Proceedings of the 2008 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 1-10 10 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2007

Design and use of htalib - A library for hierarchically tiled arrays

Bikshandi, G., Guo, J., Von Praun, C., Tanase, G., Fraguela, B. B., Garzarán, M. J., Padua, D. & Rauchwerger, L., Dec 1 2007, Languages and Compilers for Parallel Computing - 19th International Workshop, LCPC 2006, Revised Papers. p. 17-32 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4382 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Optimizing sorting with machine learning algorithms

Li, X., Garzarán, M. J. & Padua, D., Sep 25 2007, Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM. 4228227. (Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2006

A language for the compact representation of multiple program versions

Donadio, S., Brodman, J., Roeder, T., Yotov, K., Barthou, D., Cohen, A., Garzarán, M. J., Padua, D. & Pingali, K., Jan 1 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. Springer-Verlag, p. 136-151 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Analytic models and empirical search: A hybrid approach to code optimization

Epshteyn, A., Garzaran, M. J., DeJong, G., Padua, D., Ren, G., Li, X., Yotov, K. & Pingali, K., Dec 1 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. p. 259-273 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Evaluating the impact of thread escape analysis on a memory consistency model-aware compiler

Wong, C. L., Sura, Z., Fang, X., Lee, K., Midkiff, S. P., Lee, J. & Padua, D. A., Jan 1 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. Springer-Verlag, p. 170-184 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution