David A Padua

1979 …2019
If you made any changes in Pure, your changes will be visible here soon.

Research Output 1979 2019

2019

Locus: A System and a Language for Program Optimization

Teixeira, S. F. X. T., Ancourt, C., Padua, D. A. & Gropp, W. D., Mar 5 2019, CGO 2019 - Proceedings of the 2019 IEEE/ACM International Symposium on Code Generation and Optimization. Moseley, T., Jimborean, A. & Kandemir, M. T. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 217-228 12 p. 8661203. (CGO 2019 - Proceedings of the 2019 IEEE/ACM International Symposium on Code Generation and Optimization).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Locus
Optimization
Compiler Optimization
Module
Program Transformation

Managing code transformations for better performance portability

Teixeira, T. S. F. X., Gropp, W. D. & Padua, D. A., Nov 1 2019, In : International Journal of High Performance Computing Applications. 33, 6, p. 1290-1306 17 p.

Research output: Contribution to journalArticle

Portability
Process Optimization
Optimization
Computing
Tuning
2018

From High-Level Specification to High-Performance Code

Franchetti, F., Moura, J. M. F., Padua, D. A. & Dongarra, J., Nov 2018, In : Proceedings of the IEEE. 106, 11, p. 1875-1878 4 p., 8510981.

Research output: Contribution to journalReview article

Computer systems
Specifications
Computer architecture
Particle accelerators
2017

A DSL for Performance Orchestration

Teixeira, T. S. F. X., Padua, D. A. & Gropp, W. D., Oct 31 2017, Proceedings - 26th International Conference on Parallel Architectures and Compilation Techniques, PACT 2017. Institute of Electrical and Electronics Engineers Inc., 1 p. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT; vol. 2017-September).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Orchestration
Domain-specific Languages
Optimization
Compiler
Baseline

ANALYSIS OF MESSAGE SWITCHING WITH SHUFFLE-EXCHANGES IN MULTIPROCESSORS.

Lawrie, D. H. & Padua, D. A., Jan 1 2017, In : Materials Engineering (Cleveland). p. 116-123 8 p.

Research output: Contribution to journalArticle

Data storage equipment
Switches

High-Assurance SPIRAL: End-to-End Guarantees for Robot and Car Control

Franchetti, F., Low, T. M., Mitsch, S., Mendoza, J. P., Gui, L., Phaosawasdi, A., Padua, D., Kar, S., Moura, J. M. F., Franusich, M., Johnson, J., Platzer, A. & Veloso, M. M., Apr 2017, In : IEEE Control Systems. 37, 2, p. 82-103 22 p., 7879901.

Research output: Contribution to journalArticle

Critical infrastructures
Power plants
Railroad cars
Robot
Robots

LORE: A loop repository for the evaluation of compilers

Chen, Z., Gong, Z., Szaday, J. J., Wong, D. C., Padua, D. A., Nicolau, A., Veidenbaum, A. V., Watkinson, N., Sura, Z., Maleki, S., Torrellas, J. & DeJong, G., Dec 5 2017, Proceedings of the 2017 IEEE International Symposium on Workload Characterization, IISWC 2017. Institute of Electrical and Electronics Engineers Inc., p. 219-228 10 p. (Proceedings of the 2017 IEEE International Symposium on Workload Characterization, IISWC 2017; vol. 2017-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Repository
Evaluation
Usefulness
Benchmark
Language

Welcome to PACT 2017

Iyer, R. & Padua, D., Oct 31 2017, In : Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT. 2017-September, p. x-xi

Research output: Contribution to journalEditorial

2016

DSMR: A parallel algorithm for Single-Source Shortest Path problem

Maleki, S., Nguyen, D., Lenharth, A., Garzarán, M., Padua, D. & Pingali, K., Jun 1 2016, Proceedings of the 2016 International Conference on Supercomputing, ICS 2016. Association for Computing Machinery, a32. (Proceedings of the International Conference on Supercomputing; vol. 01-03-June-2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Parallel algorithms
Data storage equipment
Communication

DSMR: A shared and distributed memory algorithm for single-source shortest path problem

Maleki, S., Nguyen, D., Lenharth, A., Garzaran, M. J., Padua, D. A. & Pingali, K., Feb 27 2016, 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2016 - Proceedings. Association for Computing Machinery, 39. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP; vol. 12-16-March-2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Data storage equipment
2015

Directive-based compilers for GPUs

Ghike, S., Gran, R., Garzarán, M. J. & Padua, D. A., Jan 1 2015, Languages and Compilers for Parallel Computing - 27th International Workshop, LCPC 2014, Revised Selected Papers. Brodman, J. & Tu, P. (eds.). Springer-Verlag, p. 19-35 17 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 8967).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Compiler
Notation
Productivity
Benchmark
Portability

Tiled linear algebra a system for parallel graph algorithms

Maleki, S., Evans, G. C. & Padua, D. A., Jan 1 2015, Languages and Compilers for Parallel Computing - 27th International Workshop, LCPC 2014, Revised Selected Papers. Brodman, J. & Tu, P. (eds.). Springer-Verlag, p. 116-130 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 8967).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Linear algebra
Graph Algorithms
Parallel Algorithms
kernel
Shortest Path Algorithm

Vectorization of apply to reduce interpretation overhead of R

Wang, H., Padua, D. A. & Wu, P., Oct 23 2015, OOPSLA 2015 - Proceedings of the 2015 ACM SIGPLAN International Conference on Object-Oriented Programming Systems, Languages, and Applications. Eugster, P. & Aldrich, J. (eds.). Association for Computing Machinery, p. 400-415 16 p. (Proceedings of the Conference on Object-Oriented Programming Systems, Languages, and Applications, OOPSLA; vol. 25-30-Oct-2015).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Code generation
2014

Hierarchically tiled array as a high-level abstraction for codelets

Yang, C. C., Pichel, J. C., Smith, A. R. & Padua, D. A., Apr 17 2014, Proceedings - 2014 4th Workshop on Data-Flow Execution Models for Extreme Scale Computing, DFM 2014. Institute of Electrical and Electronics Engineers Inc., p. 58-65 8 p. 7089032. (Proceedings - 2014 4th Workshop on Data-Flow Execution Models for Extreme Scale Computing, DFM 2014).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Scalability
Electric power utilization
Energy utilization
Tuning

Optimal parallelogram selection for hierarchical tiling

Zhou, X., Garzaran, M. J. & Padua, D. A., Dec 1 2014, In : ACM Transactions on Architecture and Code Optimization. 11, 4, 58.

Research output: Contribution to journalArticle

Tile
Hierarchical systems
Data storage equipment

Optimizing R VM: Allocation removal and path length reduction via interpreter-level specialization

Wang, H., Wu, P. & Padua, D. A., Jan 1 2014, Proceedings of the 12th ACM/IEEE International Symposium on Code Generation and Optimization, CGO 2014. Association for Computing Machinery, p. 295-305 11 p. (Proceedings of the 12th ACM/IEEE International Symposium on Code Generation and Optimization, CGO 2014).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Glues
Path Length
Specialization
Programming
Data analysis

Practices of PLDI

Boehm, H., Davidson, J., Fisher, K., Flanagan, C., Gibbons, J., Hall, M., Hutton, G., Padua, D., Tip, F., Vitek, J. & Wadler, P., Apr 2014, In : ACM SIGPLAN Notices. 49, 4, p. 33-38 6 p.

Research output: Contribution to journalArticle

Coastal zones

Vector seeker: A tool for finding vector potential

Evans, G. C., Abraham, S., Kuhn, B. & Padua, D. A., Jan 1 2014, p. 41-48. 8 p.

Research output: Contribution to conferencePaper

Vector Potential
Vectorization
Parallelism
Compiler
Coding
2013

Hydra: Automatic algorithm exploration from linear algebra equations

Duchateau, A. X., Padua, D. A. & Barthou, D., May 6 2013, Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2013. 6494999. (Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2013).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Linear algebra
Tile
Processing
Tiling
Term

Standards for graph algorithm primitives

Mattson, T., Bader, D., Berry, J., Buluc, A., Dongarra, J., Faloutsos, C., Feo, J., Gilbert, J., Gonzalez, J., Hendrickson, B., Kepner, J., Leiserson, C., Lumsdaine, A., Padua, D., Poole, S., Reinhardt, S., Stonebraker, M., Wallach, S. & Yoo, A., Jan 1 2013, 2013 IEEE High Performance Extreme Computing Conference, HPEC 2013. IEEE Computer Society, 6670338. (2013 IEEE High Performance Extreme Computing Conference, HPEC 2013).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2012

Hierarchical overlapped tiling

Zhou, X., Giacalone, J. P., Garzarán, M. J., Kuhn, R. H., Ni, Y. & Padua, D., Jul 9 2012, Proceedings - International Symposium on Code Generation and Optimization, CGO 2012. p. 207-218 12 p. (Proceedings - International Symposium on Code Generation and Optimization, CGO 2012).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Communication
Fusion reactions

Message from the program chairs

Padua, D. A., De Azevedo, R. J., Cascaval, C., Navaux, P. O. A., Van De Geijn, R. & Zhang, Y., Dec 1 2012, In : Proceedings - Symposium on Computer Architecture and High Performance Computing. 6374807.

Research output: Contribution to journalEditorial

Optimization techniques for efficient HTA programs

Fraguela, B. B., Bikshandi, G., Guo, J., Garzarán, M. J., Padua, D. & Von Praun, C., Sep 1 2012, In : Parallel Computing. 38, 9, p. 465-484 20 p.

Research output: Contribution to journalArticle

Optimization Techniques
Parallel programming
Object oriented programming
Tile
Productivity

Performance portability with the Chapel language

Sidelnik, A., Maleki, S., Chamberlain, B. L., Garzarán, M. J. & Padua, D., Oct 4 2012, Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS 2012. p. 582-594 13 p. 6267860. (Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS 2012).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Machine oriented languages
Throughput
Parallel architectures
Computer programming
Productivity

Preface

D'Hollander, E., De Bosschere, K., Joubert, G. R., Padua, D. A. & Peters, F., 2012, Applications, Tools and Techniques on the Road to Exascale Computing. IOS Press BV, Vol. 22. (Advances in Parallel Computing; vol. 22).

Research output: Chapter in Book/Report/Conference proceedingForeword/postscript

2011

An evaluation of vectorizing compilers

Maleki, S., Gao, Y., Garzarán, M. J., Wong, T. & Padua, D. A., Dec 1 2011, Proceedings - 2011 International Conference on Parallel Architectures and Compilation Techniques, PACT 2011. p. 372-382 11 p. 6113845. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Compiler
Vectorization
Evaluation
Benchmark
High level languages

A parallel numerical solver using hierarchically tiled arrays

Brodman, J. C., Evans, G. C., Manguoglu, M., Sameh, A., Garzarán, M. J. & Padua, D., Mar 18 2011, Languages and Compilers for Parallel Computing - 23rd International Workshop, LCPC 2010, Revised Selected Papers. p. 46-61 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6548 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Data storage equipment
Memory Model
Distributed Memory
Tile
Shared Memory

NSF/IEEE-TCPP curriculum initiative on parallel and distributed computing - Core topics for undergraduates

Prasad, S. K., Kant, K., Robert, Y., Chtchelkanova, A., Das, S., La Salle, A., Le Blanc, R., Rosenberg, A., Dehne, F., Lumsdaine, M., Sahni, S., Gouda, M., Padua, D. A., Shirazi, B., Gupta, A., Parashar, M., Sussman, A., Jaja, J., Prasanna, V., Weems, C. & 1 others, Wu, J., Apr 19 2011, SIGCSE'11 - Proceedings of the 42nd ACM Technical Symposium on Computer Science Education. p. 617-618 2 p. (SIGCSE'11 - Proceedings of the 42nd ACM Technical Symposium on Computer Science Education).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Distributed computer systems
Parallel processing systems
Curricula
curriculum
Processing

Scheduling of stream-based real-time applications for heterogeneous systems

Virlet, B., Zhou, X., Giacalone, J. P., Kuhn, B., Garzaran, M. J. & Padua, D. A., May 10 2011, LCTES'11 - Proceedings of the ACM SIGPLAN/SIGBED 2011 Conference on Languages, Compilers, Tools and Theory for Embedded Systems. p. 1-10 10 p. (Proceedings of the ACM SIGPLAN Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES)).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Mobile devices
Scheduling
Heuristic algorithms
Scheduling algorithms
Electric power utilization
2010

ACM SIGPLAN Notices: Message from the general chairs

Govindarajan, R. & Padua, D., May 1 2010, In : ACM SIGPLAN Notices. 45, 5

Research output: Contribution to journalEditorial

2009

Authors' Response

Hall, M., Padua, D. & Pingali, K., May 1 2009, In : Communications of the ACM. 52, 5, 1 p.

Research output: Contribution to journalLetter

Communication contention in APN list scheduling algorithm

Tang, X., Li, K. & Padua, D. A., Jan 1 2009, In : Science in China, Series F: Information Sciences. 52, 1, p. 59-69 11 p.

Research output: Contribution to journalArticle

Scheduling algorithms
Scheduling
Communication
Parallel algorithms
Data communication systems

Compiler research: The next 50 years

Hall, M., Padua, D. & Pingali, K., Feb 1 2009, In : Communications of the ACM. 52, 2, p. 60-67 8 p.

Research output: Contribution to journalArticle

Parallel programming
Technical presentations
Computer science
Education
Industry

Optimization of tele-immersion codes

Sidelnik, A., Sung, I. J., Wu, W., Garzarán, M. J., Hwu, W. M., Nahrstedt, K., Padua, D. & Patel, S. J., Jul 23 2009, Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-2. 1 p. (Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-2).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Computer vision
Parallel programming
Tuning
Productivity
Graphics processing unit

Task-parallel versus data-parallel library-based programming in multicore systems

Andrade, D., Fraguela, B. B., Brodman, J. & Padua, D. A., 2009, Proceedings of the 17th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2009. p. 101-110 10 p. 4912921

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Multicore programming
Data structures
Scheduling

Writing productive stencil codes with overlapped tiling

Guo, J., Bikshandi, G., Fraguela, B. B. & Padua, D., Jan 1 2009, In : Concurrency Computation Practice and Experience. 21, 1, p. 25-39 15 p.

Research output: Contribution to journalArticle

Tiling
Productivity
Tile
Communication
kernel
2008

Automatic generation of a parallel sorting algorithm

Garber, B. A., Hoeflinger, D., Li, X., Garzaran, M. J. & Padua, D. A., Sep 10 2008, IPDPS Miami 2008 - Proceedings of the 22nd IEEE International Parallel and Distributed Processing Symposium, Program and CD-ROM. 4536400. (IPDPS Miami 2008 - Proceedings of the 22nd IEEE International Parallel and Distributed Processing Symposium, Program and CD-ROM).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sorting
Data storage equipment
Parallel algorithms

Design issues in parallel array languages for shared memory

Brodman, J., Fraguela, B. B., Garzarán, M. J. & Padua, D., Sep 5 2008, Embedded Computer Systems: Architectures, Modeling, and Simulation - 8th International Workshop, SAMOS 2008, Proceedings. p. 208-217 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5114 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tile
Shared Memory
Data Parallelism
Data storage equipment
Parallel programming

P-ray: A software suite for multi-core architecture characterization

Duchateau, A. X., Sidelnik, A., Garzarán, M. J. & Padua, D., Dec 1 2008, Languages and Compilers for Parallel Computing - 21st International Workshop, LCPC 2008, Revised Selected Papers. p. 187-201 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5335 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Half line
Hardware
Cache
Computer architecture
Software

Programming with tiles

Guo, J., Bikshandit, G., Fraguela, B. B., Garzaran, M. J. & Padua, D. A., Dec 1 2008, PPoPP'08 - Proceedings of the 2008 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 1-10 10 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tile
Natural sciences computing
Data storage equipment
Communication
2007

Design and use of htalib - A library for hierarchically tiled arrays

Bikshandi, G., Guo, J., Von Praun, C., Tanase, G., Fraguela, B. B., Garzarán, M. J., Padua, D. & Rauchwerger, L., Dec 1 2007, Languages and Compilers for Parallel Computing - 19th International Workshop, LCPC 2006, Revised Papers. p. 17-32 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4382 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Data structures
Tiling
Adaptive Mesh Refinement
MapReduce
C++

Optimizing sorting with machine learning algorithms

Li, X., Garzaran, M. J. & Padua, D. A., Sep 25 2007, Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM. 4228227. (Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sorting algorithm
Sorting
Learning algorithms
Learning systems
Learning Algorithm
2006

A language for the compact representation of multiple program versions

Donadio, S., Brodman, J., Roeder, T., Yotov, K., Barthou, D., Cohen, A., Garzarán, M. J., Padua, D. & Pingali, K., Jan 1 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. Springer-Verlag, p. 136-151 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Generator
Compiler
Annotation
High Performance
Tend

Analytic models and empirical search: A hybrid approach to code optimization

Epshteyn, A., Garzaran, M. J., DeJong, G., Padua, D., Ren, G., Li, X., Yotov, K. & Pingali, K., Dec 1 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. p. 259-273 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Hybrid Approach
Empirical Model
Optimization
Information Content
Compiler

Evaluating the impact of thread escape analysis on a memory consistency model-aware compiler

Wong, C. L., Sura, Z., Fang, X., Lee, K., Midkiff, S. P., Lee, J. & Padua, D. A., Jan 1 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. Springer-Verlag, p. 170-184 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Compiler
Thread
Memory Model
Data storage equipment
Synchronization

Hierarchically tiled arrays for parallelism and locality

Jia, G., Bikshandi, G., Hoeflinger, D., Almasi, G., Fraguela, B., Garzaran, M. J., Padua, D. A. & Von Praunt, C., Jan 1 2006, 20th International Parallel and Distributed Processing Symposium, IPDPS 2006. IEEE Computer Society, 1639573. (20th International Parallel and Distributed Processing Symposium, IPDPS 2006; vol. 2006).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

MATLAB
Parallel programming
Parallel algorithms
Data structures

In search of a program generator to implement generic transformations for high-performance computing

Cohen, A., Donadio, S., Garzaran, M. J., Herrmann, C., Kiselyov, O. & Padua, D. A., Sep 1 2006, In : Science of Computer Programming. 62, 1, p. 25-46 22 p.

Research output: Contribution to journalArticle

Automatic programming
Computer programming languages
Memory architecture
Microprocessor chips
Productivity