Search results

  • 2007

    Design and use of htalib - A library for hierarchically tiled arrays

    Bikshandi, G., Guo, J., Von Praun, C., Tanase, G., Fraguela, B. B., Garzarán, M. J., Padua, D. & Rauchwerger, L., 2007, Languages and Compilers for Parallel Computing - 19th International Workshop, LCPC 2006, Revised Papers. Springer, p. 17-32 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4382 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Optimizing sorting with machine learning algorithms

    Li, X., Garzarán, M. J. & Padua, D., 2007, Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM. 4228227. (Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007; Abstracts and CD-ROM).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • 2006

    A language for the compact representation of multiple program versions

    Donadio, S., Brodman, J., Roeder, T., Yotov, K., Barthou, D., Cohen, A., Garzarán, M. J., Padua, D. & Pingali, K., 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. Springer, p. 136-151 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • Analytic models and empirical search: A hybrid approach to code optimization

    Epshteyn, A., Garzaran, M. J., DeJong, G., Padua, D., Ren, G., Li, X., Yotov, K. & Pingali, K., 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. p. 259-273 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Evaluating the impact of thread escape analysis on a memory consistency model-aware compiler

    Wong, C. L., Sura, Z., Fang, X., Lee, K., Midkiff, S. P., Lee, J. & Padua, D., 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. Springer, p. 170-184 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • Hierarchically tiled arrays for parallelism and locality

    Jia, G., Bikshandi, G., Hoeflinger, D., Almasi, G., Fraguela, B., Garzarán, M. J., Padua, D. & Von Praunt, C., 2006, 20th International Parallel and Distributed Processing Symposium, IPDPS 2006. IEEE Computer Society, 1639573. (20th International Parallel and Distributed Processing Symposium, IPDPS 2006; vol. 2006).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • In search of a program generator to implement generic transformations for high-performance computing

    Cohen, A., Donadio, S., Garzaran, M. J., Herrmann, C., Kiselyov, O. & Padua, D., Sep 2006, In: Science of Computer Programming. 62, 1, p. 25-46 22 p.

    Research output: Contribution to journalArticlepeer-review

  • Optimizing data permutations for SIMD devices

    Ren, G., Wu, P. & Padua, D., 2006, Conference on Programming Language Design and Implementation - PLAS 2006: Proceedings of the 2006 Programming Languages and Analysis for Security Workshop. Association for Computing Machinery, p. 118-131 14 p. (Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Optimizing data permutations for SIMD devices

    Ren, G., Wu, P. & Padua, D., 2006, PLDI 2006 - Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation. p. 118-131 14 p. (Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI); vol. 2006).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Optimizing data permutations for SIMD devices

    Ren, G., Wu, P. & Padua, D., 2006, In: ACM SIGPLAN Notices. 41, 6, p. 118-131 14 p.

    Research output: Contribution to journalArticlepeer-review

  • Programming for parallelism, and locality with hierarchically tiled arrays

    Bikshandi, G., Jia, G., Hoeflinger, D., Almasi, G., Fraguela, B. B., Garzarán, M. J., Padua, D. & Von Praun, C., 2006, Proceedings of the 2006 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP'06. Association for Computing Machinery, p. 48-57 10 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP; vol. 2006).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • 2005

    An empirical study on the vectorization of multimedia applications for multimedia extensions

    Ren, G., Wu, P. & Padua, D., 2005, Proceedings - 19th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2005. p. 89b 1419916. (Proceedings - 19th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2005; vol. 2005).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • A sampling-based framework for parallel data mining

    Cong, S., Han, J., Hoeflinger, J. & Padua, D., 2005, p. 255-265. 11 p.

    Research output: Contribution to conferencePaperpeer-review

  • Automatic implementation of programming language consistency models

    Sura, Z., Wong, C. L., Fang, X., Lee, J., Midkiff, S. P. & Padua, D., 2005, Languages and Compilers for Parallel Computing - 15th Workshop, LCPC 2002, Revised Papers. p. 172-187 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 2481 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Compiler techniques for high performance sequentially consistent java programs

    Sura, Z., Fang, X., Wong, C. L., Midkiff, S. P., Lee, J. & Padua, D., 2005, p. 2-13. 12 p.

    Research output: Contribution to conferencePaperpeer-review

  • HiLO: High level optimization of FFTs

    Rizzolo, N. & Padua, D., 2005, In: Lecture Notes in Computer Science. 3602, p. 238-252 15 p.

    Research output: Contribution to journalConference articlepeer-review

  • Implementation of parallel numerical algorithms using hierarchically tiled arrays

    Bikshandi, G., Fraguela, B. B., Guo, J., Garzarán, M. J., Almási, G., Moreira, J. & Padua, D., 2005, In: Lecture Notes in Computer Science. 3602, p. 87-101 15 p.

    Research output: Contribution to journalConference articlepeer-review

    Open Access
  • Is search really necessary to generate high-performance BLAS?

    Yotov, K., Li, X., Ren, G., Garzarán, M. J., Padua, D., Pingali, K. & Stodghill, P., Feb 2005, In: Proceedings of the IEEE. 93, 2, p. 358-385 28 p.

    Research output: Contribution to journalArticlepeer-review

  • Optimizing sorting with genetic algorithms

    Li, X., Garzarán, M. J. & Padua, D., 2005, Proceedings of the 2005 International Symposium on Code Generation and Optimization, CGO 2005. p. 99-110 12 p. 1402080. (Proceedings of the 2005 International Symposium on Code Generation and Optimization, CGO 2005; vol. 2005).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Parallel mining of closed sequential patterns

    Cong, S., Han, J. & Padua, D., 2005, p. 562-567. 6 p.

    Research output: Contribution to conferencePaperpeer-review

  • Special Issue on Program Generation, Optimization, and Platform Adaptation

    Moura, J. M. F., Püscheu, M., Padua, D. & Dongarra, J., Feb 2005, In: Proceedings of the IEEE. 93, 2, p. 211-214 4 p.

    Research output: Contribution to journalArticlepeer-review

  • SPIRAL: Code generation for DSP transforms

    Püschel, M., Moura, J. M. F., Johnson, J. R., Padua, D., Veloso, M. M., Singer, B. W., Xiong, J., Franchetti, F., Gačić, A., Voronenko, Y., Chen, K., Johnson, R. W. & Rizzolo, N., Feb 2005, In: Proceedings of the IEEE. 93, 2, p. 232-273 42 p.

    Research output: Contribution to journalArticlepeer-review

  • 2004

    A compiler for multiple memory models

    Midkiff, S. P., Lee, J. & Padua, D. A., 2004, In: Concurrency and Computation: Practice and Experience. 16, 2-3, p. 197-220 24 p.

    Research output: Contribution to journalArticlepeer-review

  • A dynamically tuned sorting library

    Li, X., Garzarán, M. J. & Padua, D., 2004, International Symposium on Code Generation and Optimization, CGO 2004. p. 111-122 12 p. (International Symposium on Code Generation and Optimization, CGO).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • A preliminary study on the vectorization of multimedia applications for multimedia extensions

    Ren, G., Wu, P. & Padua, D., 2004, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Rauchwerger, L. (ed.). Springer, p. 420-435 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 2958).

    Research output: Chapter in Book/Report/Conference proceedingChapter

  • Performance modeling and programming environments for petaflops computers and the blue gene machine

    Zheng, G., Wilmarth, T., Lawlor, O. S., Kalé, L. V., Adve, S., Padua, D. & Guebelle, P., 2004, Proceedings - 18th International Parallel and Distributed Processing Symposium, IPDPS 2004 (Abstracts and CD-ROM). p. 2743-2750 8 p. (Proceedings - International Parallel and Distributed Processing Symposium, IPDPS 2004 (Abstracts and CD-ROM); vol. 18).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Programming for locality and parallelism with hierarchically tiled arrays

    Almási, G., De Rose, L., Fraguela, B. B., Moreira, J. & Padua, D., 2004, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Rauchwerger, L. (ed.). Springer, p. 162-176 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 2958).

    Research output: Chapter in Book/Report/Conference proceedingChapter

    Open Access
  • SPIRAL: A generator for platform-adapted libraries of signal processing algorithms

    Püschel, M., Moura, J. M. F., Singer, B., Xiong, J., Johnson, J., Padua, D., Veloso, M. & Johnson, R. W., 2004, In: International Journal of High Performance Computing Applications. 18, 1, p. 21-45 25 p.

    Research output: Contribution to journalArticlepeer-review

  • The hierarchically tiled arrays programming approach

    Fraguela, B. B., Guo, J., Bikshandi, G., Garzarán, M. J., Almási, G., Moreira, J. & Padua, D., 2004.

    Research output: Contribution to conferencePaperpeer-review

    Open Access
  • The power of belady's algorithm in register allocation for long basic blocks

    Guo, J., Garzarán, M. J. & Padua, D., 2004, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Rauchwerger, L. (ed.). Springer, p. 374-389 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 2958).

    Research output: Chapter in Book/Report/Conference proceedingChapter

  • 2003

    A comparison of empirical and model-driven optimization

    Yotov, K., Li, X., Ren, G., Cibulskis, M., DeJong, G., Garzaran, M., Padua, D., Pingali, K., Stodghill, P. & Wu, P., 2003, p. 63-76. 14 p.

    Research output: Contribution to conferencePaperpeer-review

  • A comparison of empirical and model-driven optimization

    Yotov, K., Li, X., Ren, G., Cibulskis, M., DeJong, G., Garzaran, M., Padua, D., Pingali, K., Stodghill, P. & Wu, P., May 2003, In: ACM SIGPLAN Notices. 38, 5, p. 63-76 14 p.

    Research output: Contribution to journalConference articlepeer-review

  • Calculating stack distances efficiently

    Almási, G., Caşcaval, C. & Padua, D. A., Feb 2003, In: ACM SIGPLAN Notices. 38, 2 SUPPL., p. 37-43 7 p.

    Research output: Contribution to journalArticlepeer-review

  • Compiler techniques for the distribution of data and computation

    Navarro, A., Zapata, E. & Padua, D., Jun 2003, In: IEEE Transactions on Parallel and Distributed Systems. 14, 6, p. 545-562 18 p.

    Research output: Contribution to journalArticlepeer-review

  • Compiling for a hybrid programming model using the LMAD representation

    Zhu, J., Hoeflinger, J. & Padua, D., 2003, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Dietz, H. G. (ed.). Springer, p. 321-335 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 2624).

    Research output: Chapter in Book/Report/Conference proceedingChapter

  • Estimating Cache Misses and Locality Using Stack Distances

    Caşcaval, C. & Padua, D. A., 2003, p. 150-159. 10 p.

    Research output: Contribution to conferencePaperpeer-review

  • Induction variable analysis without idiom recognition: Beyond monotonicity

    Wu, P., Cohen, A. & Padua, D., 2003, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Dietz, H. G. (ed.). Springer, p. 427-441 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 2624).

    Research output: Chapter in Book/Report/Conference proceedingChapter

    Open Access
  • Programming the FlexRAM parallel intelligent memory system

    Fraguela, B. B., Renau, J., Feautrier, P., Padua, D. & Torrellas, J., 2003, p. 49-60. 12 p.

    Research output: Contribution to conferencePaperpeer-review

  • Programming the FlexRAM parallel intelligent memory system

    Fraguela, B. B., Renau, J., Featrier, P., Padua, D. & Torrellas, J., Oct 2003, In: ACM SIGPLAN Notices. 38, 10, p. 49-60 12 p.

    Research output: Contribution to journalArticlepeer-review

    Open Access
  • 2002

    An advanced compiler framework for non-cache-coherent multiprocessors

    Paek, Y., Navarro, A., Zapata, E., Hoeflinger, J. & Padua, D., Mar 2002, In: IEEE Transactions on Parallel and Distributed Systems. 13, 3, p. 241-259 19 p.

    Research output: Contribution to journalArticlepeer-review

  • Calculating stack distances efficiently

    Almási, G., Caşcaval, C. & Padua, D. A., Jun 16 2002, Proceedings of the 2002 Workshop on Memory System Performance, MSP 2002. Association for Computing Machinery, p. 37-43 7 p. (Proceedings of the 2002 Workshop on Memory System Performance, MSP 2002).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Efficient and precise array access analysis

    Paek, Y., Hoeflinger, J. & Padua, D., Jan 2002, In: ACM Transactions on Programming Languages and Systems. 24, 1, p. 65-109 45 p.

    Research output: Contribution to journalArticlepeer-review

  • Instance-wise points-to analysis for loop-based dependence testing

    Wu, P., Feautrier, P., Padua, D. & Sura, Z., 2002, p. 262-273. 12 p.

    Research output: Contribution to conferencePaperpeer-review

  • Is OpenMP for grids ?

    Eigenmann, R., Hoeflinger, J., Kuhn, R. H., Padua, D., Basumallik, A., Min, S. J. & Zhu, J., 2002, Proceedings - International Parallel and Distributed Processing Symposium, IPDPS 2002. Institute of Electrical and Electronics Engineers Inc., p. 171-178 8 p. 1016571. (Proceedings - International Parallel and Distributed Processing Symposium, IPDPS 2002).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • MaJIC: Compiling MATLAB for speed and responsiveness

    Almási, G. & Padua, D., 2002, p. 294-303. 10 p.

    Research output: Contribution to conferencePaperpeer-review

  • The Pensieve project: A compiler infrastructure for memory models

    Wong, C. L., Sura, Z., Fang, X., Midkiff, S. P., Lee, J. & Padua, D., 2002, Proceedings - International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN 2002. Hsu, D. F., Saldana, R. P. & Ibarra, O. H. (eds.). IEEE Computer Society, p. 239-244 6 p. 1004288. (Proceedings of the International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN; vol. 2002-January).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2001

    A java compiler for many memory models - Extended abstract

    Midkiff, S. P., Lee, J. & Padua, D. A., 2001, Proceedings of the 1st Java Virtual Machine Research and Technology Symposium, JVM 2001. USENIX Association, (Proceedings of the 1st Java Virtual Machine Research and Technology Symposium, JVM 2001).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • A synthesis of memory mechanisms for distributed architectures

    Zhu, J., Hoeflinger, J. & Padua, D., 2001, p. 13-22. 10 p.

    Research output: Contribution to conferencePaperpeer-review

  • Hiding relaxed memory consistency with a compiler

    Lee, J. & Padua, D. A., Aug 2001, In: IEEE Transactions on Computers. 50, 8, p. 824-833 10 p.

    Research output: Contribution to journalArticlepeer-review

  • MaJIC: A Matlab just-in-time Compiler

    Almasi, G. & Padua, D. A., 2001, Languages and Compilers for Parallel Computing - 13th International Workshop, LCPC 2000, Revised Papers. Ferrante, J., Midkiff, S. P., Moreira, J. E., Gupta, M., Chatterjee, S., Prins, J., Pugh, W. & Tseng, C.-W. (eds.). Springer, p. 68-81 14 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 2017).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution