Wen-Mei W Hwu

1984 …2019

Research output per year

If you made any changes in Pure these will be visible here soon.

Research Output

Multilevel granularity parallelism synthesis on FPGAs

Papakonstantinou, A., Liang, Y., Stratton, J. A., Gururaj, K., Chen, D., Hwu, W-M. W. & Cong, J., Jun 17 2011, Proceedings - IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2011. p. 178-185 8 p. 5771270. (Proceedings - IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2011).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

More IMPATIENT: A gridding-accelerated Toeplitz-based strategy for non-Cartesian high-resolution 3D MRI on GPUs

Gai, J., Obeid, N., Holtrop, J. L., Wu, X. L., Lam, F., Fu, M., Haldar, J. P., Hwu, W. M. W., Liang, Z. P. & Sutton, B. P., May 2013, In : Journal of Parallel and Distributed Computing. 73, 5, p. 686-697 12 p.

Research output: Contribution to journalArticle

Modulo scheduling of loops in control-intensive non-numeric programs

Lavery, D. M. & Hwu, W-M. W., Dec 1 1996, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 126-137 12 p.

Research output: Contribution to journalConference article

Modulo Schedule Buffers

Merten, M. C. & Hwu, W-M. W., 2001, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 138-149 12 p.

Research output: Contribution to journalArticle

Modular interprocedural pointer analysis using access paths: Design, implementation, and evaluation

Cheng, B. C. & Hwu, W. W., Dec 1 2000, p. 57-69. 13 p.

Research output: Contribution to conferencePaper

MLModelScope: Evaluate and introspect cognitive pipelines

Li, C., Dakkak, A., Xiong, J. & Hwu, W. M., Jul 2019, Proceedings - 2019 IEEE World Congress on Services, SERVICES 2019. Chang, C. K., Chen, P., Goul, M., Oyama, K., Reiff-Marganiec, S., Sun, Y., Wang, S. & Wang, Z. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 335-338 4 p. 8817116. (Proceedings - 2019 IEEE World Congress on Services, SERVICES 2019).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Message from the program chair

Hwu, W. M., Oct 1 2008, In : Proceedings - International Symposium on Computer Architecture. 4556707.

Research output: Contribution to journalEditorial

MemXCT: Memory-centric X-ray CT reconstruction with massive parallelization

Hidayetolu, M., Biçer, T., De Gonzalo, S. G., Ren, B., Gürsoy, D., Kettimuthu, R., Foster, I. T. & Hwu, W. M. W., Nov 17 2019, Proceedings of SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Computer Society, a85. (International Conference for High Performance Computing, Networking, Storage and Analysis, SC).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs

Stratton, J. A., Stone, S. S. & Hwu, W-M. W., Dec 1 2008, Languages and Compilers for Parallel Computing - 21st International Workshop, LCPC 2008, Revised Selected Papers. p. 16-30 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5335 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Mapping high-level programming languages to OpenCL 2.0: A compiler writer's perspective

Sung, I. J., Chung, W. H., Lee, Y. W. & Hwu, W. M., May 18 2015, Heterogeneous Computing with OpenCL 2.0: Third Edition. Elsevier Inc., p. 249-272 24 p.

Research output: Chapter in Book/Report/Conference proceedingChapter

Long time-scale simulations of in vivo diffusion using GPU hardware

Roberts, E., Stone, J. E., Sepúlveda, L., Hwu, W-M. W. & Luthey-Schulten, Z. A., Nov 25 2009, IPDPS 2009 - Proceedings of the 2009 IEEE International Parallel and Distributed Processing Symposium. 5160930. (IPDPS 2009 - Proceedings of the 2009 IEEE International Parallel and Distributed Processing Symposium).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Locality-centric thread scheduling for bulk-synchronous programming models on CPU architectures

Kim, H. S., Hajj, I. E., Stratton, J., Lumetta, S. S. & Hwu, W-M. W., Mar 3 2015, Proceedings of the 2015 IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2015. Institute of Electrical and Electronics Engineers Inc., p. 257-268 12 p. 7054205. (Proceedings of the 2015 IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2015).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Large inverse-scattering solutions with DBIM on GPU-enabled supercomputers

Hidayetogglu, M., Pearson, C., Chew, W. C., Gurel, L. & Hwu, W-M. W., May 1 2017, 2017 International Applied Computational Electromagnetics Society Symposium - Italy, ACES 2017. Institute of Electrical and Electronics Engineers Inc., 7916310. (2017 International Applied Computational Electromagnetics Society Symposium - Italy, ACES 2017).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

KLAP: Kernel launch aggregation and promotion for optimizing dynamic parallelism

Hajj, I. E., Gomez-Luna, J., Li, C., Chang, L. W., Milojicic, D. & Hwu, W. M., Dec 14 2016, MICRO 2016 - 49th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 7783716. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO; vol. 2016-December).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Java bytecode to native code translation: The Caffeine prototype and preliminary results

Hsieh, C. H. A., Gyllenhaal, J. C. & Hwu, W-M. W., 1996, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 90-97 8 p.

Research output: Contribution to journalArticle

Iteration disambiguation for parallelism identification in time-sliced applications

Ryoo, S., Rodrigues, C. I. & Hwu, W. M. W., Oct 27 2008, Languages and Compilers for Parallel Computing - 20th International Workshop, LCPC 2007, Revised Selected Papers. p. 110-124 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5234 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Introduction to data parallelism and CUDA C

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 41-62 22 p.

Research output: Chapter in Book/Report/Conference proceedingChapter

Introduction

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 1-21 21 p.

Research output: Chapter in Book/Report/Conference proceedingChapter

Introduction

Hwu, W-M. W., Dec 4 2015, Heterogeneous System Architecture: A New Compute Platform Infrastructure. Elsevier Inc., p. 1-5 5 p.

Research output: Chapter in Book/Report/Conference proceedingChapter

Introduction

Hwu, W. M. W., Dec 1 2012, GPU Computing Gems Jade Edition. Elsevier Inc., p. xv-xvi

Research output: Chapter in Book/Report/Conference proceedingForeword/postscript

Introduction

Hwu, W-M. W., Dec 1 2011, GPU Computing Gems Emerald Edition. Elsevier Inc.

Research output: Chapter in Book/Report/Conference proceedingForeword/postscript

Interpretable and globally optimal prediction for textual grounding using image concepts

Yeh, R. A., Xiong, J., Hwu, W. M. W., Do, M. N. & Schwing, A. G., Jan 1 2017, In : Advances in Neural Information Processing Systems. 2017-December, p. 1913-1923 11 p.

Research output: Contribution to journalConference article

Interferometric synthetic aperture microscopy with computational adaptive optics for high-resolution tomography of scattering tissue

Adie, S. G., Ahmad, A., Shemonski, N., Graf, B. W., Kim, H., Hwu, W. M. W., Carney, P. S. & Boppart, S. A., 2012, Biomedical Optics, BIOMED 2012.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Integrated predicated and speculative execution in the IMPACT EPIC architecture

August, D. I., Connors, D. A., Mahlke, S. A., Sias, J. W., Crozier, K. M., Cheng, B. C., Eaton, P. R., Olaniran, Q. B. & Hwu, W-M. W., Jan 1 1998, In : Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA. p. 227-237 11 p.

Research output: Contribution to journalConference article

In-place transposition of rectangular matrices on accelerators

Sung, I. J., Gómez-Luna, J., González-Linares, J. M., Guil, N. & Hwu, W-M. W., Aug 2014, In : ACM SIGPLAN Notices. 49, 8, p. 207-218 12 p.

Research output: Contribution to journalArticle

In-place transposition of rectangular matrices on accelerators

Sung, I. J., Gómez-Luna, J., González-Linares, J. M., Guil, N. & Hwu, W-M. W., Mar 10 2014, PPoPP 2014 - Proceedings of the 2014 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 207-218 12 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

In-Place Matrix Transposition on GPUs

Gomez-Luna, J., Sung, I. J., Chang, L. W., Gonzalez-Linares, J. M., Guil, N. & Hwu, W. M. W., Mar 1 2016, In : IEEE Transactions on Parallel and Distributed Systems. 27, 3, p. 776-788 13 p., 7059219.

Research output: Contribution to journalArticle

In-place data sliding algorithms for many-core architectures

Luna, J. G., Chang, L. W., Sung, I. J., Hwu, W-M. W. & Guil, N., Dec 8 2015, Proceedings - 2015 44th International Annual Conference on Parallel Processing, ICPP 2015. Institute of Electrical and Electronics Engineers Inc., p. 210-219 10 p. 7349576. (Proceedings of the International Conference on Parallel Processing; vol. 2015-December).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Inline Function Expansion for Compiling C Programs

Chang, P. P. & Hwu, W. W., Jun 21 1989, In : ACM SIGPLAN Notices. 24, 7, p. 246-257 12 p.

Research output: Contribution to journalArticle

Incremental compiler transformations for multiple instruction retry

Chen, SK. K., Alewine, N. J., Fuchs, W. K. & Hwu, WM. W., Dec 1994, In : Software: Practice and Experience. 24, 12, p. 1179-1198 20 p.

Research output: Contribution to journalArticle

Improving static branch prediction in a compiler

Deitrich, B. L., Chen, B. C. & Hwu, W-M. W., Jan 1 1998, Proceedings - 1998 International Conference on Parallel Architectures and Compilation Techniques, PACT 1998. Institute of Electrical and Electronics Engineers Inc., p. 214-221 8 p. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Improved Superblock optimization in GCC

Kidd, R. & Hwu, W-M. W., 2006, Proceedings of the GCC Developers' Summit 2006. p. 85-96 12 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Importance of heap specialization in pointer analysis

Nystrom, E. M., Kim, H. S. & Hwu, W-M. W., Sep 29 2004, p. 43-48. 6 p.

Research output: Contribution to conferencePaper

Implicitly parallel programming models for thousand-core microprocessors

Hwu, W-M. W., Ryoo, S., Ueng, S. Z., Keim, J. H., Gelado, I., Stone, S. S., Kidd, R. E., Baghsorkhi, S. S., Mahesri, A. A., Tsao, S. C., Navarro, N., Lumetta, S. S., Frank, M. I. & Patel, S. J., Aug 2 2007, 2007 44th ACM/IEEE Design Automation Conference, DAC'07. p. 754-759 6 p. 4261284. (Proceedings - Design Automation Conference).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Implementing neural machine translation with bi-directional GRU and attention mechanism on FPGAs using HLS

Li, Q., Zhang, X., Xiong, J. J., Hwu, W-M. W. & Chen, D., Jan 21 2019, ASP-DAC 2019 - 24th Asia and South Pacific Design Automation Conference. Institute of Electrical and Electronics Engineers Inc., p. 693-698 6 p. (Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Implementing a GPU programming model on a non-GPU accelerator architecture

Kofsky, S. M., Johnson, D. R., Stratton, J. A., Hwu, W. M. W., Patel, S. J. & Lumetta, S. S., Mar 8 2012, Computer Architecture - ISCA 2010 International Workshops, A4MMC, AMAS-BT, EAMA, WEED, WIOSCA, Revised Selected Papers. p. 40-51 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6161 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Impatient MRI: Illinois Massively Parallel Acceleration Toolkit for image reconstruction with enhanced throughput in MRI

Wu, X. L., Gai, J., Lam, F., Fu, M., Haldar, J. P., Zhuo, Y., Liang, Z. P., Hwu, W. M. & Sutton, B. P., Nov 2 2011, 2011 8th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, ISBI'11. p. 69-72 4 p. 5872356. (Proceedings - International Symposium on Biomedical Imaging).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

IMPACT: An architectural framework for multiple-instruction-issue processors

Chang, P. P., Mahlke, S. A., Chen, W. Y., Warter, N. J. & Hwu, W-M. W., May 1 1991, Conference Proceedings - Annual Symposium on Computer Architecture. Publ by IEEE, p. 266-275 10 p. (Conference Proceedings - Annual Symposium on Computer Architecture).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

HPS papers: A retrospective

Patt, Y. N., Hwu, W. M. W., Melvin, S. W. & Shebanow, M. C., Jan 1 2016, In : IEEE Micro. 36, 4, p. 76-79 4 p., 7542473.

Research output: Contribution to journalArticle

HPSM2: A REFINED SINGLE-CHIP MICROENGINE.

Hwu, W-M. W. & Patt, Y. N., 1988, Proceedings of the Hawaii International Conference on System Science. Hoevel, L. W. & NCR Corp, D. (eds.). IEEE, p. 30-40 11 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

HPSM, A HIGH PERFORMANCE RESTRICTED DATA FLOW ARCHITECTURE HAVING MINIMAL FUNCTIONALITY.

Hwu, W-M. W. & Patt, Y. N., 1986, Conference Proceedings - Annual Symposium on Computer Architecture. IEEE, p. 297-306 10 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

HPS IMPLEMENTATION OF VAX; INITIAL DESIGN AND ANALYSIS.

Hwu, W-M. W., Melvin, S., Shebanow, M., Chen, C., Wei, J. J. & Patt, Y., 1986, In : Proceedings of the Hawaii International Conference on System Science. 1, p. 282-291 10 p.

Research output: Contribution to journalArticle

HPS, A NEW MICROARCHITECTURE: RATIONALE AND INTRODUCTION.

Patt, Y. N., Hwo, W. M. & Shebanow, M. C., Dec 1 1985, MICRO: Annual Microprogramming Workshop. ACM, p. 103-108 6 p. (MICRO: Annual Microprogramming Workshop).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

History of GPU computing

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 23-39 17 p.

Research output: Chapter in Book/Report/Conference proceedingChapter

High-throughput Ant Colony Optimization on graphics processing units

Cecilia, J. M., Llanes, A., Abellán, J. L., Gómez-Luna, J., Chang, L. W. & Hwu, W. M. W., Mar 2018, In : Journal of Parallel and Distributed Computing. 113, p. 261-274 14 p.

Research output: Contribution to journalArticle

High-speed interferometric synthetic aperture microscopy on a graphics processing unit

Ahmad, A., Shemonski, N., Adie, S. G., Kim, H., Hwu, W. M. W., Carney, P. S. & Boppart, S. A., 2012, Frontiers in Optics, FIO 2012.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

High-performance CUDA kernel execution on FPGAs

Papakonstantinou, A., Gururaj, K., Stratton, J. A., Chen, D., Cong, J. & Hwu, W-M. W., Nov 24 2009, ICS'09 - Proceedings of the 23rd International Conference on Supercomputing. p. 515-516 2 p. 1542357. (Proceedings of the International Conference on Supercomputing).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

High-performance computing with accelerators

Kindratenko, V., Wilhelmson, R., Brunner, R., Martíez, T. J. & Hwu, W. M., Jul 1 2010, In : Computing in Science and Engineering. 12, 4, p. 12-16 5 p., 5492949.

Research output: Contribution to journalEditorial