Wen-Mei W Hwu

Research Output

Conference article

Field-testing IMPACT EPIC research results in Itanium 2

Sias, J. W., Ueng, S. Z., Kent, G. A., Steiner, I. M. & Hwu, W. M. W., Oct 8 2004, In : Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA. 31, p. 26-37 12 p.

Research output: Contribution to journal › Conference article

Hardware-driven profiling scheme for identifying program hot spots to support runtime optimization

Merten, M. C., Trick, A. R., George, C. N., Gyllenhaal, J. C. & Hwu, W. M. W., Jan 1 1999, In : Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA. p. 136-147 12 p.

Research output: Contribution to journal › Conference article

Hardware mechanism for dynamic extraction and relayout of program hot spots

Merten, M. C., Trick, A. R., Nystrom, E. M., Barnes, R. D. & Hwu, W. M. W., 2000, In : Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA. p. 59-70 12 p.

Research output: Contribution to journal › Conference article

Integrated predicated and speculative execution in the IMPACT EPIC architecture

August, D. I., Connors, D. A., Mahlke, S. A., Sias, J. W., Crozier, K. M., Cheng, B. C., Eaton, P. R., Olaniran, Q. B. & Hwu, W-M. W., Jan 1 1998, In : Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA. p. 227-237 11 p.

Research output: Contribution to journal › Conference article

Interpretable and globally optimal prediction for textual grounding using image concepts

Yeh, R. A., Xiong, J., Hwu, W. M. W., Do, M. N. & Schwing, A. G., Jan 1 2017, In : Advances in Neural Information Processing Systems. 2017-December, p. 1913-1923 11 p.

Research output: Contribution to journal › Conference article

Modulo scheduling of loops in control-intensive non-numeric programs

Lavery, D. M. & Hwu, W-M. W., Dec 1 1996, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 126-137 12 p.

Research output: Contribution to journal › Conference article

Optimization of machine descriptions for efficient use

Gyllenhaal, J. C., Hwu, W. M. W. & Rau, B. R., Dec 1 1996, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 349-358 10 p.

Research output: Contribution to journal › Conference article

Program decision logic approach to predicated execution

August, D. I., Sias, J. W., Puiatti, J. M., Mahlke, S. A., Connors, D. A., Crozier, K. M. & Hwu, W. M. W., Jan 1 1999, In : Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA. p. 208-219 12 p.

Research output: Contribution to journal › Conference article

Region-based compilation: an introduction and motivation

Hank, R. E., Hwu, W. M. W. & Rau, B. R., Jan 1 1995, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 158-168 11 p.

Research output: Contribution to journal › Conference article

Run-time adaptive cache hierarchy management via reference analysis

Johnson, T. L. & Hwu, W. M. W., Jan 1 1997, In : Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA. p. 315-326 12 p.

Research output: Contribution to journal › Conference article

Run-time adaptive cache management

Johnson, T. L., Connors, D. A. & Hwu, W. M. W., Jan 1 1998, In : Proceedings of the Hawaii International Conference on System Sciences. 7, p. 774-775 2 p.

Research output: Contribution to journal › Conference article

Simulation study of simultaneous vector prefetch performance in multiprocessor memory subsystems

Hwu, W. M. W. & Conte, T. M., May 1 1989, In : Performance Evaluation Review. 17, 1, 1 p.

Research output: Contribution to journal › Conference article

Speculative hedge: Regulating compile-time speculation against profile variations

Deitrich, B. L. & Hwu, W-M. W., Dec 1 1996, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 70-79 10 p.

Research output: Contribution to journal › Conference article

Trace selection for compiling large C application programs to microcode

Chang, P. P. & Hwu, W-M. W., Dec 1 1988, In : MICRO: Annual Microprogramming Workshop. p. 21-29 9 p.

Research output: Contribution to journal › Conference article

Transmission power control for multiple access wireless packet networks

Monks, J. P., Bharghavan, V. & Hwu, W. M. W., Dec 1 2000, In : Conference on Local Computer Networks. p. 12-21 10 p.

Research output: Contribution to journal › Conference article

Trimaran: An infrastructure for research in instruction-level parallelism

Chakrapani, L. N., Gyllenhaal, J., Hwu, W. M. W., Mahlke, S. A., Palem, K. V. & Rabbah, R. M., 2005, In : Lecture Notes in Computer Science. 3602, p. 32-41 10 p.

Research output: Contribution to journal › Conference article

Unrolling-based optimizations for modulo scheduling

Lavery, D. M. & Hwu, W. M. W., Jan 1 1995, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 327-337 11 p.

Research output: Contribution to journal › Conference article

Chapter

A guide for implementing tridiagonal solvers on GPUs

Chang, L. W. & Hwu, W. M. W., Jan 1 2014, Numerical Computations with GPUs. Springer International Publishing, p. 29-44 16 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

An introduction to OpenCL™

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 297-313 17 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Application case study: Advanced MRI reconstruction

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 235-264 30 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Architecture

Connors, D. A. & Hwu, W. M. W., Jan 1 2018, Mechatronic System Control, Logic, and Data Acquisition. CRC Press, p. 24-1-24-22

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Architecture

Connors, D. A. & Hwu, W. M. W., Apr 19 2016, The VLSI Handbook: Second Edition. CRC Press, p. 66.1-66.23

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Architecture

Connors, D. A. & Hwu, W. M. W., Jan 1 2002, The Mechatronics Handbook. CRC Press, p. 42-1-42-21

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Architecture

Connors, D. A. & Hwu, W-M. W., Jan 1 2003, Memory, Microprocessor, and ASIC. CRC Press, p. 11-1-11-22

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Architecture

Connors, D. A. & Hwu, W. M. W., Jan 1 2017, Mechatronic System Control, Logic, and Data Acquisition. CRC Press

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Architecture

Connors, D. A. & Hwu, W-M. W., Jan 1 2017, Mechatronic System Control, Logic, and Data Acquisition. CRC Press

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Compiler Technology

Chung, W. H. J., Lyu, Y. H., Sung, I. J. R., Lee, Y. W. & Hwu, W. M. W., 2016, Heterogeneous System Architecture: A New Compute Platform Infrastructure. Elsevier Inc., p. 97-129 33 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Conclusion and future outlook

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 459-469 11 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

CUDA dynamic parallelism

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 435-457 23 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

CUDA memories

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 95-121 27 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Data-parallel execution model

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 63-94 32 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Floating-point considerations

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 151-171 21 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

History of GPU computing

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 23-39 17 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Introduction

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 1-21 21 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Introduction

Hwu, W. M. W., Jan 1 2016, Heterogeneous System Architecture: A New Compute Platform Infrastructure. Elsevier Inc., p. 1-5 5 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Introduction to data parallelism and CUDA C

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 41-62 22 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Mapping high-level programming languages to OpenCL 2.0: A compiler writer's perspective

Sung, I. J., Chung, W. H., Lee, Y. W. & Hwu, W. M., May 18 2015, Heterogeneous Computing with OpenCL 2.0: Third Edition. Elsevier Inc., p. 249-272 24 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Parallel patterns: Prefix sum: An introduction to work efficiency in parallel algorithms

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 197-216 20 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Parallel patterns: Sparse matrix-vector multiplication: An introduction to compaction and regularization in parallel algorithms

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 217-234 18 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Parallel patterns: Convolution: With an introduction to constant memory and caches

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 173-196 24 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Parallel programming and computational thinking

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 281-295 15 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Performance analysis and tuning for general purpose graphics processing units (GPGPU)

Kim, H., Vuduc, R., Baghsorkhi, S., Hwu, W. M. & Jee Choi, C., Nov 21 2012, Performance Analysis and Tuning for General Purpose Graphics Processing Units (GPGPU). p. 1-94 94 p. (Synthesis Lectures on Computer Architecture; vol. 20).

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Performance considerations

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 123-149 27 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Using GPUs to accelerate advanced MRI reconstruction with field inhomogeneity compensation

Zhuo, Y., Wu, X. L., Haldar, J. P., Marin, T., Hwu, W. M. W., Liang, Z. P. & Sutton, B. P., Dec 1 2011, GPU Computing Gems Emerald Edition. Elsevier Inc., p. 709-722 14 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Book

GPU Computing Gems Emerald Edition

Hwu, W-M. W., Jan 1 2011, Elsevier Inc.

Research output: Book/Report/Conference proceeding › Book

GPU Computing Gems Jade Edition

Hwu, W-M. W., Jan 1 2012, Elsevier Inc.

Research output: Book/Report/Conference proceeding › Book

Heterogeneous System Architecture: A New Compute Platform Infrastructure

Hwu, W-M. W., Dec 4 2015, Elsevier Inc. 189 p.

Research output: Book/Report/Conference proceeding › Book

Programming massively parallel processors: A hands-on approach, second edition

Kirk, D. B. & Hwu, W-M. W., Jan 1 2013, Elsevier Science. 496 p.

Research output: Book/Report/Conference proceeding › Book

Programming Massively Parallel Processors: A Hands-on Approach: Third Edition

Kirk, D. B. & Hwu, W-M. W., Dec 7 2016, Elsevier Inc. 550 p.

Research output: Book/Report/Conference proceeding › Book

Article

Accelerating advanced MRI reconstructions on GPUs

Stone, S. S., Haldar, J. P., Tsao, S. C., Hwu, W. M. W., Sutton, B. P. & Liang, Z. P., Oct 2008, In : Journal of Parallel and Distributed Computing. 68, 10, p. 1307-1318 12 p.

Research output: Contribution to journal › Article