Wen-Mei W Hwu

1984 …2019



Research Output

Article

Optimized Data Transfers Based on the OpenCL Event Management Mechanism

Takizawa, H., Hirasawa, S., Sugawara, M., Gelado, I., Kobayashi, H. & Hwu, W. M. W., Jan 1 2015, In : Scientific Programming. 2015, 576498.

Research output: Contribution to journal › Article

Optimizing NET compilers for improved Java performance

Hsieh, C. H. A., Conte, M. T., Johnson, T. L., Gyllenhaal, J. C. & Hwu, W. M. W., Jun 1 1997, Computer, 30, 6, p. 67-75 9 p.

Research output: Contribution to specialist publication › Article

Partial reverse if-conversion framework for balancing control flow and predication

August, D. I., Hwu, W. M. W. & Mahlke, S. A., Jan 1 1999, In : International Journal of Parallel Programming. 27, 5, p. 381-423 43 p.

Research output: Contribution to journal › Article

Performance implications of synchronization support for parallel Fortran programs

Anik, S. & Hwu, W. M. W., Aug 1994, In : Journal of Parallel and Distributed Computing. 22, 2, p. 202-215 14 p.

Research output: Contribution to journal › Article

Profile-assisted instruction scheduling

Chen, W. Y., Mahlke, S. A., Warter, N. J., Anik, S. & Hwu, W. M. W., Apr 1 1994, In : International Journal of Parallel Programming. 22, 2, p. 151-181 31 p.

Research output: Contribution to journal › Article

Profile‐guided automatic inline expansion for C programs

Chang, P. P., Mahlke, S. A., Chen, W. Y. & Hwu, W-M. W., May 1992, In : Software: Practice and Experience. 22, 5, p. 349-369 21 p.

Research output: Contribution to journal › Article

Program Decision Logic Optimization Using Predication and Control Speculation

Hwu, W. M. W., August, D. I. & Sias, J. W., Nov 2001, In : Proceedings of the IEEE. 89, 11, p. 1660-1675 16 p.

Research output: Contribution to journal › Article

Program optimization carving for GPU computing

Ryoo, S., Rodrigues, C. I., Stone, S. S., Stratton, J. A., Ueng, S. Z., Baghsorkhi, S. S. & Hwu, W-M. W., Oct 1 2008, In : Journal of Parallel and Distributed Computing. 68, 10, p. 1389-1401 13 p.

Research output: Contribution to journal › Article

Rapid computation of sodium bioscales using GPU-accelerated image reconstruction

Atkinson, I. C., Liu, G., Obeid, N., Thulborn, K. R. & Hwu, W. M., Mar 1 2013, In : International Journal of Imaging Systems and Technology. 23, 1, p. 29-35 7 p.

Research output: Contribution to journal › Article

Real-time in vivo computed optical interferometric tomography

Ahmad, A., Shemonski, N. D., Adie, S. G., Kim, H. S., Hwu, W. M. W., Carney, P. S. & Boppart, S. A., Jun 1 2013, In : Nature Photonics. 7, 6, p. 444-448 5 p.

Research output: Contribution to journal › Article

Region-based compilation: Introduction, motivation, and initial experience

Hank, R. E., Hwu, W. M. W. & Rau, B. R., Jan 1 1997, In : International Journal of Parallel Programming. 25, 2, p. 113-146 34 p.

Research output: Contribution to journal › Article

Reverse If-Conversion

Warter, N. J., Mahlke, S. A., Hwu, W-M. W. & Rau, B. R., Jun 1 1993, In : ACM SIGPLAN Notices. 28, 6, p. 290-299 10 p.

Research output: Contribution to journal › Article

Runtime and Architecture Support for Efficient Data Exchange in Multi-Accelerator Applications

Cabezas, J., Gelado, I., Stone, J. E., Navarro, N., Kirk, D. B. & Hwu, W. M., May 1 2015, In : IEEE Transactions on Parallel and Distributed Systems. 26, 5, p. 1405-1418 14 p., 6803940.

Research output: Contribution to journal › Article

Run-time cache bypassing

Johnson, T. L., Connors, D. A., Merten, M. C. & Hwu, W. M. W., Dec 1 1999, In : IEEE Transactions on Computers. 48, 12, p. 1338-1354 17 p.

Research output: Contribution to journal › Article

Run-time spatial locality detection and optimization

Johnson, T. L., Merten, M. C. & Hwu, W-M. W., 1997, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 57-64 8 p.

Research output: Contribution to journal › Article

Scalable SIMD-parallel memory allocation for many-core machines

Huang, X., Rodrigues, C. I., Jones, S., Buck, I. & Hwu, W-M. W., Jun 1 2013, In : Journal of Supercomputing. 64, 3, p. 1008-1020 13 p.

Research output: Contribution to journal › Article

Semi-Coherent DMA: An Alternative I/O Coherency Management for Embedded Systems

Min, S. W., Alian, M., Hwu, W-M. W. & Kim, N. S., Aug 22 2018, (Accepted/In press) In : IEEE Computer Architecture Letters.

Research output: Contribution to journal › Article

Sentinel Scheduling: A Model for Compiler-Controlled Speculative Execution

Mahlke, S. A., Chen, W. Y., Bringmann, R. A., Hank, R. E., Hwu, W-M. W., Rau, B. R. & Schlansker, M. S., Nov 1 1993, In : ACM Transactions on Computer Systems (TOCS). 11, 4, p. 376-408 33 p.

Research output: Contribution to journal › Article

Sentinel Scheduling for VLIW and Superscalar Processors

Mahlke, S. A., Chen, W. Y., Hwu, W. M. W., Rau, B. R. & Schlansker, M. S., Sep 1 1992, In : ACM SIGPLAN Notices. 27, 9, p. 238-247 10 p.

Research output: Contribution to journal › Article

Simulation study of simultaneous vector prefetch performance in multiprocessor memory subsystems

Hwu, W. M. W. & Conte, T. M., May 1989, In : Performance Evaluation Review. 17, 1, 1 p.

Research output: Contribution to journal › Article

The concurrency challenge

Hwu, W. M., Keutzer, K. & Mattson, T. G., Aug 21 2008, In : IEEE Design and Test of Computers. 25, 4, p. 312-320 9 p.

Research output: Contribution to journal › Article

The Effect of Code Expanding Optimizations on Instruction Cache Design

Chen, W. Y., Chung, P. P. & Hwu, W. M. W., Sep 1993, In : IEEE Transactions on Computers. 42, 9, p. 1045-1057 13 p.

Research output: Contribution to journal › Article

The Importance of Prepass Code Scheduling for Superscalar and Superpipelined Processors

Chang, P. P., Lavery, D. M., Mahlke, S. A., Chen, W. Y. & Hwu, W. M. W., Mar 1995, In : IEEE Transactions on Computers. 44, 3, p. 353-370 18 p.

Research output: Contribution to journal › Article

The superblock: An effective technique for VLIW and superscalar compilation

Hwu, W. M. W., Mahlke, S. A., Chen, W. Y., Chang, P. P., Warter, N. J., Bringmann, R. A., Ouellette, R. G., Hank, R. E., Kiyohara, T., Haab, G. E., Holm, J. G. & Lavery, D. M., May 1 1993, In : The Journal of Supercomputing. 7, 1-2, p. 229-248 20 p.

Research output: Contribution to journal › Article

The Susceptibility of Programs to Context Switching

Hwu, W. M. W., Sep 1994, In : IEEE Transactions on Computers. 43, 9, p. 994-1003 10 p.

Research output: Contribution to journal › Article

Three Architectural Models for Compiler-Controlled Speculative Execution

Chang, P. P., Warter, N. J., Mahlke, S. A., Chen, W. Y. & Hwu, W. M. W., Apr 1995, In : IEEE Transactions on Computers. 44, 4, p. 481-494 14 p.

Research output: Contribution to journal › Article

TIGER: tiled iterative genome assembler

Wu, X. L., Heo, Y., El Hajj, I., Hwu, W. M., Chen, D. & Ma, J., 2012, In : Unknown Journal. 13 Suppl 19

Research output: Contribution to journal › Article

Tolerating cache-miss latency with multipass pipelines

Barnes, R. D., Ryoo, S. & Hwu, W. M. W., Jan 1 2006, In : IEEE Micro. 26, 1, p. 40-47 8 p.

Research output: Contribution to journal › Article

Toward application-aware security and reliability

Iyer, R. K., Kalbarczyk, Z., Pattabiraman, K., Healey, W., Hwu, W. W., Klemperer, P. & Farivar, R., Jan 1 2007, In : IEEE Security and Privacy. 5, 1, p. 57-62 6 p.

Research output: Contribution to journal › Article

Triolet: A programming system that unifies algorithmic skeleton interfaces for high-performance cluster computing

Rodrigues, C., Jablin, T., Dakkak, A. & Hwu, W. M., Aug 2014, In : ACM SIGPLAN Notices. 49, 8, p. 247-258 12 p.

Research output: Contribution to journal › Article

Using profile information to assist classic code optimizations

Chang, P. P., Mahlke, S. A. & Hwu, W-M. W., Dec 1991, In : Software: Practice and Experience. 21, 12, p. 1301-1321 21 p.

Research output: Contribution to journal › Article

What is ahead for parallel computing

Hwu, W-M. W., Jul 2014, In : Journal of Parallel and Distributed Computing. 74, 7, p. 2574-2581 8 p.

Research output: Contribution to journal › Article

Book

GPU Computing Gems Emerald Edition

Hwu, W-M. W., Jan 1 2011, Elsevier Inc.

Research output: Book/Report › Book

GPU Computing Gems Jade Edition

Hwu, W-M. W., Jan 1 2012, Elsevier Inc.

Research output: Book/Report › Book

Programming massively parallel processors: A hands-on approach, second edition

Kirk, D. B. & Hwu, W-M. W., Jan 1 2013, Elsevier Science. 496 p.

Research output: Book/Report › Book

Programming Massively Parallel Processors: A Hands-on Approach: Third Edition

Kirk, D. B. & Hwu, W-M. W., Dec 7 2016, Elsevier Inc. 550 p.

Research output: Book/Report › Book

Chapter

A guide for implementing tridiagonal solvers on GPUs

Chang, L. W. & Hwu, W. M. W., Jan 1 2014, Numerical Computations with GPUs. Springer International Publishing, p. 29-44 16 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

An introduction to OpenCL™

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 297-313 17 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Application case study: Advanced MRI reconstruction

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 235-264 30 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Architecture

Connors, D. A. & Hwu, W-M. W., Apr 19 2016, The VLSI Handbook: Second Edition. CRC Press, p. 66.1-66.23

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Architecture

Connors, D. A. & Hwu, W. M. W., Jan 1 2018, Mechatronic System Control, Logic, and Data Acquisition. CRC Press, p. 24-1-24-22

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Architecture

Connors, D. A. & Hwu, W-M. W., Jan 1 2017, Mechatronic System Control, Logic, and Data Acquisition. CRC Press

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Architecture

Connors, D. A. & Hwu, W-M. W., Jan 1 2003, Memory, Microprocessor, and ASIC. CRC Press, p. 11-1-11-22

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Architecture

Connors, D. A. & Hwu, W-M. W., Jan 1 2002, The Mechatronics Handbook. CRC Press, p. 42-1-42-21

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Architecture

Connors, D. A. & Hwu, W. M. W., Jan 1 2017, Mechatronic System Control, Logic, and Data Acquisition. CRC Press

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Compiler Technology

Chung, W. H. J., Lyu, Y. H., Sung, I. J. R., Lee, Y. W. & Hwu, W-M. W., Dec 4 2015, Heterogeneous System Architecture: A New Compute Platform Infrastructure. Elsevier Inc., p. 97-129 33 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Conclusion and future outlook

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 459-469 11 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

CUDA dynamic parallelism

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 435-457 23 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

CUDA memories

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 95-121 27 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter