Wen-Mei W Hwu

If you made any changes in Pure these will be visible here soon.

Research Output

Programming massively parallel processors: A hands-on approach, second edition

Kirk, D. B. & Hwu, W-M. W., Jan 1 2013, Elsevier Science. 496 p.

Research output: Book/ReportBook

Programming Massively Parallel Processors: A Hands-on Approach: Third Edition

Kirk, D. B. & Hwu, W-M. W., Dec 7 2016, Elsevier Inc. 550 p.

Research output: Book/ReportBook

Program optimization carving for GPU computing

Ryoo, S., Rodrigues, C. I., Stone, S. S., Stratton, J. A., Ueng, S. Z., Baghsorkhi, S. S. & Hwu, W. M. W., Oct 1 2008, In : Journal of Parallel and Distributed Computing. 68, 10, p. 1389-1401 13 p.

Research output: Contribution to journalArticle

Program optimization space pruning for a multithreaded GPU

Ryoo, S., Rodrigues, C. I., Stone, S. S., Baghsorkhi, S. S., Ueng, S. Z., Stratton, J. A. & Hwu, W. M. W., May 19 2008, Proceedings of the 2008 CGO - Sixth International Symposium on Code Generation and Optimization. p. 195-204 10 p. (Proceedings of the 2008 CGO - Sixth International Symposium on Code Generation and Optimization).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

PUMA: A Programmable Ultra-efficient Memristor-based Accelerator for Machine Learning Inference

Ankit, A., El Hajj, I., Rahul Chalamalasetti, S., Ndu, G., Foltin, M., Williams, R. S., Faraboschi, P., Hwu, W. M., Paul Strachan, J., Roy, K. & Milojicic, D. S., Apr 4 2019, ASPLOS 2019 - 24th International Conference on Architectural Support for Programming Languages and Operating Systems. Association for Computing Machinery, p. 715-731 17 p. (International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Open Access

RAI: A scalable project submission system for parallel programming courses

Dakkak, A., Pearson, C., Li, C. & Hwu, W. M., Jun 30 2017, Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017. Institute of Electrical and Electronics Engineers Inc., p. 315-322 8 p. 7965062. (Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Rapid computation of sodium bioscales using gpu-accelerated image reconstruction

Atkinson, I. C., Liu, G., Obeid, N., Thulborn, K. R. & Hwu, W. M., Mar 1 2013, In : International Journal of Imaging Systems and Technology. 23, 1, p. 29-35 7 p.

Research output: Contribution to journalArticle

Real-time in vivo computed optical interferometric tomography

Ahmad, A., Shemonski, N. D., Adie, S. G., Kim, H. S., Hwu, W. M. W., Carney, P. S. & Boppart, S. A., Jun 1 2013, In : Nature Photonics. 7, 6, p. 444-448 5 p.

Research output: Contribution to journalArticle

Rebooting the data access hierarchy of computing systems

Hwu, W-M. W., Hajj, I. E., De Gonzalo, S. G., Pearson, C., Kim, N. S., Chen, D., Xiong, J. & Sura, Z., Nov 28 2017, 2017 IEEE International Conference on Rebooting Computing, ICRC 2017 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 1-4 4 p. (2017 IEEE International Conference on Rebooting Computing, ICRC 2017 - Proceedings; vol. 2017-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Region-based compilation: Introduction, motivation, and initial experience

Hank, R. E., Hwu, W. M. W. & Rau, B. R., Jan 1 1997, In : International Journal of Parallel Programming. 25, 2, p. 113-146 34 p.

Research output: Contribution to journalArticle

Region-based compilation: an introduction and motivation

Hank, R. E., Hwu, W. M. W. & Rau, B. R., Jan 1 1995, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 158-168 11 p.

Research output: Contribution to journalConference article

Reinforcement learning based text style transfer without parallel training corpus

Gong, H., Bhat, S., Wu, L., Xiong, J. & Hwu, W. M., Jan 1 2019, Long and Short Papers. Association for Computational Linguistics (ACL), p. 3168-3180 13 p. (NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference; vol. 1).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Reverse if-conversion

Warter, N. J., Mahlke, S. A., Hwu, W. M. W. & Rau, B. R., Dec 1 1993, Proc ACM SIGPLAN 93 Conf Program Lang Des Implementation. Anon (ed.). Publ by ACM, p. 290-299 10 p. (Proc ACM SIGPLAN 93 Conf Program Lang Des Implementation).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Reverse If-Conversion

Warter, N. J., Mahlke, S. A., Hwu, W-M. W. & Rau, B. R., Jan 6 1993, In : ACM SIGPLAN Notices. 28, 6, p. 290-299 10 p.

Research output: Contribution to journalArticle

Run-time adaptive cache hierarchy management via reference analysis

Johnson, T. L. & Hwu, W. M. W., Jan 1 1997, In : Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA. p. 315-326 12 p.

Research output: Contribution to journalConference article

Run-time adaptive cache management

Johnson, T. L., Connors, D. A. & Hwu, W. M. W., Jan 1 1998, In : Proceedings of the Hawaii International Conference on System Sciences. 7, p. 774-775 2 p.

Research output: Contribution to journalConference article

Runtime and Architecture Support for Efficient Data Exchange in Multi-Accelerator Applications

Cabezas, J., Gelado, I., Stone, J. E., Navarro, N., Kirk, D. B. & Hwu, W. M., May 1 2015, In : IEEE Transactions on Parallel and Distributed Systems. 26, 5, p. 1405-1418 14 p., 6803940.

Research output: Contribution to journalArticle

Run-time cache bypassing

Johnson, T. L., Connors, D. A., Merten, M. C. & Hwu, W. M. W., Dec 1 1999, In : IEEE Transactions on Computers. 48, 12, p. 1338-1354 17 p.

Research output: Contribution to journalArticle

RUN-TIME GENERATION OF HPS MICROINSTRUCTIONS FROM A VAX INSTRUCTION STREAM.

Patt, Y. N., Melvin, S. W., Hwu, W. M., Shebanow, M. C., Chen, C. & We, J., Dec 1 1986, MICRO: Annual Microprogramming Workshop. IEEE, p. 75-81 7 p. (MICRO: Annual Microprogramming Workshop).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Run-time spatial locality detection and optimization

Johnson, T. L., Merten, M. C. & Hwu, W-M. W., 1997, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 57-64 8 p.

Research output: Contribution to journalArticle

Scalable parallel DBIM solutions of inverse-scattering problems

Hidayetogglu, M., Pearson, C., Gurel, L., Hwu, W. M. & Chew, W. C., Jul 25 2017, CEM 2017 - 2017 Computing and Electromagnetics International Workshop. Gurel, L. (ed.). Institute of Electrical and Electronics Engineers Inc., p. 65-66 2 p. 7991889. (CEM 2017 - 2017 Computing and Electromagnetics International Workshop).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Scalable SIMD-parallel memory allocation for many-core machines

Huang, X., Rodrigues, C. I., Jones, S., Buck, I. & Hwu, W. M., Jun 1 2013, In : Journal of Supercomputing. 64, 3, p. 1008-1020 13 p.

Research output: Contribution to journalArticle

Seeing the invisible: Limited-view imaging with multiple-scattering reconstruction

Hidayetoglu, M., Hwu, W. M. & Chew, W. C., Feb 21 2018, 2018 United States National Committee of URSI National Radio Science Meeting, USNC-URSI NRSM 2018. Institute of Electrical and Electronics Engineers Inc., p. 1-2 2 p. (2018 United States National Committee of URSI National Radio Science Meeting, USNC-URSI NRSM 2018; vol. 2018-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Semi-Coherent DMA: An Alternative I/O Coherency Management for Embedded Systems

Min, S., Alian, M., Hwu, W. M. & Kim, N. S., Jul 1 2018, In : IEEE Computer Architecture Letters. 17, 2, p. 221-224 4 p., 8444757.

Research output: Contribution to journalArticle

Sentinel Scheduling: A Model for Compiler-Controlled Speculative Execution

Mahlke, S. A., Chen, W. Y., Bringmann, R. A., Hank, R. E., Hwu, W. M. W., Rau, B. R. & Schlansker, M. S., Jan 11 1993, In : ACM Transactions on Computer Systems (TOCS). 11, 4, p. 376-408 33 p.

Research output: Contribution to journalArticle

Sentinel scheduling for VLIW and superscalar processors

Mahlke, S. A., Chen, W. Y., Hwu, W-M. W., Rau, B. R. & Schlansker, M. S., Jan 1 1992, International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS. 9 ed. Publ by ACM, p. 238-247 10 p. (International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS; vol. 27, no. 9).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sentinel Scheduling for VLIW and Superscalar Processors

Mahlke, S. A., Chen, W. Y., Hwu, W. M. W., Rau, B. R. & Schlansker, M. S., Jan 9 1992, In : ACM SIGPLAN Notices. 27, 9, p. 238-247 10 p.

Research output: Contribution to journalArticle

Simulation study of simultaneous vector prefetch performance in multiprocessor memory subsystems

Hwu, W. M. W. & Conte, T. M., May 1 1989, In : Performance Evaluation Review. 17, 1, 1 p.

Research output: Contribution to journalConference article

SpaceJMP: Programming with multiple virtual address spaces

El Hajj, I., Merritt, A., Zellweger, G., Milojicic, D., Achermann, R., Faraboschi, P., Hwu, W. M., Roscoe, T. & Schwan, K., Mar 25 2016, ASPLOS 2016 - 21st International Conference on Architectural Support for Programming Languages and Operating Systems. Association for Computing Machinery, p. 353-368 16 p. (International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS; vol. 02-06-April-2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sparse regularization in MRI iterative reconstruction using GPUs

Zhuo, Y., Sutton, B., Wu, X. L., Haldar, J., Hwu, W. M. & Liang, Z. P., Dec 1 2010, Proceedings - 2010 3rd International Conference on Biomedical Engineering and Informatics, BMEI 2010. p. 578-582 5 p. 5640008. (Proceedings - 2010 3rd International Conference on Biomedical Engineering and Informatics, BMEI 2010; vol. 2).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

SPEC ACCEL: A standard application suite for measuring hardware accelerator performance

Juckeland, G., Brantley, W., Chandrasekaran, S., Chapman, B., Che, S., Colgrove, M., Feng, H., Grund, A., Henschel, R., Hwu, W. M. W., Li, H., Müller, M. S., Nagel, W. E., Perminov, M., Shelepugin, P., Skadron, K., Stratton, J., Titov, A., Wang, K., Van Waveren, M. & 4 others, Whitney, B., Wienke, S., Xu, R. & Kumaran, K., Jan 1 2015, High Performance Computing Systems: Performance Modeling, Benchmarking, and Simulation - 5th International Workshop, PMBS 2014, Revised Selected Papers. Hammond, S. D., Jarvis, S. A. & Wright, S. A. (eds.). Springer-Verlag, p. 46-67 22 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 8966).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Speculative execution exception recovery using write-back suppression

Bringmann, R. A., Mahlke, S. A., Hank, R. E., Gyllenhaal, J. C. & Hwu, W-M. W., 1994, Proceedings of the Annual International Symposium on Microarchitecture. Anon (ed.). Publ by IEEE, p. 214-223 10 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Speculative hedge: Regulating compile-time speculation against profile variations

Deitrich, B. L. & Hwu, W-M. W., Dec 1 1996, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 70-79 10 p.

Research output: Contribution to journalConference article

Study of the cache and branch performance issues with running Java on current hardware platforms

Hsieh, C. H. A., Conte, M. T., Johnson, T. L., Gyllenhaal, J. C. & Hwu, W. M. W., Jan 1 1997, p. 211-216. 6 p.

Research output: Contribution to conferencePaper

Superblock formation using static program analysis

Hank, R. E., Mahlke, S. A., Bringmann, R. A., Gyllenhaal, J. C. & Hwu, W-M. W., Jan 1 1994, Proceedings of the Annual International Symposium on Microarchitecture. Anon (ed.). Publ by IEEE, p. 247-255 9 p. (Proceedings of the Annual International Symposium on Microarchitecture).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Supercomputing for Full-Wave Tomographic Image Reconstruction in Near-Real Time

Hidayetoǧlu, M., Hwu, W. M. & Cho Chew, W., Jan 1 2018, 2018 IEEE Antennas and Propagation Society International Symposium and USNC/URSI National Radio Science Meeting, APSURSI 2018 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 1841-1842 2 p. 8608869. (2018 IEEE Antennas and Propagation Society International Symposium and USNC/URSI National Radio Science Meeting, APSURSI 2018 - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Systematic prototyping of superscalar computer architectures

Conte, T. M. & Hwu, W-M. W., Jan 1 1992, Proceedings - 3rd International Workshop on Rapid System Prototyping: Shortening the Path from Specification to Prototype, RSP 1992. IEEE Computer Society, p. 161-170 10 p. 243910. (Proceedings of the International Workshop on Rapid System Prototyping; vol. 1992-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

The application of compiler-assisted multiple-instruction retry to VLIW architectures

Chen, S. K., Fuchs, W. K. & Hwu, W. M. W., Jan 1 1994, Proceedings of IEEE Workshop on Fault-Tolerant Parallel and Distributed Systems, FTPDS 1994. Pradhan, D. & Avresky, D. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 51-58 8 p. 494474. (Proceedings of IEEE Workshop on Fault-Tolerant Parallel and Distributed Systems, FTPDS 1994).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

The benefit of predicated execution for software pipelining

Warter, N. J., Lavery, D. R. & Hwu, W. M. W., Jan 1 1993, Proceedings of the 26th Hawaii International Conference on System Sciences, HICSS 1993. IEEE Computer Society, p. 497-506 10 p. 1198122. (Proceedings of the Annual Hawaii International Conference on System Sciences; vol. 1).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

The concurrency challenge

Hwu, W. M., Keutzer, K. & Mattson, T. G., Aug 21 2008, In : IEEE Design and Test of Computers. 25, 4, p. 312-320 9 p.

Research output: Contribution to journalArticle

The design and implementation of the wolfram language compiler

Dakkak, A., Wickham-Jones, T. & Hwu, W. M., Feb 22 2020, CGO 2020 - Proceedings of the 18th ACM/IEEE International Symposium on Code Generation and Optimization. Mars, J., Tang, L., Xue, J. & Wu, P. (eds.). Association for Computing Machinery, Inc, p. 212-228 17 p. (CGO 2020 - Proceedings of the 18th ACM/IEEE International Symposium on Code Generation and Optimization).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Open Access

The Effect of Code Expanding Optimizations on Instruction Cache Design

Chen, W. Y., Chung, P. P. & Hwu, W. M. W., Sep 1993, In : IEEE Transactions on Computers. 42, 9, p. 1045-1057 13 p.

Research output: Contribution to journalArticle

The future of computer architecture research: An industrial perspective

Hwu, W. M. & Patel, S., Dec 12 2005, Proceedings - 11th International Symposium on High-Performance Computer Architecture, HPCA-11 2005. 1 p. (Proceedings - International Symposium on High-Performance Computer Architecture).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

The Importance of Prepass Code Scheduling for Superscalar and Superpipelined Processors

Chang, P. P., Lavery, D. M., Mahlke, S. A., Chen, W. Y. & Hwu, W. M. W., Mar 1995, In : IEEE Transactions on Computers. 44, 3, p. 353-370 18 p.

Research output: Contribution to journalArticle

The parallelization of video processing: From programming models to applications

Lin, D., Huang, X., Nguyen, Q., Blackburn, J., Rodrigues, C., Huang, T., Do, M. N., Patel, S. J. & Hwu, W. M. W., Jan 1 2009, In : IEEE Signal Processing Magazine. 26, 6, p. 103-112 10 p.

Research output: Contribution to journalReview article

The superblock: An effective technique for VLIW and superscalar compilation

Hwu, W. M. W., Mahlke, S. A., Chen, W. Y., Chang, P. P., Warter, N. J., Bringmann, R. A., Ouellette, R. G., Hank, R. E., Kiyohara, T., Haab, G. E., Holm, J. G. & Lavery, D. M., May 1 1993, In : The Journal of Supercomputing. 7, 1-2, p. 229-248 20 p.

Research output: Contribution to journalArticle

The Susceptibility of Programs to Context Switching

Hwu, W. M. W., Sep 1994, In : IEEE Transactions on Computers. 43, 9, p. 994-1003 10 p.

Research output: Contribution to journalArticle

Thoughts on massively-parallel heterogeneous computing for solving large problems

Hwu, W. M., Hidayetogglu, M., Chew, W. C., Pearson, C., Garcia, S., Huang, S. & Dakkak, A., Jul 25 2017, CEM 2017 - 2017 Computing and Electromagnetics International Workshop. Gurel, L. (ed.). Institute of Electrical and Electronics Engineers Inc., p. 67-68 2 p. 7991890. (CEM 2017 - 2017 Computing and Electromagnetics International Workshop).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Three Architectural Models for Compiler-Controlled Speculative Execution

Chang, P. P., Warter, N. J., Mahlke, S. A., Chen, W. Y. & Hwu, W. M. W., Apr 1995, In : IEEE Transactions on Computers. 44, 4, p. 481-494 14 p.

Research output: Contribution to journalArticle

Throughput-oriented kernel porting onto FPGAs

Papakonstantinou, A., Chen, D., Hwu, W. M., Cong, J. & Yun, L., Jul 12 2013, Proceedings of the 50th Annual Design Automation Conference, DAC 2013. 11. (Proceedings - Design Automation Conference).

Research output: Chapter in Book/Report/Conference proceedingConference contribution