Wen-Mei W Hwu

If you made any changes in Pure these will be visible here soon.

Research Output

FPGA accelerated DNA error correction

Ramachandran, A., Heo, Y., Hwu, W-M. W., Ma, J. & Chen, D., Apr 22 2015, Proceedings of the 2015 Design, Automation and Test in Europe Conference and Exhibition, DATE 2015. Institute of Electrical and Electronics Engineers Inc., Vol. 2015-April. p. 1371-1376 6 p. 7092605

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Framework for balancing control flow and predication

August, D. I., Hwu, W. M. W. & Mahlke, S. A., Dec 1 1997, Proceedings of the Annual International Symposium on Microarchitecture. p. 92-103 12 p. (Proceedings of the Annual International Symposium on Microarchitecture).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

From the guest editors

Hwu, W. M. & Nicolau, A., Jun 1 1994, In : International Journal of Parallel Programming. 22, 3, p. 207-208 2 p.

Research output: Contribution to journalEditorial

Generalize or die: Operating systems support for memristor-based accelerators

Bruel, P., Chalamalasetti, S. R., Dalton, C., Hajj, I. E., Goldman, A., Graves, C., Hwu, W. M., Laplante, P., Milojicic, D., Ndu, G. & Strachan, J. P., Nov 28 2017, 2017 IEEE International Conference on Rebooting Computing, ICRC 2017 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 1-8 8 p. (2017 IEEE International Conference on Rebooting Computing, ICRC 2017 - Proceedings; vol. 2017-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

GPU acceleration of cutoff pair potentials for molecular modeling applications

Rodrigues, C. I., Hardy, D. J., Stone, J. E., Schulten, K. & Hwu, W. M. W., Dec 1 2008, Conference on Computing Frontiers - Proceedings of the 2008 Conference on Computing Frontiers, CF'08. p. 273-282 10 p. (Conference on Computing Frontiers - Proceedings of the 2008 Conference on Computing Frontiers, CF'08).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

GPU clusters for high-performance computing

Kindratenko, V., Enos, J. J., Shi, G., Showerman, M. T., Arnold, G. W., Stone, J. E., Phillips, J. C. & Hwu, W-M. W., Dec 21 2009, 2009 IEEE International Conference on Cluster Computing and Workshops, CLUSTER '09. 5289128. (Proceedings - IEEE International Conference on Cluster Computing, ICCC).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

GPU Computing Gems Emerald Edition

Hwu, W-M. W., Jan 1 2011, Elsevier Inc.

Research output: Book/ReportBook

GPU Computing Gems Jade Edition

Hwu, W-M. W., Jan 1 2012, Elsevier Inc.

Research output: Book/ReportBook

GPU-SM: Shared memory multi-GPU programming

Cabezas, J., Jordà, M., Gelado, I., Navarro, N. & Hwu, W-M. W., Feb 7 2015, ACM International Conference Proceeding Series. Gong, X. (ed.). Association for Computing Machinery, p. 13-24 12 p. (ACM International Conference Proceeding Series; vol. 2015-February).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Hardware acceleration of the pair-HMM algorithm for DNA variant calling

Huang, S., Manikandan, G. J., Ramachandran, A., Rupnow, K., Hwu, W. M. W. & Chen, D., Feb 22 2017, FPGA 2017 - Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays. Association for Computing Machinery, Inc, p. 275-284 10 p. (FPGA 2017 - Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Hardware-compiler co-design for adjustable data power savings

Hunter, H. C., Nystrom, E. M., Connors, D. A. & Hwu, W. M. W., Jun 1 2009, In : Microprocessors and Microsystems. 33, 4, p. 244-253 10 p.

Research output: Contribution to journalArticle

Hardware-driven profiling scheme for identifying program hot spots to support runtime optimization

Merten, M. C., Trick, A. R., George, C. N., Gyllenhaal, J. C. & Hwu, W. M. W., Jan 1 1999, In : Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA. p. 136-147 12 p.

Research output: Contribution to journalConference article

Hardware mechanism for dynamic extraction and relayout of program hot spots

Merten, M. C., Trick, A. R., Nystrom, E. M., Barnes, R. D. & Hwu, W. M. W., Jan 1 2000, In : Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA. p. 59-70 12 p.

Research output: Contribution to journalConference article

Hardware-Software Co-Design for an Analog-Digital Accelerator for Machine Learning

Ambrosi, J., Ankit, A., Antunes, R., Chalamalasetti, S. R., Chatterjee, S., El Hajj, I., Fachini, G., Faraboschi, P., Foltin, M., Huang, S., Hwu, W. M., Knuppe, G., Lakshminarasimha, S. V., Milojicic, D., Parthasarathy, M., Ribeiro, F., Rosa, L., Roy, K., Silveira, P. & Strachan, J. P., Feb 8 2019, 2018 IEEE International Conference on Rebooting Computing, ICRC 2018. Institute of Electrical and Electronics Engineers Inc., 8638612. (2018 IEEE International Conference on Rebooting Computing, ICRC 2018).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Hardware support for dynamic activation of compiler-directed computation reuse

Connors, D. A., Hunter, H. C., Cheng, B. C. & Hwu, W. M. W., Dec 2000, In : Operating Systems Review (ACM). 34, 5, p. 222-233 12 p.

Research output: Contribution to journalArticle

Hardware support for dynamic activation of compiler-directed computation reuse

Connors, D. A., Hunter, H. C., Cheng, B. C. & Hwu, W. M. W., Nov 2000, In : SIGPLAN Notices (ACM Special Interest Group on Programming Languages). 35, 11, p. 222-233 12 p.

Research output: Contribution to journalArticle

Hardware support for dynamic activation of Compiler-directed Computation Reuse

Connors, D. A., Hunter, H. C., Cheng, B. C. & Hwu, W. M. W., Jan 1 2000, In : International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS. p. 222-233 12 p.

Research output: Contribution to journalArticle

Heterogeneous Computing Meets Near-Memory Acceleration and High-Level Synthesis in the Post-Moore Era

Kim, N. S., Chen, D., Xiong, J. & Hwu, W. M. W., Jan 1 2017, In : IEEE Micro. 37, 4, p. 10-18 9 p., 8013455.

Research output: Contribution to journalArticle

High performance computation and display of molecular orbitals on and multi-core cpus

Stone, J. E., Saam, J., Hardy, D. J., Vandivort, K. L., Hwu, W-M. W. & Schulten, K. J., Jul 23 2009, Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-2. 1 p. (Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-2).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

High-performance computing with accelerators

Kindratenko, V., Wilhelmson, R., Brunner, R., Martíez, T. J. & Hwu, W. M., Jul 1 2010, In : Computing in Science and Engineering. 12, 4, p. 12-16 5 p., 5492949.

Research output: Contribution to journalEditorial

High-performance CUDA kernel execution on FPGAs

Papakonstantinou, A., Gururaj, K., Stratton, J. A., Chen, D., Cong, J. & Hwu, W-M. W., Nov 24 2009, ICS'09 - Proceedings of the 23rd International Conference on Supercomputing. p. 515-516 2 p. 1542357. (Proceedings of the International Conference on Supercomputing).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

High-speed interferometric synthetic aperture microscopy on a graphics processing unit

Ahmad, A., Shemonski, N., Adie, S. G., Kim, H., Hwu, W. M. W., Carney, P. S. & Boppart, S. A., 2012, Frontiers in Optics, FIO 2012.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

High-throughput Ant Colony Optimization on graphics processing units

Cecilia, J. M., Llanes, A., Abellán, J. L., Gómez-Luna, J., Chang, L. W. & Hwu, W. M. W., Mar 2018, In : Journal of Parallel and Distributed Computing. 113, p. 261-274 14 p.

Research output: Contribution to journalArticle

History of GPU computing

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 23-39 17 p.

Research output: Chapter in Book/Report/Conference proceedingChapter

HPS, A NEW MICROARCHITECTURE: RATIONALE AND INTRODUCTION.

Patt, Y. N., Hwo, W. M. & Shebanow, M. C., Dec 1 1985, MICRO: Annual Microprogramming Workshop. ACM, p. 103-108 6 p. (MICRO: Annual Microprogramming Workshop).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

HPS IMPLEMENTATION OF VAX; INITIAL DESIGN AND ANALYSIS.

Hwu, W-M. W., Melvin, S., Shebanow, M., Chen, C., Wei, J. J. & Patt, Y., 1986, In : Proceedings of the Hawaii International Conference on System Science. 1, p. 282-291 10 p.

Research output: Contribution to journalArticle

HPSM, A HIGH PERFORMANCE RESTRICTED DATA FLOW ARCHITECTURE HAVING MINIMAL FUNCTIONALITY.

Hwu, W-M. W. & Patt, Y. N., 1986, Conference Proceedings - Annual Symposium on Computer Architecture. IEEE, p. 297-306 10 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

HPSM2: A REFINED SINGLE-CHIP MICROENGINE.

Hwu, W-M. W. & Patt, Y. N., 1988, Proceedings of the Hawaii International Conference on System Science. Hoevel, L. W. & NCR Corp, D. (eds.). IEEE, p. 30-40 11 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

HPS papers: A retrospective

Patt, Y. N., Hwu, W. M. W., Melvin, S. W. & Shebanow, M. C., Jan 1 2016, In : IEEE Micro. 36, 4, p. 76-79 4 p., 7542473.

Research output: Contribution to journalArticle

IMPACT: An architectural framework for multiple-instruction-issue processors

Chang, P. P., Mahlke, S. A., Chen, W. Y., Warter, N. J. & Hwu, W-M. W., May 1 1991, Conference Proceedings - Annual Symposium on Computer Architecture. Publ by IEEE, p. 266-275 10 p. (Conference Proceedings - Annual Symposium on Computer Architecture).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Impatient MRI: Illinois Massively Parallel Acceleration Toolkit for image reconstruction with enhanced throughput in MRI

Wu, X. L., Gai, J., Lam, F., Fu, M., Haldar, J. P., Zhuo, Y., Liang, Z. P., Hwu, W. M. & Sutton, B. P., Nov 2 2011, 2011 8th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, ISBI'11. p. 69-72 4 p. 5872356. (Proceedings - International Symposium on Biomedical Imaging).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Implementing a GPU programming model on a non-GPU accelerator architecture

Kofsky, S. M., Johnson, D. R., Stratton, J. A., Hwu, W. M. W., Patel, S. J. & Lumetta, S. S., Mar 8 2012, Computer Architecture - ISCA 2010 International Workshops, A4MMC, AMAS-BT, EAMA, WEED, WIOSCA, Revised Selected Papers. p. 40-51 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6161 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Implementing neural machine translation with bi-directional GRU and attention mechanism on FPGAs using HLS

Li, Q., Zhang, X., Xiong, J. J., Hwu, W. M. & Chen, D., Jan 21 2019, ASP-DAC 2019 - 24th Asia and South Pacific Design Automation Conference. Institute of Electrical and Electronics Engineers Inc., p. 693-698 6 p. (Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Implicitly parallel programming models for thousand-core microprocessors

Hwu, W-M. W., Ryoo, S., Ueng, S. Z., Keim, J. H., Gelado, I., Stone, S. S., Kidd, R. E., Baghsorkhi, S. S., Mahesri, A. A., Tsao, S. C., Navarro, N., Lumetta, S. S., Frank, M. I. & Patel, S. J., Aug 2 2007, 2007 44th ACM/IEEE Design Automation Conference, DAC'07. p. 754-759 6 p. 4261284. (Proceedings - Design Automation Conference).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Importance of heap specialization in pointer analysis

Nystrom, E. M., Kim, H. S. & Hwu, W-M. W., Sep 29 2004, p. 43-48. 6 p.

Research output: Contribution to conferencePaper

Improved Superblock optimization in GCC

Kidd, R. & Hwu, W-M. W., 2006, Proceedings of the GCC Developers' Summit 2006. p. 85-96 12 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Improving static branch prediction in a compiler

Deitrich, B. L., Chen, B. C. & Hwu, W. M. W., Jan 1 1998, Proceedings - 1998 International Conference on Parallel Architectures and Compilation Techniques, PACT 1998. Institute of Electrical and Electronics Engineers Inc., p. 214-221 8 p. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Incremental compiler transformations for multiple instruction retry

Chen, SK. K., Alewine, N. J., Fuchs, W. K. & Hwu, WM. W., Dec 1994, In : Software: Practice and Experience. 24, 12, p. 1179-1198 20 p.

Research output: Contribution to journalArticle

Inline Function Expansion for Compiling C Programs

Chang, P. P. & Hwu, W. W., Jun 21 1989, In : ACM SIGPLAN Notices. 24, 7, p. 246-257 12 p.

Research output: Contribution to journalArticle

In-place data sliding algorithms for many-core architectures

Luna, J. G., Chang, L. W., Sung, I. J., Hwu, W-M. W. & Guil, N., Dec 8 2015, Proceedings - 2015 44th International Annual Conference on Parallel Processing, ICPP 2015. Institute of Electrical and Electronics Engineers Inc., p. 210-219 10 p. 7349576. (Proceedings of the International Conference on Parallel Processing; vol. 2015-December).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

In-Place Matrix Transposition on GPUs

Gomez-Luna, J., Sung, I. J., Chang, L. W., Gonzalez-Linares, J. M., Guil, N. & Hwu, W. M. W., Mar 1 2016, In : IEEE Transactions on Parallel and Distributed Systems. 27, 3, p. 776-788 13 p., 7059219.

Research output: Contribution to journalArticle

In-place transposition of rectangular matrices on accelerators

Sung, I. J., Gómez-Luna, J., González-Linares, J. M., Guil, N. & Hwu, W-M. W., Aug 2014, In : ACM SIGPLAN Notices. 49, 8, p. 207-218 12 p.

Research output: Contribution to journalArticle

In-place transposition of rectangular matrices on accelerators

Sung, I. J., Gómez-Luna, J., González-Linares, J. M., Guil, N. & Hwu, W-M. W., Mar 10 2014, PPoPP 2014 - Proceedings of the 2014 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 207-218 12 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Integrated predicated and speculative execution in the IMPACT EPIC architecture

August, D. I., Connors, D. A., Mahlke, S. A., Sias, J. W., Crozier, K. M., Cheng, B. C., Eaton, P. R., Olaniran, Q. B. & Hwu, W-M. W., Jan 1 1998, In : Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA. p. 227-237 11 p.

Research output: Contribution to journalConference article

Interferometric synthetic aperture microscopy with computational adaptive optics for high-resolution tomography of scattering tissue

Adie, S. G., Ahmad, A., Shemonski, N., Graf, B. W., Kim, H., Hwu, W. M. W., Carney, P. S. & Boppart, S. A., 2012, Biomedical Optics, BIOMED 2012.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Interpretable and globally optimal prediction for textual grounding using image concepts

Yeh, R. A., Xiong, J., Hwu, W. M. W., Do, M. N. & Schwing, A. G., Jan 1 2017, In : Advances in Neural Information Processing Systems. 2017-December, p. 1913-1923 11 p.

Research output: Contribution to journalConference article

Introduction

Hwu, W. M. W., Dec 1 2012, GPU Computing Gems Jade Edition. Elsevier Inc., p. xv-xvi

Research output: Chapter in Book/Report/Conference proceedingForeword/postscript

Introduction

Hwu, W. M. W., Jan 1 2016, Heterogeneous System Architecture: A New Compute Platform Infrastructure. Elsevier Inc., p. 1-5 5 p.

Research output: Chapter in Book/Report/Conference proceedingChapter

Introduction

Kirk, D. B. & Hwu, W-M. W., Jan 1 2012, Programming Massively Parallel Processors: A Hands-on Approach, Second Edition. Elsevier Science, p. 1-21 21 p.

Research output: Chapter in Book/Report/Conference proceedingChapter