Wen-Mei W Hwu

1984 …2019
If you made any changes in Pure, your changes will be visible here soon.

Research Output 1984 2019

Algorithm and data optimization techniques for scaling to massively threaded systems

Stratton, J. A., Rodrigues, C., Sung, I. J. R., Chang, L. W., Anssari, N., Liu, G. D., Hwu, W-M. W. & Obeid, N., Aug 29 2012, Computer, 45, 8, p. 26-32 7 p.

Research output: Contribution to specialist publicationArticle

Scalability
Graphics processing unit

A guide for implementing tridiagonal solvers on GPUs

Chang, L. W. & Hwu, W. M. W., Jan 1 2014, Numerical Computations with GPUs. Springer International Publishing, p. 29-44 16 p.

Research output: Chapter in Book/Report/Conference proceedingChapter

Graphics processing unit

A fast and massively-parallel inverse solver for multiple-scattering tomographic image reconstruction

Hidayetoglu, M., Pearson, C., El Hajj, I., Gurel, L., Chew, W. C. & Hwu, W-M. W., Aug 3 2018, Proceedings - 2018 IEEE 32nd International Parallel and Distributed Processing Symposium, IPDPS 2018. Institute of Electrical and Electronics Engineers Inc., p. 64-74 11 p. 8425161. (Proceedings - 2018 IEEE 32nd International Parallel and Distributed Processing Symposium, IPDPS 2018).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Multiple scattering
Image reconstruction
Scattering
Forward scattering
Iterative methods

Advances in Benchmarking Techniques: New Standards and Quantitative Metrics

Conte, T. M. & Hwu, W. M. W., Jan 1 1995, In : Advances in Computers. 41, C, p. 231-253 23 p.

Research output: Contribution to journalArticle

Benchmarking
Systems analysis
Computer workstations
Computer systems
Specifications

Advanced MRI reconstruction toolbox with accelerating on GPU

Wu, X. L., Zhuo, Y., Gai, J., Lam, F., Fu, M., Haldar, J. P., Hwu, W. M., Liang, Z. P. & Sutton, B. P., Feb 11 2011, Proceedings of SPIE-IS and T Electronic Imaging - Parallel Processing for Imaging Applications. 78720Q. (Proceedings of SPIE - The International Society for Optical Engineering; vol. 7872).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Magnetic Resonance Imaging
Magnetic resonance
magnetic resonance
Imaging techniques
Reconstruction Algorithm

Adaptive Cache Management for Energy-Efficient GPU Computing

Chen, X., Chang, L. W., Rodrigues, C. I., Lv, J., Wang, Z. & Hwu, W-M. W., Jan 15 2015, Proceedings - 47th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2014. January ed. IEEE Computer Society, p. 343-355 13 p. 7011400. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO; vol. 2015-January, no. January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Data storage equipment
Energy efficiency
Throughput
Graphics processing unit

Adaptive cache bypass and insertion for many-core accelerators

Chen, X., Wu, S., Chang, L. W., Huang, W. S., Pearson, C., Wang, Z. & Hwu, W. M. W., Jan 1 2014, 2nd ACM International Workshop on Many-Core Embedded Systems, MES 2014 - In Conjunction with the 41st International Symposium on Computer Architecture, ISCA 2014. Association for Computing Machinery, p. 1-8 8 p. (ACM International Conference Proceeding Series).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Particle accelerators
Data storage equipment
Energy efficiency
Graphics processing unit

Achieving high instruction cache performance with an optimizing compiler.

Hwu, W-M. W. & Chang, P. P., May 1 1989, In : Conference Proceedings - Annual Symposium on Computer Architecture. 16, p. 242-251 10 p.

Research output: Contribution to journalConference article

Data storage equipment
Hardware
Bandwidth

Accurate and efficient predicate analysis with binary decision diagrams

Sias, J. W., Hwu, W-M. W. & August, D. I., Dec 1 2000, In : Proceedings of the Annual International Symposium on Microarchitecture. p. 112-123 12 p.

Research output: Contribution to journalConference article

Binary decision diagrams
Flow control
Substrates

Accelerator architectures

Patel, S. J. & Hwu, W-M. W., Oct 17 2008, In : IEEE Micro. 28, 4, p. 4-12 9 p.

Research output: Contribution to journalEditorial

Particle accelerators
Sampling

Accelerator architectures: A ten-year retrospective

Hwu, W. M. & Patel, S., Nov 1 2018, In : IEEE Micro. 38, 6, p. 56-62 7 p., 8585394.

Research output: Contribution to journalArticle

Particle accelerators
Learning systems
Field programmable gate arrays (FPGA)
Education
Throughput

Acceleration of the Pair-HMM Algorithm for DNA Variant Calling

Manikandan, G. J., Huang, S., Rupnow, K., Hwu, W. M. W. & Chen, D., Aug 16 2016, Proceedings - 24th IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2016. Institute of Electrical and Electronics Engineers Inc., 1 p. 7544765. (Proceedings - 24th IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

DNA
Particle accelerators
High level synthesis
System-on-chip

Accelerating sparse deep neural networks on FPGAs

Huang, S., Pearson, C., Nagi, R., Xiong, J., Chen, D. & Hwu, W. M., Sep 2019, 2019 IEEE High Performance Extreme Computing Conference, HPEC 2019. Institute of Electrical and Electronics Engineers Inc., 8916419. (2019 IEEE High Performance Extreme Computing Conference, HPEC 2019).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Field programmable gate arrays (FPGA)
Inference engines
Particle accelerators
Mobile computing
Health care

Accelerating reduction and scan using tensor core units

Dakkak, A., Li, C., Xiong, J., Gelado, I. & Hwu, W. M., Jun 26 2019, ICS 2019 - International Conference on Supercomputing. Association for Computing Machinery, p. 46-57 12 p. (Proceedings of the International Conference on Supercomputing).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tensors
Energy efficiency
Electric power utilization
Bandwidth
Data storage equipment

Accelerating mr image reconstruction on GPUs

Hwu, W. M. W., Nandakumar, D., Haldar, J., Atkinson, I. C., Sutton, B., Liang, Z. P. & Thulborn, K. R., Nov 17 2009, Proceedings - 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, ISBI 2009. p. 1283-1286 4 p. 5193297. (Proceedings - 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, ISBI 2009).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Computer-Assisted Image Processing
Image reconstruction
Imaging techniques
Graphics processing unit

Accelerating iterative field-compensated MR image reconstruction on GPUs

Zhuo, Y., Wu, X. L., Haldar, J. P., Hwu, W. M., Liang, Z. P. & Sutton, B. P., Aug 9 2010, 2010 7th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, ISBI 2010 - Proceedings. p. 820-823 4 p. 5490112. (2010 7th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, ISBI 2010 - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Computer-Assisted Image Processing
Image reconstruction
Magnetic Fields
Magnetic fields
Physics

Accelerating advanced MRI reconstructions on GPUs

Stone, S. S., Haldar, J. P., Tsao, S. C., Hwu, W. M. W., Sutton, B. P. & Liang, Z. P., Oct 1 2008, In : Journal of Parallel and Distributed Computing. 68, 10, p. 1307-1318 12 p.

Research output: Contribution to journalArticle

Magnetic Resonance Imaging
Graphics Processing Unit
Magnetic resonance
Imaging techniques
Percent

Accelerating advanced MRI reconstructions on GPUs

Stone, S. S., Haldar, J. P., Tsao, S. C., Hwu, W. M. W., Liang, Z. P. & Sutton, B. P., Jan 1 2008, Conference on Computing Frontiers - Proceedings of the 2008 Conference on Computing Frontiers, CF'08. Association for Computing Machinery, p. 261-272 12 p. (Conference on Computing Frontiers - Proceedings of the 2008 Conference on Computing Frontiers, CF'08).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Magnetic resonance
Imaging techniques
Image quality
Data storage equipment
Image reconstruction

AccDNN: An IP-Based DNN Generator for FPGAs

Zhang, X., Wang, J., Zhu, C., Lin, Y., Xiong, J., Hwu, W-M. W. & Chen, D., Sep 7 2018, Proceedings - 26th IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2018. Institute of Electrical and Electronics Engineers Inc., 1 p. 8457659. (Proceedings - 26th IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2018).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Field programmable gate arrays (FPGA)
Data storage equipment
Network layers
Cloud computing
Resource allocation