Wen-Mei W Hwu

1984 …2019
If you made any changes in Pure, your changes will be visible here soon.

Fingerprint Fingerprint is based on mining the text of the expert's scholarly documents to create an index of weighted terms, which defines the key subjects of each individual researcher.

  • 15 Similar Profiles
Data storage equipment Engineering & Materials Science
Hardware Engineering & Materials Science
Program processors Engineering & Materials Science
Particle accelerators Engineering & Materials Science
Scheduling Engineering & Materials Science
Field programmable gate arrays (FPGA) Engineering & Materials Science
Compiler Mathematics
Parallel programming Engineering & Materials Science

Network Recent external collaboration on country level. Dive into details by clicking on the dots.

Research Output 1984 2019

Accelerating reduction and scan using tensor core units

Dakkak, A., Li, C., Xiong, J., Gelado, I. & Hwu, W. M., Jun 26 2019, ICS 2019 - International Conference on Supercomputing. Association for Computing Machinery, p. 46-57 12 p. (Proceedings of the International Conference on Supercomputing).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tensors
Energy efficiency
Electric power utilization
Bandwidth
Data storage equipment

Analysis and modeling of collaborative execution strategies for heterogeneous CPU-FPGA architectures

Huang, S., De Gonzalo, S. G., El-Hadedy, M., Chang, L. W., Gómez-Luna, J., Milojicic, D., El Hajj, I., Chalamalasetti, S. R., Mutlu, O., Chen, D. & Hwu, W. M., Apr 4 2019, ICPE 2019 - Proceedings of the 2019 ACM/SPEC International Conference on Performance Engineering. Association for Computing Machinery, Inc, p. 79-90 12 p. (ICPE 2019 - Proceedings of the 2019 ACM/SPEC International Conference on Performance Engineering).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Program processors
Field programmable gate arrays (FPGA)
Particle accelerators
Data storage equipment
Computer programming languages

An efficient GPU implementation technique for higher-order 3D stencils

Anjum, O., Simon, G. D. G., Hidayetoglu, M. & Hwu, W. M., Aug 2019, Proceedings - 21st IEEE International Conference on High Performance Computing and Communications, 17th IEEE International Conference on Smart City and 5th IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2019. Xiao, Z., Yang, L. T., Balaji, P., Li, T., Li, K. & Zomaya, A. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 552-561 10 p. 8855722. (Proceedings - 21st IEEE International Conference on High Performance Computing and Communications, 17th IEEE International Conference on Smart City and 5th IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2019).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Data storage equipment
Bandwidth
Graphics processing unit
Grid
Scaling

Automatic Generation of Warp-Level Primitives and Atomic Instructions for Fast and Portable Parallel Reduction on GPUs

Gonzalo, S. G. D., Huang, S., Gomez-Luna, J., Hammond, S., Mutlu, O. & Hwu, W. M., Mar 5 2019, CGO 2019 - Proceedings of the 2019 IEEE/ACM International Symposium on Code Generation and Optimization. Moseley, T., Jimborean, A. & Kandemir, M. T. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 73-84 12 p. 8661187. (CGO 2019 - Proceedings of the 2019 IEEE/ACM International Symposium on Code Generation and Optimization).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Portability
Shuffle
Hardware
Domain-specific Languages
Programming

DeepStore: In-storage acceleration for intelligent queries

Mailthody, V. S., Qureshi, Z., Liang, W., Feng, Z., Gonzalo, S. G. D., Li, Y., Franke, H., Xiong, J., Huang, J. & Hwu, W. M., Oct 12 2019, MICRO 2019 - 52nd Annual IEEE/ACM International Symposium on Microarchitecture, Proceedings. IEEE Computer Society, p. 224-238 15 p. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Particle accelerators
Energy efficiency
Texturing
Image retrieval
Simulators