Wen-Mei W Hwu

If you made any changes in Pure these will be visible here soon.

Research Output

2016

A programming system for future proofing performance critical libraries

Chang, L. W., El Hajj, I., Kim, H. S., Gómez-Luna, J., Dakkak, A. & Hwu, W-M. W., Feb 27 2016, 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2016 - Proceedings. Association for Computing Machinery, 32. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP; vol. 12-16-March-2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Architecture

Connors, D. A. & Hwu, W. M. W., Apr 19 2016, The VLSI Handbook: Second Edition. CRC Press, p. 66.1-66.23

Research output: Chapter in Book/Report/Conference proceedingChapter

BLESS 2: Accurate, memory-efficient and fast error correction method

Heo, Y., Ramachandran, A., Hwu, W. M., Ma, J. & Chen, D., Aug 1 2016, In : Bioinformatics. 32, 15, p. 2369-2371 3 p.

Research output: Contribution to journalArticle

Compiler Technology

Chung, W. H. J., Lyu, Y. H., Sung, I. J. R., Lee, Y. W. & Hwu, W. M. W., Jan 1 2016, Heterogeneous System Architecture: A New Compute Platform Infrastructure. Elsevier Inc., p. 97-129 33 p.

Research output: Chapter in Book/Report/Conference proceedingChapter

CUDA application development

Hwu, W. M., May 20 2016, 2008 IEEE Hot Chips 20 Symposium, HCS 2008. Institute of Electrical and Electronics Engineers Inc., 7476522. (2008 IEEE Hot Chips 20 Symposium, HCS 2008).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Design of a power-efficient ARM processor with a timing-error detection and correction mechanism

Chen, S. J., Liu, G., Yang, H. P., Luo, C. H. & Hwu, W. M., Jul 2 2016, Proceedings - 29th IEEE International System on Chip Conference, SOCC 2016. Bhatia, K., Alioto, M., Zhao, D., Marshall, A. & Sridhar, R. (eds.). IEEE Computer Society, p. 217-222 6 p. 7905471. (International System on Chip Conference; vol. 0).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

DySel: Lightweight dynamic selection for kernel-based data-parallel programming model

Chang, L. W., Kim, H. S. & Hwu, W. M., Mar 25 2016, ASPLOS 2016 - 21st International Conference on Architectural Support for Programming Languages and Operating Systems. Association for Computing Machinery, p. 667-680 14 p. (International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS; vol. 02-06-April-2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Efficient and scalable workflows for genomic analyses

Banerjee, S. S., Athreya, A. P., Mainzer, L. S., Jongeneel, C. V., Hwu, W. M., Kalbarczyk, Z. T. & Iyer, R. K., Jun 1 2016, DIDC 2016 - Proceedings of the ACM International Workshop on Data-Intensive Distributed Computing. Association for Computing Machinery, Inc, p. 27-36 10 p. (DIDC 2016 - Proceedings of the ACM International Workshop on Data-Intensive Distributed Computing).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Efficient kernel synthesis for performance portable programming

Chang, L. W., Hajj, I. E., Rodrigues, C., Gomez-Luna, J. & Hwu, W. M., Dec 14 2016, MICRO 2016 - 49th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 7783715. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO; vol. 2016-December).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

FCUDA-HB: Hierarchical and Scalable Bus Architecture Generation on FPGAs With the FCUDA Flow

Chen, Y., Nguyen, T., Chen, Y., Gurumani, S. T., Liang, Y., Rupnow, K., Cong, J., Hwu, W. M. & Chen, D., Dec 2016, In : IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 35, 12, p. 2032-2045 14 p., 7450674.

Research output: Contribution to journalArticle

HPS papers: A retrospective

Patt, Y. N., Hwu, W. M. W., Melvin, S. W. & Shebanow, M. C., Jan 1 2016, In : IEEE Micro. 36, 4, p. 76-79 4 p., 7542473.

Research output: Contribution to journalArticle

In-Place Matrix Transposition on GPUs

Gomez-Luna, J., Sung, I. J., Chang, L. W., Gonzalez-Linares, J. M., Guil, N. & Hwu, W. M. W., Mar 1 2016, In : IEEE Transactions on Parallel and Distributed Systems. 27, 3, p. 776-788 13 p., 7059219.

Research output: Contribution to journalArticle

Introduction

Hwu, W. M. W., Jan 1 2016, Heterogeneous System Architecture: A New Compute Platform Infrastructure. Elsevier Inc., p. 1-5 5 p.

Research output: Chapter in Book/Report/Conference proceedingChapter

KLAP: Kernel launch aggregation and promotion for optimizing dynamic parallelism

Hajj, I. E., Gomez-Luna, J., Li, C., Chang, L. W., Milojicic, D. & Hwu, W. M., Dec 14 2016, MICRO 2016 - 49th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 7783716. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO; vol. 2016-December).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Parallel solutions of inverse multiple scattering problems with born-type fast solvers

Hidayetoǧlu, M., Yang, C., Wang, L., Podkowa, A., Oelze, M., Hwu, W. M. & Chew, W. C., Nov 3 2016, 2016 Progress In Electromagnetics Research Symposium, PIERS 2016 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 916-920 5 p. 7734520. (2016 Progress In Electromagnetics Research Symposium, PIERS 2016 - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Performance insights on executing non-graphics applications on CUDA on the NVIDIA GeForce 8800 GTX

Hwu, W. M., Kiirk, D., Ryoo, S., Rodriigues, C., Stratton, J. & Huang, K., May 31 2016, 2007 IEEE Hot Chips 19 Symposium, HCS 2007. Institute of Electrical and Electronics Engineers Inc., 7482492. (2007 IEEE Hot Chips 19 Symposium, HCS 2007).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Preface

Kirk, D. B. & Hwu, W-M. W., Dec 7 2016, Programming Massively Parallel Processors: A Hands-on Approach: Third Edition. Elsevier Inc., p. xv-xx

Research output: Chapter in Book/Report/Conference proceedingForeword/postscript

Programming Massively Parallel Processors: A Hands-on Approach: Third Edition

Kirk, D. B. & Hwu, W-M. W., Dec 7 2016, Elsevier Inc. 550 p.

Research output: Book/ReportBook

SpaceJMP: Programming with multiple virtual address spaces

El Hajj, I., Merritt, A., Zellweger, G., Milojicic, D., Achermann, R., Faraboschi, P., Hwu, W. M., Roscoe, T. & Schwan, K., Mar 25 2016, ASPLOS 2016 - 21st International Conference on Architectural Support for Programming Languages and Operating Systems. Association for Computing Machinery, p. 353-368 16 p. (International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS; vol. 02-06-April-2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

WebGPU: A scalable online development platform for GPU programming courses

Dakkak, A., Pearson, C. & Hwu, W. M., Jul 18 2016, Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016. Institute of Electrical and Electronics Engineers Inc., p. 942-949 8 p. 7529962. (Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2017

Architecture

Connors, D. A. & Hwu, W. M. W., Jan 1 2017, Mechatronic System Control, Logic, and Data Acquisition. CRC Press

Research output: Chapter in Book/Report/Conference proceedingChapter

Architecture

Connors, D. A. & Hwu, W-M. W., Jan 1 2017, Mechatronic System Control, Logic, and Data Acquisition. CRC Press

Research output: Chapter in Book/Report/Conference proceedingChapter

Chai: Collaborative heterogeneous applications for integrated-Architectures

Ǵomez-Luna, J., Hajj, I. E., Chang, L. W., Garćia-Flores, V., De Gonzalo, S. G., Jablin, T. B., Pẽna, A. J. & Hwu, W. M., Jul 11 2017, ISPASS 2017 - IEEE International Symposium on Performance Analysis of Systems and Software. Institute of Electrical and Electronics Engineers Inc., p. 43-54 12 p. 7975269. (ISPASS 2017 - IEEE International Symposium on Performance Analysis of Systems and Software).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Collaborative (CPU + GPU) algorithms for triangle counting and truss decomposition on the Minsky architecture: Static graph challenge: Subgraph isomorphism

Date, K., Feng, K., Nagi, R., Xiong, J., Kim, N. S. & Hwu, W-M. W., Oct 30 2017, 2017 IEEE High Performance Extreme Computing Conference, HPEC 2017. Institute of Electrical and Electronics Engineers Inc., 8091042. (2017 IEEE High Performance Extreme Computing Conference, HPEC 2017).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Collaborative computing for heterogeneous integrated systems

Chang, L. W., Gómez-Luna, J., El Hajj, I., Huang, S., Chen, D. & Hwu, W. M., Apr 17 2017, ICPE 2017 - Proceedings of the 2017 ACM/SPEC International Conference on Performance Engineering. Association for Computing Machinery, Inc, p. 385-388 4 p. (ICPE 2017 - Proceedings of the 2017 ACM/SPEC International Conference on Performance Engineering).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Comparative performance evaluation of multi-GPU MLFMA implementation for 2-D VIE problems

Pearson, C., Hidayetoglu, M., Ren, W., Chew, W. C. & Hwu, W. M., Jul 25 2017, CEM 2017 - 2017 Computing and Electromagnetics International Workshop. Gurel, L. (ed.). Institute of Electrical and Electronics Engineers Inc., p. 63-64 2 p. 7991888. (CEM 2017 - 2017 Computing and Electromagnetics International Workshop).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Generalize or die: Operating systems support for memristor-based accelerators

Bruel, P., Chalamalasetti, S. R., Dalton, C., Hajj, I. E., Goldman, A., Graves, C., Hwu, W. M., Laplante, P., Milojicic, D., Ndu, G. & Strachan, J. P., Nov 28 2017, 2017 IEEE International Conference on Rebooting Computing, ICRC 2017 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 1-8 8 p. (2017 IEEE International Conference on Rebooting Computing, ICRC 2017 - Proceedings; vol. 2017-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Hardware acceleration of the pair-HMM algorithm for DNA variant calling

Huang, S., Manikandan, G. J., Ramachandran, A., Rupnow, K., Hwu, W. M. W. & Chen, D., Feb 22 2017, FPGA 2017 - Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays. Association for Computing Machinery, Inc, p. 275-284 10 p. (FPGA 2017 - Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Heterogeneous Computing Meets Near-Memory Acceleration and High-Level Synthesis in the Post-Moore Era

Kim, N. S., Chen, D., Xiong, J. & Hwu, W. M. W., Jan 1 2017, In : IEEE Micro. 37, 4, p. 10-18 9 p., 8013455.

Research output: Contribution to journalArticle

Interpretable and globally optimal prediction for textual grounding using image concepts

Yeh, R. A., Xiong, J., Hwu, W. M. W., Do, M. N. & Schwing, A. G., Jan 1 2017, In : Advances in Neural Information Processing Systems. 2017-December, p. 1913-1923 11 p.

Research output: Contribution to journalConference article

Large inverse-scattering solutions with DBIM on GPU-enabled supercomputers

Hidayetogglu, M., Pearson, C., Chew, W. C., Gurel, L. & Hwu, W-M. W., May 1 2017, 2017 International Applied Computational Electromagnetics Society Symposium - Italy, ACES 2017. Institute of Electrical and Electronics Engineers Inc., 7916310. (2017 International Applied Computational Electromagnetics Society Symposium - Italy, ACES 2017).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Preface

Kirk, D. B. & Hwu, W. M. W., Jan 1 2017, Programming Massively Parallel Processors: A Hands-on Approach: Third Edition. Elsevier Inc., p. xv-xx

Research output: Chapter in Book/Report/Conference proceedingForeword/postscript

RAI: A scalable project submission system for parallel programming courses

Dakkak, A., Pearson, C., Li, C. & Hwu, W. M., Jun 30 2017, Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017. Institute of Electrical and Electronics Engineers Inc., p. 315-322 8 p. 7965062. (Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Rebooting the data access hierarchy of computing systems

Hwu, W-M. W., Hajj, I. E., De Gonzalo, S. G., Pearson, C., Kim, N. S., Chen, D., Xiong, J. & Sura, Z., Nov 28 2017, 2017 IEEE International Conference on Rebooting Computing, ICRC 2017 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 1-4 4 p. (2017 IEEE International Conference on Rebooting Computing, ICRC 2017 - Proceedings; vol. 2017-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Scalable parallel DBIM solutions of inverse-scattering problems

Hidayetogglu, M., Pearson, C., Gurel, L., Hwu, W. M. & Chew, W. C., Jul 25 2017, CEM 2017 - 2017 Computing and Electromagnetics International Workshop. Gurel, L. (ed.). Institute of Electrical and Electronics Engineers Inc., p. 65-66 2 p. 7991889. (CEM 2017 - 2017 Computing and Electromagnetics International Workshop).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Thoughts on massively-parallel heterogeneous computing for solving large problems

Hwu, W. M., Hidayetogglu, M., Chew, W. C., Pearson, C., Garcia, S., Huang, S. & Dakkak, A., Jul 25 2017, CEM 2017 - 2017 Computing and Electromagnetics International Workshop. Gurel, L. (ed.). Institute of Electrical and Electronics Engineers Inc., p. 67-68 2 p. 7991890. (CEM 2017 - 2017 Computing and Electromagnetics International Workshop).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2018

AccDNN: An IP-Based DNN Generator for FPGAs

Zhang, X., Wang, J., Zhu, C., Lin, Y., Xiong, J., Hwu, W. M. & Chen, D., Sep 7 2018, Proceedings - 26th IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2018. Institute of Electrical and Electronics Engineers Inc., 1 p. 8457659. (Proceedings - 26th IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2018).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Accelerator architectures: A ten-year retrospective

Hwu, W. M. & Patel, S., Nov 1 2018, In : IEEE Micro. 38, 6, p. 56-62 7 p., 8585394.

Research output: Contribution to journalArticle

A fast and massively-parallel inverse solver for multiple-scattering tomographic image reconstruction

Hidayetoglu, M., Pearson, C., El Hajj, I., Gurel, L., Chew, W. C. & Hwu, W. M., Aug 3 2018, Proceedings - 2018 IEEE 32nd International Parallel and Distributed Processing Symposium, IPDPS 2018. Institute of Electrical and Electronics Engineers Inc., p. 64-74 11 p. 8425161. (Proceedings - 2018 IEEE 32nd International Parallel and Distributed Processing Symposium, IPDPS 2018).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Application-Transparent near-memory processing architecture with memory channel network

Alian, M., Min, S. W., Asgharimoghaddam, H., Dhar, A., Wang, D. K., Roewer, T., McPadden, A., O'Halloran, O., Chen, D., Xiong, J., Kim, D., Hwu, W. M. & Kim, N. S., Dec 12 2018, Proceedings - 51st Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2018. IEEE Computer Society, p. 802-814 13 p. 8574587. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO; vol. 2018-October).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Architecture

Connors, D. A. & Hwu, W. M. W., Jan 1 2018, Mechatronic System Control, Logic, and Data Acquisition. CRC Press, p. 24-1-24-22

Research output: Chapter in Book/Report/Conference proceedingChapter

Collaborative (CPU + GPU) Algorithms for Triangle Counting and Truss Decomposition

Mailthody, V. S., Date, K., Qureshi, Z., Pearson, C., Nagi, R., Xiong, J. & Hwu, W. M., Nov 26 2018, 2018 IEEE High Performance Extreme Computing Conference, HPEC 2018. Institute of Electrical and Electronics Engineers Inc., 8547517. (2018 IEEE High Performance Extreme Computing Conference, HPEC 2018).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

DNNBuilder: An automated tool for building high-performance DNN hardware accelerators for FPGAs

Zhang, X., Wang, J., Zhu, C., Lin, Y., Xiong, J., Hwu, W. M. & Chen, D., Nov 5 2018, 2018 IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2018 - Digest of Technical Papers. Institute of Electrical and Electronics Engineers Inc., a56. (IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Enabling GPU support for the COMPSs-mobile framework

Lordan, F., Badia, R. M. & Hwu, W. M., Jan 1 2018, Accelerator Programming Using Directives - 4th International Workshop, WACCPD 2017, Held in Conjunction with the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2017, Proceedings. Juckeland, G. & Chandrasekaran, S. (eds.). Springer-Verlag Berlin Heidelberg, p. 83-102 20 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 10732 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

High-throughput Ant Colony Optimization on graphics processing units

Cecilia, J. M., Llanes, A., Abellán, J. L., Gómez-Luna, J., Chang, L. W. & Hwu, W. M. W., Mar 2018, In : Journal of Parallel and Distributed Computing. 113, p. 261-274 14 p.

Research output: Contribution to journalArticle

NUMA-Aware Data-Transfer Measurements for Power/NVLink Multi-GPU Systems

Pearson, C., Chung, I. H., Sura, Z., Hwu, W. M. & Xiong, J., Jan 1 2018, High Performance Computing - ISC High Performance 2018 International Workshops, Revised Selected Papers. Weiland, M., Yokota, R., Alam, S. & Shalf, J. (eds.). Springer-Verlag Berlin Heidelberg, p. 448-454 7 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 11203 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Seeing the invisible: Limited-view imaging with multiple-scattering reconstruction

Hidayetoglu, M., Hwu, W. M. & Chew, W. C., Feb 21 2018, 2018 United States National Committee of URSI National Radio Science Meeting, USNC-URSI NRSM 2018. Institute of Electrical and Electronics Engineers Inc., p. 1-2 2 p. (2018 United States National Committee of URSI National Radio Science Meeting, USNC-URSI NRSM 2018; vol. 2018-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Semi-Coherent DMA: An Alternative I/O Coherency Management for Embedded Systems

Min, S., Alian, M., Hwu, W. M. & Kim, N. S., Jul 1 2018, In : IEEE Computer Architecture Letters. 17, 2, p. 221-224 4 p., 8444757.

Research output: Contribution to journalArticle

Supercomputing for Full-Wave Tomographic Image Reconstruction in Near-Real Time

Hidayetoǧlu, M., Hwu, W. M. & Cho Chew, W., Jan 1 2018, 2018 IEEE Antennas and Propagation Society International Symposium and USNC/URSI National Radio Science Meeting, APSURSI 2018 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 1841-1842 2 p. 8608869. (2018 IEEE Antennas and Propagation Society International Symposium and USNC/URSI National Radio Science Meeting, APSURSI 2018 - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution