Wen-Mei W Hwu

If you made any changes in Pure these will be visible here soon.

Research Output

Filter
Conference contribution
Conference contribution

Superblock formation using static program analysis

Hank, R. E., Mahlke, S. A., Bringmann, R. A., Gyllenhaal, J. C. & Hwu, W-M. W., Jan 1 1994, Proceedings of the Annual International Symposium on Microarchitecture. Anon (ed.). Publ by IEEE, p. 247-255 9 p. (Proceedings of the Annual International Symposium on Microarchitecture).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Supercomputing for Full-Wave Tomographic Image Reconstruction in Near-Real Time

Hidayetoǧlu, M., Hwu, W. M. & Cho Chew, W., Jan 1 2018, 2018 IEEE Antennas and Propagation Society International Symposium and USNC/URSI National Radio Science Meeting, APSURSI 2018 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 1841-1842 2 p. 8608869. (2018 IEEE Antennas and Propagation Society International Symposium and USNC/URSI National Radio Science Meeting, APSURSI 2018 - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Systematic prototyping of superscalar computer architectures

Conte, T. M. & Hwu, W-M. W., Jan 1 1992, Proceedings - 3rd International Workshop on Rapid System Prototyping: Shortening the Path from Specification to Prototype, RSP 1992. IEEE Computer Society, p. 161-170 10 p. 243910. (Proceedings of the International Workshop on Rapid System Prototyping; vol. 1992-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

The application of compiler-assisted multiple-instruction retry to VLIW architectures

Chen, S. K., Fuchs, W. K. & Hwu, W. M. W., Jan 1 1994, Proceedings of IEEE Workshop on Fault-Tolerant Parallel and Distributed Systems, FTPDS 1994. Pradhan, D. & Avresky, D. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 51-58 8 p. 494474. (Proceedings of IEEE Workshop on Fault-Tolerant Parallel and Distributed Systems, FTPDS 1994).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

The benefit of predicated execution for software pipelining

Warter, N. J., Lavery, D. R. & Hwu, W. M. W., Jan 1 1993, Proceedings of the 26th Hawaii International Conference on System Sciences, HICSS 1993. IEEE Computer Society, p. 497-506 10 p. 1198122. (Proceedings of the Annual Hawaii International Conference on System Sciences; vol. 1).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

The design and implementation of the wolfram language compiler

Dakkak, A., Wickham-Jones, T. & Hwu, W. M., Feb 22 2020, CGO 2020 - Proceedings of the 18th ACM/IEEE International Symposium on Code Generation and Optimization. Mars, J., Tang, L., Xue, J. & Wu, P. (eds.). Association for Computing Machinery, Inc, p. 212-228 17 p. (CGO 2020 - Proceedings of the 18th ACM/IEEE International Symposium on Code Generation and Optimization).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Open Access

The future of computer architecture research: An industrial perspective

Hwu, W. M. & Patel, S., Dec 12 2005, Proceedings - 11th International Symposium on High-Performance Computer Architecture, HPCA-11 2005. 1 p. (Proceedings - International Symposium on High-Performance Computer Architecture).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Thoughts on massively-parallel heterogeneous computing for solving large problems

Hwu, W. M., Hidayetogglu, M., Chew, W. C., Pearson, C., Garcia, S., Huang, S. & Dakkak, A., Jul 25 2017, CEM 2017 - 2017 Computing and Electromagnetics International Workshop. Gurel, L. (ed.). Institute of Electrical and Electronics Engineers Inc., p. 67-68 2 p. 7991890. (CEM 2017 - 2017 Computing and Electromagnetics International Workshop).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Throughput-oriented kernel porting onto FPGAs

Papakonstantinou, A., Chen, D., Hwu, W. M., Cong, J. & Yun, L., Jul 12 2013, Proceedings of the 50th Annual Design Automation Conference, DAC 2013. 11. (Proceedings - Design Automation Conference).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tolerating data access latency with register preloading

Chen, W. Y., Mahlke, S. A., Hwu, W. M. W., Kiyohara, T. & Chang, P. P., Aug 1 1992, Proceedings of the 6th International Conference on Supercomputing, ICS 1992. Association for Computing Machinery, p. 104-113 10 p. (Proceedings of the International Conference on Supercomputing; vol. Part F129617).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Transitioning HPC software to exascale heterogeneous computing

Hwu, W-M. W., Chang, L. W., Kim, H. S., Dakkak, A. & El Hajj, I., Sep 2 2015, 2015 Computational Electromagnetics International Workshop, CEM 2015. Institute of Electrical and Electronics Engineers Inc., p. 4-5 2 p. 7237412. (2015 Computational Electromagnetics International Workshop, CEM 2015).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Triangle Counting and Truss Decomposition using FPGA

Huang, S., El-Hadedy, M., Hao, C., Li, Q., Mailthody, V. S., Date, K., Xiong, J., Chen, D., Nagi, R. & Hwu, W. M., Nov 26 2018, 2018 IEEE High Performance Extreme Computing Conference, HPEC 2018. Institute of Electrical and Electronics Engineers Inc., 8547536. (2018 IEEE High Performance Extreme Computing Conference, HPEC 2018).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TrIMS: Transparent and isolated model sharing for low latency deep learning inference in function-as-a-service

Dakkak, A., Li, C., De Gonzalo, S. G., Xiong, J. & Hwu, W. M., Jul 2019, Proceedings - 2019 IEEE International Conference on Cloud Computing, CLOUD 2019 - Part of the 2019 IEEE World Congress on Services. Bertino, E., Chang, C. K., Chen, P., Damiani, E., Goul, M. & Oyama, K. (eds.). IEEE Computer Society, p. 372-382 11 p. 8814494. (IEEE International Conference on Cloud Computing, CLOUD; vol. 2019-July).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Triolet: A programming system that unifies algorithmic skeleton interfaces for high-performance cluster computing

Rodrigues, C., Jablin, T., Dakkak, A. & Hwu, W. M., Mar 10 2014, PPoPP 2014 - Proceedings of the 2014 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 247-258 12 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Update on k-truss Decomposition on GPU

Almasri, M., Anjum, O., Pearson, C., Qureshi, Z., Mailthody, V. S., Nagi, R., Xiong, J. & Hwu, W. M., Sep 2019, 2019 IEEE High Performance Extreme Computing Conference, HPEC 2019. Institute of Electrical and Electronics Engineers Inc., 8916285. (2019 IEEE High Performance Extreme Computing Conference, HPEC 2019).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Update on triangle counting on GPU

Pearson, C., Almasri, M., Anjum, O., Mailthody, V. S., Qureshi, Z., Nagi, R., Xiong, J. & Hwu, W. M., Sep 2019, 2019 IEEE High Performance Extreme Computing Conference, HPEC 2019. Institute of Electrical and Electronics Engineers Inc., 8916547. (2019 IEEE High Performance Extreme Computing Conference, HPEC 2019).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Using profile information to assist advanced compiler optimization and scheduling

Chen, W., Bringmann, R., Mahlke, S., Anik, S., Kiyohara, T., Warter, N., Lavery, D., Hwu, W. M., Hank, R. & Gyllenhaal, J., Jan 1 1993, Languages and Compilers for Parallel Computing - 5th International Workshop, Proceedings. Padua, D., Nicolau, A., Gelernter, D. & Banerjee, U. (eds.). Springer-Verlag, p. 31-48 18 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 757 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Vacuum packing: Extracting hardware-detected program phases for post-link optimization

Barnes, R. D., Nystrom, E. M., Merten, M. C. & Hwu, W-M. W., Jan 1 2002, Proceedings - 35th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2002. IEEE Computer Society, p. 233-244 12 p. 1176253. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO; vol. 2002-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Visualization and analysis of GPU summer school applicants and participants

Wah, E., Johnson, E., Auvil, L., Thakkar, U., Hwu, W. M., Kirk, D., Dunning, T. H. & Glotzer, S. C., Dec 1 2008, Proceedings - 4th IEEE International Conference on eScience, eScience 2008. p. 362-363 2 p. 4736797. (Proceedings - 4th IEEE International Conference on eScience, eScience 2008).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

WebGPU: A scalable online development platform for GPU programming courses

Dakkak, A., Pearson, C. & Hwu, W. M., Jul 18 2016, Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016. Institute of Electrical and Electronics Engineers Inc., p. 942-949 8 p. 7529962. (Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

XMalloc: A scalable lock-free dynamic memory allocator for many-core machines

Huang, X., Rodrigues, C. I., Jones, S., Buck, I. & Hwu, W. M., Nov 19 2010, Proceedings - 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, ScalCom-2010. p. 1134-1139 6 p. 5577907. (Proceedings - 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, ScalCom-2010).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Xprof profiling the execution of x window programs

Gupta, A. & Hwu, W. M. W., Jun 1 1992, Proceedings of the 1992 ACM SIGMETRICS Joint International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS/PERFORMANCE 1992. Gaither, B. D. (ed.). Association for Computing Machinery, Inc, p. 253-254 2 p. (Proceedings of the 1992 ACM SIGMETRICS Joint International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS/PERFORMANCE 1992).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

XSP: Across-Stack Profiling and Analysis of Machine Learning Models on GPUs

Li, C., Dakkak, A., Xiong, J., Wei, W., Xu, L. & Hwu, W. M., May 2020, Proceedings - 2020 IEEE 34th International Parallel and Distributed Processing Symposium, IPDPS 2020. Institute of Electrical and Electronics Engineers Inc., p. 326-327 2 p. 9139875. (Proceedings - 2020 IEEE 34th International Parallel and Distributed Processing Symposium, IPDPS 2020).

Research output: Chapter in Book/Report/Conference proceedingConference contribution