Wen-Mei W Hwu

1984 …2019
If you made any changes in Pure, your changes will be visible here soon.

Research Output 1984 2019

Filter
Conference contribution

Comparison of full and partial predicated execution support for ILP processors

Mahlke, S. A., Hank, R. E., McCormick, J. E., August, D. I. & Hwu, W-M. W., 1995, ACM SIGARCH (Association for Computing Nachinery Special Interest Group on Computer Architecture) - Conference Proceedings. ACM, p. 138-149 12 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Inductive logic programming (ILP)
Code generation

COMPARISON OF SEVERAL EVOLVING (UNIVERSITY) SUPERCOMPUTER ARCHITECTURES.

Patt, Y. N., Sheldon, R. G., Shebanow, M., Ponder, C. & Hwu, W-M. W., 1984, Unknown Host Publication Title. IEEE, p. 15-26 12 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Supercomputers

Compiler code transformations for superscalar-based high-performance systems

Mahlke, S. A., Chen, W. Y., Gyuenhaal, J. C., Hwu, W-M. W., Chang, P. P. & Kiyohara, T., Dec 1 1992, Proceedings of the 1992 ACM/IEEE conference on Supercomputing, Supercomputing 1992. Werner, R. (ed.). Association for Computing Machinery, p. 808-817 10 p. (Proceedings of the International Conference on Supercomputing; vol. Part F129723).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Supercomputers

Control flow optimization for supercomputer scalar processing

Chang, P. P. & Hwu, W-M. W., Jun 1 1989, Proceedings of the 3rd International Conference on Supercomputing, ICS 1989. Association for Computing Machinery, p. 145-153 9 p. (Proceedings of the International Conference on Supercomputing; vol. Part F130180).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Supercomputers
Flow control
Processing
Pipelines
Hardware

Corezilla: Build and tame the multicore beast?

Sarno, L., Hwu, W. M. W., Lund, C., Levy, M., Larus, J. R., Reinders, J., Cameron, G., Lennard, C. & Corporation, T., Aug 2 2007, 2007 44th ACM/IEEE Design Automation Conference, DAC'07. p. 632-633 2 p. 4261259. (Proceedings - Design Automation Conference).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Software engineering
Systems analysis
Hardware

CRITICAL ISSUES REGARDING HPS, A HIGH PERFORMANCE MICROARCHITECTURE.

Patt, Y. N., Melvin, S. W., Hwu, W. M. & Shebanow, M. C., Dec 1 1985, MICRO: Annual Microprogramming Workshop. ACM, p. 109-116 8 p. (MICRO: Annual Microprogramming Workshop).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

CUBA: An architecture for efficient CPU/Co-processor data communication

Gelado, I., Kelm, J. H., Ryoo, S., Lumetta, S. S., Navarro, N. & Hwu, W-M. W., Dec 15 2008, ICS'08 - Proceedings of the 2008 ACM International Conference on Supercomputing. p. 299-308 10 p. (Proceedings of the International Conference on Supercomputing).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Program processors
Communication
Data structures
Data storage equipment
Coprocessor

CUDA application development

Hwu, W. M., May 20 2016, 2008 IEEE Hot Chips 20 Symposium, HCS 2008. Institute of Electrical and Electronics Engineers Inc., 7476522. (2008 IEEE Hot Chips 20 Symposium, HCS 2008).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

CUDA-Lite: Reducing GPU programming complexity

Ueng, S. Z., Lathara, M., Baghsorkhi, S. S. & Hwu, W. M. W., Dec 1 2008, Languages and Compilers for Parallel Computing - 21st International Workshop, LCPC 2008, Revised Selected Papers. p. 1-15 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5335 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Computer programming
Programming
Many-core
Data storage equipment
Coding

Data access microarchitectures for superscalar processors with compiler-assisted data prefetching

Chen, W. Y., Mahlke, S. A., Chang, P. P. & Hwu, W-M. W., Sep 1 1991, MICRO 1991 - Proceedings of the 24th Annual International Symposium on Microarchitecture. IEEE Computer Society, p. 69-73 5 p. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Pollution
Data storage equipment

Data layout transformation exploiting memory-level parallelism in structured grid many-core applications

Sung, I. J., Stratton, J. A. & Hwu, W. M. W., Jan 1 2010, PACT'10 - Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques. Institute of Electrical and Electronics Engineers Inc., p. 513-522 10 p. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT; vol. 2010).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Many-core
Parallelism
Layout
Grid
Data storage equipment

Data relocation and prefetching for programs with large data sets

Yamada, Y., Gyllenhall, J., Haab, G. & Hwu, W-M. W., Nov 30 1994, Proceedings of the 27th Annual International Symposium on Microarchitecture, MICRO 1994. IEEE Computer Society, p. 118-127 10 p. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO; vol. Part F129425).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Copying
Relocation
Hardware
Data storage equipment

DeepStore: In-storage acceleration for intelligent queries

Mailthody, V. S., Qureshi, Z., Liang, W., Feng, Z., Gonzalo, S. G. D., Li, Y., Franke, H., Xiong, J., Huang, J. & Hwu, W. M., Oct 12 2019, MICRO 2019 - 52nd Annual IEEE/ACM International Symposium on Microarchitecture, Proceedings. IEEE Computer Society, p. 224-238 15 p. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Particle accelerators
Energy efficiency
Texturing
Image retrieval
Simulators

Design evaluation of OpenCL compiler framework for coarse-grained reconfigurable arrays

Kim, H. S., Ahn, M., Stratton, J. A. & Hwu, W-M. W., Dec 1 2012, FPT 2012 - 2012 International Conference on Field-Programmable Technology. p. 313-320 8 p. 6412155. (FPT 2012 - 2012 International Conference on Field-Programmable Technology).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Compiler
Evaluation
Parallel Programming
kernel
Programming Model

Design of a power-efficient ARM processor with a timing-error detection and correction mechanism

Chen, S. J., Liu, G., Yang, H. P., Luo, C. H. & Hwu, W-M. W., Jul 2 2016, Proceedings - 29th IEEE International System on Chip Conference, SOCC 2016. Bhatia, K., Alioto, M., Zhao, D., Marshall, A. & Sridhar, R. (eds.). IEEE Computer Society, p. 217-222 6 p. 7905471. (International System on Chip Conference; vol. 0).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

ARM processors
Error detection
Error correction
Electric potential
Mobile devices

Direct numerical simulation of turbulent flow in a square duct using a Graphics Processing Unit (GPU)

Shinn, A. F., Vanka, S. P. & Hwu, W-M. W., 2010, 40th AIAA Fluid Dynamics Conference. 2010-5029

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Direct numerical simulation
Ducts
Turbulent flow
Large eddy simulation
Reynolds number

DL: A data layout transformation system for heterogeneous computing

Sung, I. J., Liu, G. D. & Hwu, W. M. W., Dec 12 2012, 2012 Innovative Parallel Computing, InPar 2012. 6339606. (2012 Innovative Parallel Computing, InPar 2012).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Data storage equipment
Program processors
Bandwidth
Dynamic random access storage
Graphics processing unit

DNNBuilder: An automated tool for building high-performance DNN hardware accelerators for FPGAs

Zhang, X., Wang, J., Zhu, C., Lin, Y., Xiong, J., Hwu, W-M. W. & Chen, D., Nov 5 2018, 2018 IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2018 - Digest of Technical Papers. Institute of Electrical and Electronics Engineers Inc., a56. (IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Particle accelerators
Field programmable gate arrays (FPGA)
Hardware
Throughput
Cloud computing

Dynamic memory disambiguation using the memory conflict buffer

Gallagher, D. M., Chen, W. Y., Mahlke, S. A., Gyllenhaal, J. C. & Hwu, W-M. W., Nov 1 1994, Proceedings of the 6th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS 1994. Association for Computing Machinery, p. 183-193 11 p. (International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS; vol. Part F129531).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Data storage equipment
Scheduling
Computer hardware
Repair

DySel: Lightweight dynamic selection for kernel-based data-parallel programming model

Chang, L. W., Kim, H. S. & Hwu, W. M., Mar 25 2016, ASPLOS 2016 - 21st International Conference on Architectural Support for Programming Languages and Operating Systems. Association for Computing Machinery, p. 667-680 14 p. (International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS; vol. 02-06-April-2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Parallel programming
Hardware

Efficient and scalable workflows for genomic analyses

Banerjee, S. S., Athreya, A. P., Mainzer, L. S., Jongeneel, C., Hwu, W-M. W., Kalbarczyk, Z. T. & Iyer, R. K., Jun 1 2016, DIDC 2016 - Proceedings of the ACM International Workshop on Data-Intensive Distributed Computing. Association for Computing Machinery, Inc, p. 27-36 10 p. (DIDC 2016 - Proceedings of the ACM International Workshop on Data-Intensive Distributed Computing).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Work Flow
Genomics
DNA sequences
Parallel processing systems
Cloud computing

Efficient compilation of fine-grained SPMD-threaded programs for multicore CPUs

Stratton, J. A., Grover, V., Marathe, J., Aarts, B., Murphy, M., Hu, Z. & Hwu, W-M. W., Jul 1 2010, Proceedings of the 2010 CGO - The 8th International Symposium on Code Generation and Optimization. p. 111-119 9 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Compilation
Program processors
Thread
Programming Model
Multithreading

Efficient kernel synthesis for performance portable programming

Chang, L. W., Hajj, I. E., Rodrigues, C., Gomez-Luna, J. & Hwu, W. M., Dec 14 2016, MICRO 2016 - 49th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 7783715. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO; vol. 2016-December).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chemical analysis
Computer systems programming
Energy efficiency
Costs
Tuning

Efficient pattern-based time series classification on GPU

Chang, K. W., Deka, B., Hwu, W-M. W. & Roth, D., 2012, Proceedings - 12th IEEE International Conference on Data Mining, ICDM 2012. p. 131-140 10 p. 6413748

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chemical reactions
Time series
Dynamic programming

Efficient performance evaluation of memory hierarchy for highly multithreaded graphics processors

Baghsorkhi, S. S., Gelado, I., Delahaye, M. & Hwu, W-M. W., Mar 22 2012, PPoPP'12 - Proceedings of the 2012 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 23-33 11 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Data storage equipment
Sampling
Monitoring
Hardware
Graphics processing unit

Enabling GPU support for the COMPSs-mobile framework

Lordan, F., Badia, R. M. & Hwu, W. M., Jan 1 2018, Accelerator Programming Using Directives - 4th International Workshop, WACCPD 2017, Held in Conjunction with the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2017, Proceedings. Juckeland, G. & Chandrasekaran, S. (eds.). Springer-Verlag, p. 83-102 20 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 10732 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Heterogeneous Computing
Mobile Applications
Mobile Devices
Programming Model
Energy Consumption

Enhancing the Usability and Utilization of Accelerated Architectures via Docker

Haydel, N., Gesing, S., Taylor, I., Madey, G., Dakkak, A., De Gonzalo, S. G. & Hwu, W. M. W., Jan 1 2015, Proceedings - 2015 IEEE/ACM 8th International Conference on Utility and Cloud Computing, UCC 2015. Rana, O., Buyya, R. & Raicu, I. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 361-367 7 p. 7431432. (Proceedings - 2015 IEEE/ACM 8th International Conference on Utility and Cloud Computing, UCC 2015).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Containers
Application programming interfaces (API)
Particle accelerators
Interfaces (computer)
Program processors

Evaluating characteristics of CUDA communication primitives on high-bandwidth interconnects

Pearson, C., Dakkak, A., Hashash, S., Li, C., Chung, I. H., Xiong, J. & Hwu, W-M. W., Apr 4 2019, ICPE 2019 - Proceedings of the 2019 ACM/SPEC International Conference on Performance Engineering. Association for Computing Machinery, Inc, p. 209-218 10 p. (ICPE 2019 - Proceedings of the 2019 ACM/SPEC International Conference on Performance Engineering).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Open Access
Data transfer
Bandwidth
Program processors
Communication
Data storage equipment

EXPERIMENTS WITH HPS, A RESTRICTED DATA FLOW MICROARCHITECTURE FOR HIGH PERFORMANCE COMPUTERS.

Patt, Y., Hwu, W. M., Melvin, S., Shebanow, M., Chen, C. & Wei, J., Jan 1 1986, Proceedings - IEEE Computer Society International Conference. Bell, A. G. (ed.). IEEE, p. 254-258 5 p. (Proceedings - IEEE Computer Society International Conference).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Substrates
Experiments
Engines

EXPLOITING HORIZONTAL AND VERTICAL CONCURRENCY VIA THE HPSM MICROPROCESSOR.

Hwu, W. M. W. & Patt, Y. N., Dec 1 1987, MICRO: Annual Microprogramming Workshop. ACM, p. 154-161 8 p. (MICRO: Annual Microprogramming Workshop).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Reduced instruction set computing
Microprocessor chips
Fabrication
Experiments

Exploiting more parallelism from applications having generalized reductions on GPU architectures

Wu, X. L., Obeid, N. & Hwu, W-M. W., Nov 19 2010, Proceedings - 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, ScalCom-2010. p. 1175-1180 6 p. 5577899. (Proceedings - 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, ScalCom-2010).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Graphics processing unit
Communication

EXPLOITING PARALLEL MICROPROCESSOR MICROARCHITECTURES WITH A COMPILER CODE GENERATOR.

Hwu, W. M. W. & Chang, P. P., Jan 1 1988, Unknown Host Publication Title. IEEE, p. 45-53 9 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Microprocessor chips
Subroutines
Experiments
Costs

FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs

Papakonstantinou, A., Gururaj, K., Stratton, J. A., Chen, D., Cong, J. & Hwu, W. M. W., Nov 11 2009, 2009 IEEE 7th Symposium on Application Specific Processors, SASP 2009. p. 35-42 8 p. 5226333. (2009 IEEE 7th Symposium on Application Specific Processors, SASP 2009).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Field programmable gate arrays (FPGA)
Application programming interfaces (API)
Parallel processing systems
Thermal effects
Particle accelerators

FlatFlash: Exploiting the Byte-Accessibility of SSDs within A Unified Memory-Storage Hierarchy

Abulila, A., Mailthody, V. S., Qureshi, Z., Huang, J., Kim, N. S., Xiong, J. & Hwu, W. M., Apr 4 2019, ASPLOS 2019 - 24th International Conference on Architectural Support for Programming Languages and Operating Systems. Association for Computing Machinery, p. 971-985 15 p. (International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Open Access
Data storage equipment
Dynamic random access storage
Flash-based SSDs
Cost effectiveness
Metadata

"Flea-flicker" Multipass pipelining: An alternative to the high-power out-of-order offense

Barnes, R. D., Ryoo, S. & Hwu, W-M. W., Dec 1 2005, MICRO-38: Proceedings of the 38th Annual IEEE/ACM International Symposium on Microarchitecture. p. 319-330 12 p. 1540970. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Scheduling
Pipelines
Energy efficiency
Microprocessor chips
Hardware

Forward semantic: A compiler-assisted instruction fetch method for heavily pipelined processors

Chang, P. P. & Hwu, W-M. W., Aug 1 1989, Proceedings of the Annual International Symposium on Microarchitecture, MICRO. Allan, V. H. (ed.). IEEE Computer Society, p. 188-198 11 p. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Pipelines
Semantics
Costs
UNIX
Computer aided design

FPGA/DNN co-design: An efficient design methodology for IoT intelligence on the edge

Hao, C., Zhang, X., Li, Y., Huang, S., Xiong, J., Rupnow, K., Hwu, W-M. W. & Chen, D., Jun 2 2019, Proceedings of the 56th Annual Design Automation Conference 2019, DAC 2019. Institute of Electrical and Electronics Engineers Inc., a206. (Proceedings - Design Automation Conference).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Co-design
Field Programmable Gate Array
Design Methodology
Field programmable gate arrays (FPGA)
Accelerator

FPGA accelerated DNA error correction

Ramachandran, A., Heo, Y., Hwu, W-M. W., Ma, J. & Chen, D., Apr 22 2015, Proceedings of the 2015 Design, Automation and Test in Europe Conference and Exhibition, DATE 2015. Institute of Electrical and Electronics Engineers Inc., Vol. 2015-April. p. 1371-1376 6 p. 7092605

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Error correction
Field programmable gate arrays (FPGA)
DNA
Genes
Throughput

Framework for balancing control flow and predication

August, D. I., Hwu, W. M. W. & Mahlke, S. A., Dec 1 1997, Proceedings of the Annual International Symposium on Microarchitecture. p. 92-103 12 p. (Proceedings of the Annual International Symposium on Microarchitecture).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Flow control
Scheduling

Generalize or die: Operating systems support for memristor-based accelerators

Bruel, P., Chalamalasetti, S. R., Dalton, C., Hajj, I. E., Goldman, A., Graves, C., Hwu, W-M. W., Laplante, P., Milojicic, D., Ndu, G. & Strachan, J. P., Nov 28 2017, 2017 IEEE International Conference on Rebooting Computing, ICRC 2017 - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 1-8 8 p. (2017 IEEE International Conference on Rebooting Computing, ICRC 2017 - Proceedings; vol. 2017-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Memristors
Particle accelerators
accelerators
engines
hardware

GPU acceleration of cutoff pair potentials for molecular modeling applications

Rodrigues, C. I., Hardy, D. J., Stone, J. E., Schulten, K. & Hwu, W. M. W., Dec 1 2008, Conference on Computing Frontiers - Proceedings of the 2008 Conference on Computing Frontiers, CF'08. p. 273-282 10 p. (Conference on Computing Frontiers - Proceedings of the 2008 Conference on Computing Frontiers, CF'08).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Molecular modeling
Atoms
Decomposition
Data storage equipment
Computer programming

GPU clusters for high-performance computing

Kindratenko, V., Enos, J. J., Shi, G., Showerman, M. T., Arnold, G. W., Stone, J. E., Phillips, J. C. & Hwu, W-M. W., Dec 21 2009, 2009 IEEE International Conference on Cluster Computing and Workshops, CLUSTER '09. 5289128. (Proceedings - IEEE International Conference on Cluster Computing, ICCC).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Natural sciences computing
Graphics processing unit

GPU-SM: Shared memory multi-GPU programming

Cabezas, J., Jordà, M., Gelado, I., Navarro, N. & Hwu, W-M. W., Feb 7 2015, ACM International Conference Proceeding Series. Gong, X. (ed.). Association for Computing Machinery, p. 13-24 12 p. (ACM International Conference Proceeding Series; vol. 2015-February).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Computer programming
Data storage equipment
Graphics processing unit
Computer systems
Data structures

Hardware acceleration of the pair-HMM algorithm for DNA variant calling

Huang, S., Manikandan, G. J., Ramachandran, A., Rupnow, K., Hwu, W-M. W. & Chen, D., Feb 22 2017, FPGA 2017 - Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays. Association for Computing Machinery, Inc, p. 275-284 10 p. (FPGA 2017 - Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

DNA
Hardware
Field programmable gate arrays (FPGA)
DNA sequences
Program processors

Hardware-Software Co-Design for an Analog-Digital Accelerator for Machine Learning

Ambrosi, J., Ankit, A., Antunes, R., Chalamalasetti, S. R., Chatterjee, S., El Hajj, I., Fachini, G., Faraboschi, P., Foltin, M., Huang, S., Hwu, W. M., Knuppe, G., Lakshminarasimha, S. V., Milojicic, D., Parthasarathy, M., Ribeiro, F., Rosa, L., Roy, K., Silveira, P. & Strachan, J. P., Feb 8 2019, 2018 IEEE International Conference on Rebooting Computing, ICRC 2018. Institute of Electrical and Electronics Engineers Inc., 8638612. (2018 IEEE International Conference on Rebooting Computing, ICRC 2018).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Particle accelerators
Learning systems
Hardware
Memristors
Neural networks

High performance computation and display of molecular orbitals on and multi-core cpus

Stone, J. E., Saam, J., Hardy, D. J., Vandivort, K. L., Hwu, W-M. W. & Schulten, K. J., Jul 23 2009, Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-2. 1 p. (Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-2).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Molecular orbitals
Display devices
Program processors
Quantum chemistry
Orbital calculations

High-performance CUDA kernel execution on FPGAs

Papakonstantinou, A., Gururaj, K., Stratton, J. A., Chen, D., Cong, J. & Hwu, W-M. W., Nov 24 2009, ICS'09 - Proceedings of the 23rd International Conference on Supercomputing. p. 515-516 2 p. 1542357. (Proceedings of the International Conference on Supercomputing).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Field programmable gate arrays (FPGA)
Particle accelerators

High-speed interferometric synthetic aperture microscopy on a graphics processing unit

Ahmad, A., Shemonski, N., Adie, S. G., Kim, H., Hwu, W. M. W., Carney, P. S. & Boppart, S. A., 2012, Frontiers in Optics, FIO 2012.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

synthetic apertures
high speed
microscopy
tomography
imaging techniques

HPS, A NEW MICROARCHITECTURE: RATIONALE AND INTRODUCTION.

Patt, Y. N., Hwo, W. M. & Shebanow, M. C., Dec 1 1985, MICRO: Annual Microprogramming Workshop. ACM, p. 103-108 6 p. (MICRO: Annual Microprogramming Workshop).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Engines
Substrates

HPSM, A HIGH PERFORMANCE RESTRICTED DATA FLOW ARCHITECTURE HAVING MINIMAL FUNCTIONALITY.

Hwu, W-M. W. & Patt, Y. N., 1986, Conference Proceedings - Annual Symposium on Computer Architecture. IEEE, p. 297-306 10 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Reduced instruction set computing
Simulators
Engines