Wen-Mei W Hwu


Research Output 1984-2019

Conference contribution
2008

MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs

Stratton, J. A., Stone, S. S. & Hwu, W-M. W., Dec 1 2008, Languages and Compilers for Parallel Computing - 21st International Workshop, LCPC 2008, Revised Selected Papers. p. 16-30 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5335 LNCS).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Efficient Implementation
Program processors
Parallel programming
kernel

Optimization principles and application performance evaluation of a multithreaded GPU using CUDA

Ryoo, S., Rodrigues, C. I., Baghsorkhi, S. S., Stone, S. S., Kirk, D. B. & Hwu, W. M. W., Dec 1 2008, PPoPP'08 - Proceedings of the 2008 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 73-82 10 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Data storage equipment
Bandwidth
Graphics processing unit
Hardware

Program optimization space pruning for a multithreaded GPU

Ryoo, S., Rodrigues, C. I., Stone, S. S., Baghsorkhi, S. S., Ueng, S. Z., Stratton, J. A. & Hwu, W-M. W., May 19 2008, Proceedings of the 2008 CGO - Sixth International Symposium on Code Generation and Optimization. p. 195-204 10 p. (Proceedings of the 2008 CGO - Sixth International Symposium on Code Generation and Optimization).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Graphics processing unit
Tuning
Inspection

Visualization and analysis of GPU summer school applicants and participants

Wah, E., Johnson, E., Auvil, L., Thakkar, U., Hwu, W-M. W., Kirk, D., Dunning, T. H. & Glotzer, S. C., 2008, Proceedings - 4th IEEE International Conference on eScience, eScience 2008. p. 362-363 2 p. 4736797

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Visualization
Association rules
Parallel processing systems
Particle accelerators
Data mining
2007

Automatic discovery of coarse-grained parallelism in media applications

Ryoo, S., Ueng, S. Z., Rodrigues, C. I., Kidd, R. E., Frank, M. I. & Hwu, W. M. W., Dec 1 2007, Transactions on High-Performance Embedded Architectures and Compilers I. p. 194-213 20 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4050 LNCS).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Parallelism
Hardware
Processing
Computer programming languages
Particle accelerators

CIGAR: Application partitioning for a CPU/coprocessor architecture

Kelm, J. H., Gelado, I., Murphy, M. J., Navarro, N., Lumetta, S. S. & Hwu, W-M. W., Dec 1 2007, 16th International Conference on Parallel Architecture and Compilation Techniques, PACT 2007. p. 317-326 10 p. 4336222. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Program processors
Partitioning
Prototyping
Embedded Processor
Methodology

Corezilla: Build and tame the multicore beast?

Sarno, L., Hwu, W. M. W., Lund, C., Levy, M., Larus, J. R., Reinders, J., Cameron, G., Lennard, C. & Corporation, T., Aug 2 2007, 2007 44th ACM/IEEE Design Automation Conference, DAC'07. p. 632-633 2 p. 4261259. (Proceedings - Design Automation Conference).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Software engineering
Systems analysis
Hardware

Implicitly parallel programming models for thousand-core microprocessors

Hwu, W-M. W., Ryoo, S., Ueng, S. Z., Kelm, J. H., Gelado, I., Stone, S. S., Kidd, R. E., Baghsorkhi, S. S., Mahesri, A. A., Tsao, S. C., Navarro, N., Lumetta, S. S., Frank, M. I. & Patel, S. J., Aug 2 2007, 2007 44th ACM/IEEE Design Automation Conference, DAC'07. p. 754-759 6 p. 4261284. (Proceedings - Design Automation Conference).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Parallel programming
Microprocessor chips
Hardware
Parallel algorithms
Computer programming languages
2006

Improved Superblock optimization in GCC

Kidd, R. & Hwu, W-M. W., 2006, Proceedings of the GCC Developers' Summit 2006. p. 85-96 12 p.

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Flow control
Scheduling
2005

"Flea-flicker" Multipass pipelining: An alternative to the high-power out-of-order offense

Barnes, R. D., Ryoo, S. & Hwu, W-M. W., Dec 1 2005, MICRO-38: Proceedings of the 38th Annual IEEE/ACM International Symposium on Microarchitecture. p. 319-330 12 p. 1540970. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Scheduling
Pipelines
Energy efficiency
Microprocessor chips
Hardware

The future of computer architecture research: An industrial perspective

Hwu, W. M. & Patel, S., Dec 12 2005, Proceedings - 11th International Symposium on High-Performance Computer Architecture, HPCA-11 2005. 1 p. (Proceedings - International Symposium on High-Performance Computer Architecture).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Industrial research
Computer architecture
Industry
Hardware
2003

Beating in-order stalls with "flea-flicker" two-pass pipelining

Barnes, R. D., Patel, S. J., Nystrom, E. M., Navarro, N., Sias, J. W. & Hwu, W-M. W., Jan 1 2003, Proceedings - 36th International Symposium on Microarchitecture, MICRO 2003. IEEE Computer Society, p. 387-398 12 p. 1253243. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO; vol. 2003-January).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Pipelines
Transistors
2002

Code coverage and input variability: Effects on architecture and compiler research

Hunter, H. C. & Hwu, W-M. W., Dec 1 2002, Proceedings of the 2002 International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, CASES '02. p. 79-87 9 p. (Proceedings of the 2002 International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, CASES '02).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Telecommunication
Benchmarking
Experiments
Compliance

Vacuum packing: Extracting hardware-detected program phases for post-link optimization

Barnes, R. D., Nystrom, E. M., Merten, M. C. & Hwu, W-M. W., Jan 1 2002, Proceedings - 35th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2002. IEEE Computer Society, p. 233-244 12 p. 1176253. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO; vol. 2002-January).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Vacuum
Hardware
Phase transitions
2000

An empirical study of function pointers using SPEC benchmarks

Cheng, B. C. & Hwu, W-M. W., Jan 1 2000, Languages and Compilers for Parallel Computing - 12th International Workshop, LCPC 1999, Proceedings. Carter, L. & Ferrante, J. (eds.). Springer-Verlag, p. 490-493 4 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 1863).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Empirical Study
Benchmark
Extractor
Graph in graph theory
Compiler
1999

An architecture framework for introducing predicated execution into embedded microprocessors

Connors, D. A., Puiatti, J. M., August, D. I., Crozier, K. M. & Hwu, W. M. W., Dec 1 1999, Euro-Par 1999 - Parallel Processing: 5th International Conference, Proceedings. p. 1301-1311 11 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 1685 LNCS).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Instruction Level Parallelism
Microprocessor
Microprocessor chips
Branch
High Performance
1998

A study of code reuse and sharing characteristics of Java applications

Conte, M. T., Trick, A. R., Gyllenhaal, J. C. & Hwu, W-M. W., Jan 1 1998, Workload Characterization: Methodology and Case Studies - Based on the 1st Workshop on Workload Characterization. Maynard, A. M. G. & John, L. K. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 27-35 9 p. 809356. (Workload Characterization: Methodology and Case Studies - Based on the 1st Workshop on Workload Characterization; vol. 1998-November).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Internet
Web crawler

Improving static branch prediction in a compiler

Deitrich, B. L., Cheng, B. C. & Hwu, W-M. W., Jan 1 1998, Proceedings - 1998 International Conference on Parallel Architectures and Compilation Techniques, PACT 1998. Institute of Electrical and Electronics Engineers Inc., p. 214-221 8 p. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Branch Prediction
Compiler
Instruction Level Parallelism
Branch
Heuristics
1997

Architectural support for compiler-synthesized dynamic branch prediction strategies: Rationale and initial results

August, D. I., Connors, D. A., Gyllenhaal, J. C. & Hwu, W-M. W., 1997, IEEE High-Performance Computer Architecture Symposium Proceedings. Anon (ed.). IEEE, p. 84-93 10 p.

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Hardware
Statistics
Experiments

Framework for balancing control flow and predication

August, D. I., Hwu, W-M. W. & Mahlke, S. A., Dec 1 1997, Proceedings of the Annual International Symposium on Microarchitecture. p. 92-103 12 p. (Proceedings of the Annual International Symposium on Microarchitecture).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Flow control
Scheduling

Study of the cache and branch performance issues with running Java on current hardware platforms

Hsieh, C. H. A., Conte, M. T., Johnson, T. L., Gyllenhaal, J. C. & Hwu, W-M. W., 1997, Digest of Papers - COMPCON - IEEE Computer Society International Conference. Anon (ed.). IEEE, p. 211-216 6 p.

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Caffeine
Hardware
1995

Application of compiler-assisted multiple-instruction retry to VLIW architectures

Chen, S. K., Fuchs, W. K. & Hwu, W-M. W., 1995, Proceedings of the Conference on Fault-Tolerant Parallel and Distributed Systems. IEEE, p. 51-58 8 p.

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Very long instruction word architecture
Hazards
Hardware

A study of the effects of compiler-controlled speculation on instruction and data caches

Bringmann, R. A., Mahlke, S. A. & Hwu, W-M. W., Jan 1 1995, Proceedings of the 28th Annual Hawaii International Conference on System Sciences, HICSS 1995. IEEE Computer Society, p. 211-220 10 p. 375392. (Proceedings of the Annual Hawaii International Conference on System Sciences; vol. 1).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Comparison of full and partial predicated execution support for ILP processors

Mahlke, S. A., Hank, R. E., McCormick, J. E., August, D. I. & Hwu, W. M. W., Jan 1 1995, Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA. p. 138-149 12 p. (Conference Proceedings - Annual International Symposium on Computer Architecture, ISCA).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Inductive logic programming (ILP)
Code generation

Comparison of full and partial predicated execution support for ILP processors

Mahlke, S. A., Hank, R. E., McCormick, J. E., August, D. I. & Hwu, W-M. W., 1995, ACM SIGARCH (Association for Computing Machinery Special Interest Group on Computer Architecture) - Conference Proceedings. ACM, p. 138-149 12 p.

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Inductive logic programming (ILP)
Code generation
1994

An analytical approach to scheduling code for superscalar and VLIW architectures

Chen, S. K., Fuchs, W. & Hwu, W-M. W., Jan 1 1994, Proceedings of the 1994 International Conference on Parallel Processing, ICPP 1994. Institute of Electrical and Electronics Engineers Inc., 4115732. (Proceedings of the International Conference on Parallel Processing; vol. 1).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Very long instruction word architecture
Superscalar
Scheduling
Speedup
Subroutines

Characterizing the impact of predicated execution on branch prediction

Mahlke, S. A., Hank, R. E., Bringmann, R. A., Gyllenhaal, J. C., Gallagher, D. M. & Hwu, W-M. W., Nov 30 1994, Proceedings of the 27th Annual International Symposium on Microarchitecture, MICRO 1994. IEEE Computer Society, p. 217-227 11 p. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO; vol. Part F129425).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Data relocation and prefetching for programs with large data sets

Yamada, Y., Gyllenhaal, J., Haab, G. & Hwu, W-M. W., Nov 30 1994, Proceedings of the 27th Annual International Symposium on Microarchitecture, MICRO 1994. IEEE Computer Society, p. 118-127 10 p. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO; vol. Part F129425).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Copying
Relocation
Hardware
Data storage equipment

Dynamic memory disambiguation using the memory conflict buffer

Gallagher, D. M., Chen, W. Y., Mahlke, S. A., Gyllenhaal, J. C. & Hwu, W-M. W., Nov 1 1994, Proceedings of the 6th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS 1994. Association for Computing Machinery, p. 183-193 11 p. (International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS; vol. Part F129531).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Data storage equipment
Scheduling
Computer hardware
Repair

Speculative execution exception recovery using write-back suppression

Bringmann, R. A., Mahlke, S. A., Hank, R. E., Gyllenhaal, J. C. & Hwu, W-M. W., 1994, Proceedings of the Annual International Symposium on Microarchitecture. Anon (ed.). Publ by IEEE, p. 214-223 10 p.

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Hardware
Recovery
Experiments

Superblock formation using static program analysis

Hank, R. E., Mahlke, S. A., Bringmann, R. A., Gyllenhaal, J. C. & Hwu, W-M. W., Jan 1 1994, Proceedings of the Annual International Symposium on Microarchitecture. Anon (ed.). Publ by IEEE, p. 247-255 9 p. (Proceedings of the Annual International Symposium on Microarchitecture).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Scheduling

The application of compiler-assisted multiple-instruction retry to VLIW architectures

Chen, S. K., Fuchs, W. K. & Hwu, W-M. W., Jan 1 1994, Proceedings of IEEE Workshop on Fault-Tolerant Parallel and Distributed Systems, FTPDS 1994. Pradhan, D. & Avresky, D. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 51-58 8 p. 494474. (Proceedings of IEEE Workshop on Fault-Tolerant Parallel and Distributed Systems, FTPDS 1994).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Very long instruction word architecture
Hazards
Hardware
1993

Reverse if-conversion

Warter, N. J., Mahlke, S. A., Hwu, W. M. W. & Rau, B. R., Dec 1 1993, Proc ACM SIGPLAN 93 Conf Program Lang Des Implementation. Anon (ed.). Publ by ACM, p. 290-299 10 p. (Proc ACM SIGPLAN 93 Conf Program Lang Des Implementation).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Flow graphs
Scheduling

The benefit of predicated execution for software pipelining

Warter, N. J., Lavery, D. R. & Hwu, W-M. W., Jan 1 1993, Proceedings of the 26th Hawaii International Conference on System Sciences, HICSS 1993. IEEE Computer Society, p. 497-506 10 p. 1198122. (Proceedings of the Annual Hawaii International Conference on System Sciences; vol. 1).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Scheduling algorithms
Microprocessor chips
Hardware
Costs
Experiments

Using profile information to assist advanced compiler optimization and scheduling

Chen, W., Bringmann, R., Mahlke, S., Anik, S., Kiyohara, T., Warter, N., Lavery, D., Hwu, W-M. W., Hank, R. & Gyllenhaal, J., Jan 1 1993, Languages and Compilers for Parallel Computing - 5th International Workshop, Proceedings. Padua, D., Nicolau, A., Gelernter, D. & Banerjee, U. (eds.). Springer-Verlag, p. 31-48 18 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 757 LNCS).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Compiler Optimization
Global optimization
Instruction Level Parallelism
Flow control
Scheduling
1992

Branch recovery with compiler-assisted multiple instruction retry

Alewine, N. J., Chen, S. K., Li, C. C., Fuchs, W. K. & Hwu, W. M., Jan 1 1992, FTCS 1992 - 22nd Annual International Symposium on Fault-Tolerant Computing. Institute of Electrical and Electronics Engineers Inc., p. 66-73 8 p. 243614. (FTCS 1992 - 22nd Annual International Symposium on Fault-Tolerant Computing).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Hardware
Recovery
Hazards

Compiler code transformations for superscalar-based high-performance systems

Mahlke, S. A., Chen, W. Y., Gyllenhaal, J. C., Hwu, W-M. W., Chang, P. P. & Kiyohara, T., Dec 1 1992, Proceedings of the 1992 ACM/IEEE conference on Supercomputing, Supercomputing 1992. Werner, R. (ed.). Association for Computing Machinery, p. 808-817 10 p. (Proceedings of the International Conference on Supercomputing; vol. Part F129723).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Supercomputers

Sentinel scheduling for VLIW and superscalar processors

Mahlke, S. A., Chen, W. Y., Hwu, W-M. W., Rau, B. R. & Schlansker, M. S., Jan 1 1992, International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS. 9 ed. Publ by ACM, p. 238-247 10 p. (International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS; vol. 27, no. 9).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Scheduling

Systematic prototyping of superscalar computer architectures

Conte, T. M. & Hwu, W-M. W., Jan 1 1992, Proceedings - 3rd International Workshop on Rapid System Prototyping: Shortening the Path from Specification to Prototype, RSP 1992. IEEE Computer Society, p. 161-170 10 p. 243910. (Proceedings of the International Workshop on Rapid System Prototyping; vol. 1992-January).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Computer architecture
Architectural design
Hardware
Data storage equipment

Tolerating data access latency with register preloading

Chen, W. Y., Mahlke, S. A., Hwu, W-M. W., Kiyohara, T. & Chang, P. P., Aug 1 1992, Proceedings of the 6th International Conference on Supercomputing, ICS 1992. Association for Computing Machinery, p. 104-113 10 p. (Proceedings of the International Conference on Supercomputing; vol. Part F129617).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Supercomputers
Hardware
Data storage equipment

Xprof: Profiling the execution of X Window programs

Gupta, A. & Hwu, W-M. W., Jun 1 1992, Proceedings of the 1992 ACM SIGMETRICS Joint International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS/PERFORMANCE 1992. Gaither, B. D. (ed.). Association for Computing Machinery, Inc, p. 253-254 2 p. (Proceedings of the 1992 ACM SIGMETRICS Joint International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS/PERFORMANCE 1992).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

1991

Comparing static and dynamic code scheduling for multiple-instruction-issue processors

Chang, P. P., Chen, W. Y., Mahlke, S. A. & Hwu, W-M. W., Sep 1 1991, MICRO 1991 - Proceedings of the 24th Annual International Symposium on Microarchitecture. IEEE Computer Society, p. 25-33 9 p. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Scheduling
Hardware
Experiments

Data access microarchitectures for superscalar processors with compiler-assisted data prefetching

Chen, W. Y., Mahlke, S. A., Chang, P. P. & Hwu, W-M. W., Sep 1 1991, MICRO 1991 - Proceedings of the 24th Annual International Symposium on Microarchitecture. IEEE Computer Society, p. 69-73 5 p. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Pollution
Data storage equipment

IMPACT: An architectural framework for multiple-instruction-issue processors

Chang, P. P., Mahlke, S. A., Chen, W. Y., Warter, N. J. & Hwu, W-M. W., May 1 1991, Conference Proceedings - Annual Symposium on Computer Architecture. Publ by IEEE, p. 266-275 10 p. (Conference Proceedings - Annual Symposium on Computer Architecture).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Architectural design
Experiments
Scheduling
Data storage equipment
1990

An experimental single-chip data flow CPU

Uvieghara, G. A., Hwu, W-M. W., Nakagome, Y., Jeong, D. K., Lee, D., Hodges, D. A. & Patt, Y., 1990, 90 Symp VLSI Circuits. Publ by IEEE, p. 119-120 2 p.

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Program processors
Data storage equipment
Interfaces (computer)
Transistors
Throughput

A software based approach to achieving optimal performance for signature control flow checking

Warter, N. J. & Hwu, W. M. W., Dec 1 1990, Digest of Papers - FTCS (Fault-Tolerant Computing Symposium). Publ by IEEE, p. 442-449 8 p. (Digest of Papers - FTCS (Fault-Tolerant Computing Symposium)).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Flow control
Flow graphs
Computer architecture
Hardware

Benchmark characterization for experimental system evaluation

Conte, T. M. & Hwu, W. M. W., Jan 1 1990, Proceedings of the Hawaii International Conference on System Science. Hoevel, L. W., Shriver, B. D., Nunamaker, J. F. J., Sprague, R. H. J. & Milutinovic, V. (eds.). Publ by Western Periodicals Co, p. 6-18 13 p. (Proceedings of the Hawaii International Conference on System Science; vol. 1).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Data storage equipment
Benchmarking
Systems analysis
1989

Control flow optimization for supercomputer scalar processing

Chang, P. P. & Hwu, W-M. W., Jun 1 1989, Proceedings of the 3rd International Conference on Supercomputing, ICS 1989. Association for Computing Machinery, p. 145-153 9 p. (Proceedings of the International Conference on Supercomputing; vol. Part F130180).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Supercomputers
Flow control
Processing
Pipelines
Hardware

Forward semantic: A compiler-assisted instruction fetch method for heavily pipelined processors

Chang, P. P. & Hwu, W-M. W., Aug 1 1989, Proceedings of the Annual International Symposium on Microarchitecture, MICRO. Allan, V. H. (ed.). IEEE Computer Society, p. 188-198 11 p. (Proceedings of the Annual International Symposium on Microarchitecture, MICRO).

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Pipelines
Semantics
Costs
UNIX
Computer aided design
1988

Exploiting parallel microprocessor microarchitectures with a compiler code generator

Hwu, W. M. W. & Chang, P. P., Jan 1 1988, Unknown Host Publication Title. IEEE, p. 45-53 9 p.

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Microprocessor chips
Subroutines
Experiments
Costs