Filter
Conference contribution

Search results

  • 2024

    Retargeting and Respecializing GPU Workloads for Performance Portability

    Ivanov, I. R., Zinenko, O., Domke, J., Endo, T. & Moses, W. S., 2024, CGO 2024 - Proceedings of the 2024 IEEE/ACM International Symposium on Code Generation and Optimization. Grosser, T., Dubach, C., Steuwer, M., Xue, J., Ottoni, G. & Pereira, F. M. Q. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 119-132 14 p. (CGO 2024 - Proceedings of the 2024 IEEE/ACM International Symposium on Code Generation and Optimization).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2023

    High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs

    Moses, W. S., Ivanov, I. R., Domke, J., Endo, T., Doerfert, J. & Zinenko, O., Feb 25 2023, PPoPP 2023 - Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming. Association for Computing Machinery, p. 119-134 16 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • Transparent Checkpointing for Automatic Differentiation of Program Loops Through Expression Transformations

    Schanen, M., Narayanan, S. H. K., Williamson, S., Churavy, V., Moses, W. S. & Paehler, L., Jun 26 2023, Computational Science - ICCS 2023: 23rd International Conference, Prague, Czech Republic, July 3–5, 2023, Proceedings, Part III. Mikyška, J., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V. V., Dongarra, J. J. & Sloot, P. M. A. (eds.). Springer, p. 483-497 15 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2022

    Enabling Transformers to Understand Low-Level Programs

    Guo, Z. C. & Moses, W. S., 2022, 2022 IEEE High Performance Extreme Computing Conference, HPEC 2022. Institute of Electrical and Electronics Engineers Inc., (2022 IEEE High Performance Extreme Computing Conference, HPEC 2022).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Scalable Automatic Differentiation of Multiple Parallel Paradigms through Compiler Augmentation

    Moses, W. S., Narayanan, S. H. K., Paehler, L., Churavy, V., Schanen, M., Huckelheim, J., Doerfert, J. & Hovland, P., 2022, Proceedings of SC 2022: International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Computer Society, (International Conference for High Performance Computing, Networking, Storage and Analysis, SC; vol. 2022-November).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2021

    Polygeist: Raising C to Polyhedral MLIR

    Moses, W. S., Chelini, L., Zhao, R. & Zinenko, O., 2021, Proceedings - 30th International Conference on Parallel Architectures and Compilation Techniques, PACT 2021. Lee, J. & Cohen, A. (eds.). Institute of Electrical and Electronics Engineers Inc., p. 45-59 15 p. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT; vol. 2021-September).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Reverse-mode automatic differentiation and optimization of GPU Kernels via enzyme

    Moses, W. S., Churavy, V., Paehler, L., Höckelheim, J., Narayanan, S. H. K., Schanen, M. & Doerfert, J., Nov 14 2021, Proceedings of SC 2021: The International Conference for High Performance Computing, Networking, Storage and Analysis: Science and Beyond. IEEE Computer Society, (International Conference for High Performance Computing, Networking, Storage and Analysis, SC).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • 2020

    AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning

    Haj-Ali, A., Huang, Q., Moses, W., Xiang, J., Asanovic, K., Wawrzynek, J. & Stoica, I., 2020, Proceedings of Machine Learning and Systems. Dhillon, I., Papailiopoulos, D. & Sze, V. (eds.). Vol. 2. p. 70-81 12 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2019

    AutoPhase: Compiler Phase-Ordering for HLS with Deep Reinforcement Learning

    Huang, Q., Haj-Ali, A., Moses, W., Xiang, J., Stoica, I., Asanovic, K. & Wawrzynek, J., Apr 2019, Proceedings - 27th IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2019. Institute of Electrical and Electronics Engineers Inc., p. 308 1 p. 8735549. (Proceedings - 27th IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2019).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • LiTM: A lightweight deterministic software transactional memory system

    Xia, Y., Yu, X., Moses, W., Shun, J. & Devadas, S., Feb 17 2019, Proceedings of the 10th International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2019. Chen, Q., Huang, Z. & Si, M. (eds.). Association for Computing Machinery, p. 1-10 10 p. (Proceedings of the 10th International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2019).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
  • 2017

    OpenMPIR: Implementing OpenMP Tasks with Tapir

    Stelle, G., Moses, W. S., Olivier, S. L. & McCormick, P., Nov 12 2017, Proceedings of LLVM-HPC 2017: 4th Workshop on the LLVM Compiler Infrastructure in HPC - Held in conjunction with SC 2017: The International Conference for High Performance Computing, Networking, Storage and Analysis. Association for Computing Machinery, 3148186. (Proceedings of LLVM-HPC 2017: 4th Workshop on the LLVM Compiler Infrastructure in HPC - Held in conjunction with SC 2017: The International Conference for High Performance Computing, Networking, Storage and Analysis).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Tapir: Embedding Fork-Join Parallelism into LLVM's Intermediate Representation

    Schardl, T. B., Moses, W. S. & Leiserson, C. E., Jan 26 2017, PPoPP 2017 - Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. Association for Computing Machinery, p. 249-265 17 p. (ACM SIGPLAN Notices; vol. 52, no. 8).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access