Gpipe: Efficient training of giant neural networks using pipeline parallelism Y Huang, Y Cheng, A Bapna, O Firat, D Chen, M Chen, HJ Lee, J Ngiam, ... Advances in neural information processing systems 32, 2019 | 1868 | 2019 |
Gshard: Scaling giant models with conditional computation and automatic sharding D Lepikhin, HJ Lee, Y Xu, D Chen, O Firat, Y Huang, M Krikun, N Shazeer, ... arXiv preprint arXiv:2006.16668, 2020 | 1089 | 2020 |
Mesh-tensorflow: Deep learning for supercomputers N Shazeer, Y Cheng, N Parmar, D Tran, A Vaswani, P Koanantakool, ... Advances in neural information processing systems 31, 2018 | 437 | 2018 |
OptiML: an implicitly parallel domain-specific language for machine learning A Sujeeth, HJ Lee, K Brown, T Rompf, H Chafi, M Wu, A Atreya, ... Proceedings of the 28th International Conference on Machine Learning (ICML …, 2011 | 311 | 2011 |
A heterogeneous parallel framework for domain-specific languages KJ Brown, AK Sujeeth, HJ Lee, T Rompf, H Chafi, M Odersky, K Olukotun 2011 International Conference on Parallel Architectures and Compilation …, 2011 | 281 | 2011 |
Delite: A compiler architecture for performance-oriented embedded domain-specific languages AK Sujeeth, KJ Brown, H Lee, T Rompf, H Chafi, M Odersky, K Olukotun ACM Transactions on Embedded Computing Systems (TECS) 13 (4s), 1-25, 2014 | 274 | 2014 |
A domain-specific approach to heterogeneous parallelism H Chafi, AK Sujeeth, KJ Brown, HJ Lee, AR Atreya, K Olukotun ACM SIGPLAN Notices 46 (8), 35-46, 2011 | 220 | 2011 |
Optimizing data structures in high-level programs: New directions for extensible compilers based on staging T Rompf, AK Sujeeth, N Amin, KJ Brown, V Jovanovic, HJ Lee, ... Proceedings of the 40th annual ACM SIGPLAN-SIGACT symposium on Principles of …, 2013 | 142 | 2013 |
Gspmd: general and scalable parallelization for ml computation graphs Y Xu, HJ Lee, D Chen, B Hechtman, Y Huang, R Joshi, M Krikun, ... arXiv preprint arXiv:2105.04663, 2021 | 131 | 2021 |
Generating configurable hardware from parallel patterns R Prabhakar, D Koeplinger, KJ Brown, HJ Lee, C De Sa, C Kozyrakis, ... Acm Sigplan Notices 51 (4), 651-665, 2016 | 107 | 2016 |
Implementing domain-specific languages for heterogeneous parallel computing HJ Lee, K Brown, A Sujeeth, H Chafi, T Rompf, M Odersky, K Olukotun Ieee Micro 31 (5), 42-53, 2011 | 97 | 2011 |
Hardware system synthesis from domain-specific languages N George, HJ Lee, D Novo, T Rompf, KJ Brown, AK Sujeeth, M Odersky, ... 2014 24th International Conference on Field Programmable Logic and …, 2014 | 92 | 2014 |
Composition and reuse with compiled domain-specific languages AK Sujeeth, T Rompf, KJ Brown, HJ Lee, H Chafi, V Popic, M Wu, ... ECOOP 2013–Object-Oriented Programming: 27th European Conference …, 2013 | 89 | 2013 |
Building-blocks for performance oriented DSLs T Rompf, AK Sujeeth, HJ Lee, KJ Brown, H Chafi, M Odersky, K Olukotun arXiv preprint arXiv:1109.0778, 2011 | 84 | 2011 |
Have abstraction and eat performance, too: Optimized heterogeneous computing with parallel patterns KJ Brown, HJ Lee, T Rompf, AK Sujeeth, C De Sa, C Aberger, K Olukotun Proceedings of the 2016 International Symposium on Code Generation and …, 2016 | 65 | 2016 |
Locality-aware mapping of nested parallel patterns on gpus HJ Lee, KJ Brown, AK Sujeeth, T Rompf, K Olukotun 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture, 63-74, 2014 | 60 | 2014 |
Surgical precision JIT compilers T Rompf, AK Sujeeth, KJ Brown, HJ Lee, H Chafi, K Olukotun Proceedings of the 35th ACM SIGPLAN conference on programming language …, 2014 | 57 | 2014 |
Forge: generating a high performance DSL implementation from a declarative specification AK Sujeeth, A Gibbons, KJ Brown, HJ Lee, T Rompf, M Odersky, ... Acm Sigplan Notices 49 (3), 145-154, 2013 | 53 | 2013 |
Scale mlperf-0.6 models on google tpu-v3 pods S Kumar, V Bitorff, D Chen, C Chou, B Hechtman, HJ Lee, N Kumar, ... arXiv preprint arXiv:1909.09756, 2019 | 41 | 2019 |
Go meta! A case for generative programming and dsls in performance critical systems T Rompf, KJ Brown, HJ Lee, AK Sujeeth, M Jonnalagedda, N Amin, ... 1st Summit on Advances in Programming Languages (SNAPL 2015) 32, 238-261, 2015 | 38 | 2015 |