Conformer: Convolution-augmented transformer for speech recognition A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ... arXiv preprint arXiv:2005.08100, 2020 | 3590 | 2020 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 3196 | 2023 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 1165 | 2024 |
Bigssl: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1519-1532, 2022 | 205 | 2022 |
An attention-based spatiotemporal lstm network for next poi recommendation L Huang, Y Ma, S Wang, Y Liu IEEE Transactions on Services Computing 14 (6), 1585-1597, 2019 | 191 | 2019 |
BFloat16: The secret to high performance on Cloud TPUs S Wang, P Kanwar Google Cloud Blog 4 (1), 2019 | 164 | 2019 |
Making memristive neural network accelerators reliable B Feinberg, S Wang, E Ipek 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018 | 139 | 2018 |
GSPMD: general and scalable parallelization for ML computation graphs Y Xu, HJ Lee, D Chen, B Hechtman, Y Huang, R Joshi, M Krikun, ... arXiv preprint arXiv:2105.04663, 2021 | 130 | 2021 |
Enabling scientific computing on memristive accelerators B Feinberg, UKR Vengalam, N Whitehair, S Wang, E Ipek 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018 | 112 | 2018 |
Conformer: Convolutionaugmented transformer for speech recognition. arXiv 2020 A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ... arXiv preprint arXiv:2005.08100, 2020 | 92 | 2020 |
Overlap communication with dependent computation via decomposition in large deep learning models S Wang, J Wei, A Sabne, A Davis, B Ilbeyi, B Hechtman, D Chen, ... Proceedings of the 28th ACM International Conference on Architectural …, 2022 | 64 | 2022 |
Gpuguard: Mitigating contention based side and covert channel attacks on gpus Q Xu, H Naghibijouybari, S Wang, N Abu-Ghazaleh, M Annavaram Proceedings of the ACM International Conference on Supercomputing, 497-509, 2019 | 44 | 2019 |
Scale mlperf-0.6 models on google tpu-v3 pods S Kumar, V Bitorff, D Chen, C Chou, B Hechtman, HJ Lee, N Kumar, ... arXiv preprint arXiv:1909.09756, 2019 | 42 | 2019 |
Reducing data movement energy via online data clustering and encoding S Wang, E Ipek International Symposium on Microarchitecture (MICRO), 2016 | 37 | 2016 |
Automatic cross-replica sharding of weight update in data-parallel training Y Xu, HJ Lee, D Chen, H Choi, B Hechtman, S Wang arXiv preprint arXiv:2004.13336, 2020 | 34 | 2020 |
Exploring the limits of Concurrency in ML Training on Google TPUs S Kumar, Y Wang, C Young, J Bradbury, N Kumar, D Chen, A Swing Proceedings of Machine Learning and Systems 3, 81-92, 2021 | 22 | 2021 |
Effect of design and operating parameters on dynamic response of a micro direct methanol fuel cell Y Zhang, H He, Z Yuan, S Wang, X Liu International journal of hydrogen energy 36 (3), 2230-2236, 2011 | 20 | 2011 |
Development and characterization of a novel air-breathing micro direct methanol fuel cell stack for portable applications X Liu, B Zhang, Y Zhang, H He, J Li, S Wang, Z Yuan, H Deng Journal of Micromechanics and Microengineering 20, 2010 | 20 | 2010 |
Content aware refresh: Exploiting the asymmetry of DRAM retention errors to reduce the refresh frequency of less vulnerable data S Wang, MN Bojnordi, X Guo, E Ipek IEEE Transactions on Computers 68 (3), 362-374, 2018 | 19 | 2018 |
Learning to fuse A Abdolrashidi, Q Xu, S Wang, S Roy, Y Zhou NeurIPS ML for Systems Workshop, 2019 | 10 | 2019 |