Multi-lingual evaluation of code generation models B Athiwaratkun, SK Gouda, Z Wang, X Li, Y Tian, M Tan, WU Ahmad, ... arXiv preprint arXiv:2210.14868, 2022 | 150* | 2022 |
Speech recognition: Keyword spotting through image recognition SK Gouda, S Kanetkar, D Harrison, MK Warmuth arXiv preprint arXiv:1803.03759, 2018 | 29* | 2018 |
Combining word embeddings and n-grams for unsupervised document summarization Z Jiang, M Srivastava, S Krishna, D Akodes, R Schwartz arXiv preprint arXiv:2004.14119, 2020 | 8 | 2020 |
The 2019 bbn cross-lingual information retrieval system L Zhang, D Karakos, W Hartmann, M Srivastava, L Tarlin, D Akodes, ... Proceedings of the workshop on Cross-Language Search and Summarization of …, 2020 | 6 | 2020 |
BASS: Batched attention-optimized speculative sampling H Qian, SK Gonugondla, S Ha, M Shang, SK Gouda, R Nallapati, ... arXiv preprint arXiv:2404.15778, 2024 | 3 | 2024 |
Constrained decoding for code language models via efficient left and right quotienting of context-sensitive grammars D Melcer, N Fulton, S Krishna Gouda, H Qian arXiv e-prints, arXiv: 2402.17988, 2024 | 3 | 2024 |
Bifurcated attention: Accelerating massively parallel decoding with shared prefixes in llms B Athiwaratkun, SK Gonugondla, SK Gouda, H Qian, H Ding, Q Sun, ... arXiv preprint arXiv:2403.08845, 2024 | 2 | 2024 |
Training LLMs to better self-debug and explain code N Jiang, X Li, S Wang, Q Zhou, SB Hossain, B Ray, V Kumar, X Ma, ... arXiv preprint arXiv:2405.18649, 2024 | 1 | 2024 |
Token alignment via character matching for subword completion B Athiwaratkun, S Wang, M Shang, Y Tian, Z Wang, SK Gonugondla, ... arXiv preprint arXiv:2403.08688, 2024 | 1 | 2024 |
Constrained Decoding for Fill-in-the-Middle Code Language Models via Efficient Left and Right Quotienting of Context-Sensitive Grammars D Melcer, N Fulton, SK Gouda, H Qian arXiv preprint arXiv:2402.17988, 2024 | 1 | 2024 |
Bifurcated attention for single-context large-batch sampling B Athiwaratkun, S Gonugondla, SK Gouda, H Ding, Q Sun, J Wang, J Guo, ... | 1 | 2024 |
On io-efficient attention mechanisms: Context-aware bifurcated attention and the generalized multi-group attention B Athiwaratkun, SK Gonugondla, SK Gouda, H Qian, H Ding, Q Sun, ... Workshop on Efficient Systems for Foundation Models@ ICML2023, 2023 | 1 | 2023 |
Foreign Language Automated Information Retrieval (FLAIR)/Machine Translation For English Retrieval of Information In Any Language (MATERIAL) J Makhoul, L Zhang, SK Gouda, R Schwartz, W Hartmann, L Tarlin, ... | | 2021 |
SageLite: Harmonizing Text and Code Through Multi-Stage Training D Zhang, S Mayers, J Wang, SK Gouda, N Jain, J Zhang, X Ma, A Deoras | | |