Alexatm 20b: Few-shot learning using a large-scale multilingual seq2seq model S Soltan, S Ananthakrishnan, J FitzGerald, R Gupta, W Hamza, H Khan, ... arXiv preprint arXiv:2208.01448, 2022 | 88 | 2022 |
Alexa teacher model: Pretraining and distilling multi-billion-parameter encoders for natural language understanding systems J FitzGerald, S Ananthakrishnan, K Arkoudas, D Bernardi, A Bhagia, ... Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022 | 76 | 2022 |
Alexatm 20b: Few-shot learning using a large-scale multilingual seq2seq model, 2022 S Soltan, S Ananthakrishnan, J FitzGerald, R Gupta, W Hamza, H Khan, ... URL https://arxiv. org/abs/2208.01448 94, 0 | 12 | |
Attention fusion: a light yet efficient late fusion mechanism for task adaptation in nlu J Cao, CS Prakash, W Hamza Findings of the Association for Computational Linguistics: NAACL 2022, 857-866, 2022 | 9 | 2022 |
Instilling type knowledge in language models via multi-task QA S Li, M Sridhar, CS Prakash, J Cao, W Hamza, J McAuley arXiv preprint arXiv:2204.13796, 2022 | 9 | 2022 |
AlexaTM 20B: Few-shot learning using a large-scale multilingual seq2seq model. arXiv 2022 S Soltan, S Ananthakrishnan, J FitzGerald, R Gupta, W Hamza, H Khan, ... arXiv preprint arXiv:2208.01448, 2022 | 9 | 2022 |
Alexa teacher model J FitzGerald, S Ananthakrishnan, K Arkoudas, D Bernardi, A Bhagia, ... Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022 | 2 | 2022 |
The Amazon Nova family of models: Technical report and model card AAG Intelligence https://www.amazon.science/publications/the-amazon-nova-family-of-models …, 2024 | 1 | 2024 |
MATTER: Memory-augmented transformer using heterogeneous knowledge sources D Lee, CS Prakash, J FitzGerald, J Lehmann arXiv preprint arXiv:2406.04670, 2024 | 1 | 2024 |
Shared encoder for natural language understanding processing JJ Hueser, F Triefenbach, CS Prakash, J Cao, W Hamza, M Momotko US Patent App. 17/690,609, 2023 | | 2023 |
Sharing encoder representations across languages, domains and tasks in large-scale spoken language understanding J Hueser, J Gaspers, T Gueudre, C Prakash, J Cao, D Sorokin, Q Do, ... Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 | | 2023 |
FARS: FSM-Augmentation to Make LLMs Hallucinate the Right APIs S Rongali, CS Prakash, A Gupta, W Hamza | | |