Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 1151 | 2024 |
Audiopalm: A large language model that can speak and listen PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ... arXiv preprint arXiv:2306.12925, 2023 | 196 | 2023 |
Spoken question answering and speech continuation using spectrogram-powered llm E Nachmani, A Levkovitch, R Hirsch, J Salazar, C Asawaroengchai, ... arXiv preprint arXiv:2305.15255, 2023 | 30 | 2023 |
Translatotron 3: Speech to speech translation with monolingual data E Nachmani, A Levkovitch, Y Ding, C Asawaroengchai, H Zen, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 12 | 2024 |
Probabilistic learning models for topic extraction i Thai language C Asawaroengchai, W Chaisangmongkon, D Laowattana 2018 5th International Conference on Business and Industrial Research (ICBIR …, 2018 | 7 | 2018 |
Ramanovich PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ... Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirovic, Damien Vincent …, 2023 | 5 | 2023 |
Artificial intelligence for generating depth map C Asawaroengchai, S Phanvilai, P Leelaphattarakij, V Trairattanapa, ... US Patent App. 16/698,731, 2021 | 4 | 2021 |
Generating 360 degree interactive content V Trairattanapa, P Leelaphattarakij, S Phanvilai, J Sukkasem, ... US Patent App. 16/714,354, 2021 | 3 | 2021 |
STAB: speech tokenizer assessment benchmark S Vashishth, H Singh, S Bharadwaj, S Ganapathy, C Asawaroengchai, ... arXiv preprint arXiv:2409.02384, 2024 | 2 | 2024 |
Performing tasks using generative neural networks PK Rubenstein, M Sharifi, A Tudor, C Asawaroengchai, DD Nguyen, ... US Patent App. 18/750,973, 2024 | | 2024 |
LANGUAGE MODELS USING SPOKEN LANGUAGE MODELING MD Tadmor, E Nachmani, A Levkovitch, J Salazar, C Asawaroengchai, ... US Patent App. 18/662,442, 2024 | | 2024 |
Speech-to-speech translation with monolingual data MT Ramanovich, E Nachmani, A Levkovitch, B Chun, D Yifan, N Bar, ... US Patent App. 18/589,358, 2024 | | 2024 |