MusicLM: Generating music from text A Agostinelli, TI Denk, Z Borsos, J Engel, M Verzetti, A Caillon, Q Huang, ... arXiv preprint arXiv:2301.11325, 2023 | 629 | 2023 |
AudioLM: a language modeling approach to audio generation Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 2523-2533, 2023 | 620 | 2023 |
Hotword recognition M Sharifi, JN Foerster US Patent 9,747,926, 2017 | 439 | 2017 |
Hotword detection on multiple devices M Sharifi US Patent 9,318,107, 2016 | 435 | 2016 |
Fréchet audio distance: A metric for evaluating music enhancement algorithms K Kilgour, M Zuluaga, D Roblek, M Sharifi arXiv preprint arXiv:1812.08466, 2018 | 428 | 2018 |
Device leadership negotiation among voice interface devices K Mixter, DM Casado, AH Gruenstein, T Tai, CT Hughes, MN Sharifi US Patent 9,812,128, 2017 | 428 | 2017 |
Systems and methods for live media content matching M Sharifi US Patent 9,661,361, 2017 | 357 | 2017 |
Hotword recognition M Sharifi, JN Foerster US Patent 9,928,840, 2018 | 287 | 2018 |
Hotword detection on multiple devices M Sharifi US Patent 10,134,398, 2018 | 281 | 2018 |
Hotword detection on multiple devices M Sharifi US Patent 9,514,752, 2016 | 274 | 2016 |
Promoting voice actions to hotwords M Sharifi US Patent 8,719,039, 2014 | 257 | 2014 |
Answering questions using environmental context M Sharifi, G Postelnicu US Patent App. 13/626,439, 2014 | 223 | 2014 |
Recorded media hotword trigger suppression AH Gruenstein, J Schalkwyk, M Sharifi US Patent 10,867,600, 2020 | 205 | 2020 |
AudioPaLM: A large language model that can speak and listen PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ... arXiv preprint arXiv:2306.12925, 2023 | 200 | 2023 |
Speak, read and prompt: High-fidelity text-to-speech with minimal supervision E Kharitonov, D Vincent, Z Borsos, R Marinier, S Girgin, O Pietquin, ... Transactions of the Association for Computational Linguistics 11, 1703-1718, 2023 | 190 | 2023 |
Efficient utterance-specific endpointer triggering for always-on hotwording M Sharifi, D Roblek, S Siddhartha US Patent 8,775,191, 2014 | 173 | 2014 |
Promoting voice actions to hotwords M Sharifi US Patent 9,542,942, 2017 | 141 | 2017 |
Speaker identification using a text-independent model and a text-dependent model M Sharifi, D Roblek US Patent 10,255,922, 2019 | 136 | 2019 |
Providing pre-computed hotword models M Sharifi US Patent 9,263,042, 2016 | 131 | 2016 |
Promoting voice actions to hotwords M Sharifi US Patent 9,263,035, 2016 | 130 | 2016 |