Theo dõi
Sourish Chaudhuri
Sourish Chaudhuri
Google Inc, Carnegie Mellon University
Email được xác minh tại google.com
Tiêu đề
Trích dẫn bởi
Trích dẫn bởi
Năm
CNN architectures for large-scale audio classification
S Hershey, S Chaudhuri, DPW Ellis, JF Gemmeke, A Jansen, RC Moore, ...
2017 ieee international conference on acoustics, speech and signal …, 2017
32732017
Ava active speaker: An audio-visual dataset for active speaker detection
J Roth, S Chaudhuri, O Klejch, R Marvin, A Gallagher, L Kaver, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2002020
Non-negative matrix factorization based compensation of music for automatic speech recognition.
B Raj, T Virtanen, S Chaudhuri, R Singh
Interspeech, 717-720, 2010
1582010
Associating faces with voices for speaker diarization within videos
S Chaudhuri, K Hoover
US Patent 10,497,382, 2019
1032019
Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification.
S Chaudhuri, M Harvilla, B Raj
Interspeech, 2265-2268, 2011
762011
Audio event detection from acoustic unit occurrence patterns
A Kumar, P Dighe, R Singh, S Chaudhuri, B Raj
2012 IEEE international conference on acoustics, speech and signal …, 2012
742012
Engaging collaborative learners with helping agents
S Chaudhuri, R Kumar, I Howley, CP Rosé
Artificial intelligence in education, 365-372, 2009
562009
Ava-speech: A densely labeled dataset of speech activity in movies
S Chaudhuri, J Roth, DPW Ellis, A Gallagher, L Kaver, R Marvin, ...
arXiv preprint arXiv:1808.00606, 2018
522018
Using audio-visual information to understand speaker activity: Tracking active speakers on and off screen
K Hoover, S Chaudhuri, C Pantofaru, I Sturdy, M Slaney
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
50*2018
Unsupervised structure discovery for semantic analysis of audio
S Chaudhuri, B Raj
Advances in Neural Information Processing Systems 25, 2012
342012
It’s not easy being green: Supporting collaborative “green design” learning
S Chaudhuri, R Kumar, M Joshi, E Terrell, F Higgs, V Aleven, ...
Intelligent Tutoring Systems: 9th International Conference, ITS 2008 …, 2008
322008
An HMM based part-of-speech tagger and statistical chunker for 3 Indian languages
GMR Sastry, S Chaudhuri, PN Reddy
Shallow Parsing for South Asian Languages 13, 2007
262007
Unsupervised hierarchical structure induction for deeper semantic analysis of audio
S Chaudhuri, B Raj
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
242013
Unsupervised word discovery from phonetic input using nested pitman-yor language modeling
O Walter, R Haeb-Umbach, S Chaudhuri, B Raj
ICRA Workshop on Autonomous Learning, 2013
222013
Automatic smoothed captioning of non-speech sounds from audio
F Wang, S Chaudhuri, D Ellis, N Reale
US Patent 10,037,313, 2018
192018
Exploiting Temporal Sequence Structure for Semantic Analysis of Multimedia.
S Chaudhuri, R Singh, B Raj
INTERSPEECH, 1728-1731, 2012
192012
Speaking classification using audio-visual data
S Chaudhuri, O Klejch, JE Roth
US Patent 10,846,522, 2020
182020
Structured Models for Semantic Analysis of Audio Content
S Chaudhuri
PhD thesis, Carnegie Mellon University. 46, 47, 2013
18*2013
Gating model for video analysis
S Ramaswamy, S Chaudhuri, J Roth
US Patent 10,984,246, 2021
172021
Learning contextual relevance of audio segments using discriminative models over AUD sequences
S Chaudhuri, B Raj
2011 IEEE Workshop on Applications of Signal Processing to Audio and …, 2011
162011
Hệ thống không thể thực hiện thao tác ngay bây giờ. Hãy thử lại sau.
Bài viết 1–20