Decomposing a scene into geometric and semantically consistent regions

S Gould, R Fulton, D Koller - 2009 IEEE 12th international …, 2009 - ieeexplore.ieee.org
High-level, or holistic, scene understanding involves reasoning about objects, regions, and
the 3D relationships between them. This requires a representation above the level of pixels …

The ICSI meeting corpus

A Janin, D Baron, J Edwards, D Ellis… - … , Speech, and Signal …, 2003 - ieeexplore.ieee.org
We have collected a corpus of data from natural meetings that occurred at the International
Computer Science Institute (ICSI) in Berkeley, California over the last three years. The …

Smart meeting systems: A survey of state-of-the-art and open issues

Z Yu, Y Nakamura - ACM Computing Surveys (CSUR), 2010 - dl.acm.org
Smart meeting systems, which record meetings and analyze the generated audio--visual
content for future viewing, have been a topic of great interest in recent years. A successful …

Data quality: The other face of big data

B Saha, D Srivastava - 2014 IEEE 30th international conference …, 2014 - ieeexplore.ieee.org
In our Big Data era, data is being generated, collected and analyzed at an unprecedented
scale, and data-driven decision making is swee** through all aspects of society. Recent …

Apparatus and method performing audio-video sensor fusion for object localization, tracking, and separation

C Choi, HK Lee, SM Yoon, D Kong - US Patent 7,536,029, 2009 - Google Patents
An apparatus for tracking and identifying objects includes an audio likelihood module which
determines corresponding audio likelihoods for each of a plurality of sounds received from …

Automatic analysis of multimodal group actions in meetings

L McCowan, D Gatica-Perez, S Bengio… - IEEE transactions on …, 2005 - ieeexplore.ieee.org
This paper investigates the recognition of group actions in meetings. A framework is
employed in which group actions result from the interactions of the individual participants …

Audio user interaction recognition and application interface

LH Kim, J Shin, E Visser - US Patent 9,746,916, 2017 - Google Patents
Disclosed is an application interface that takes into account the user's gaze direction relative
to who is speaking in an interactive multi-participant environment where audio-based …

Memory dependence prediction using store sets

GZ Chrysos, JS Emer - ACM SIGARCH Computer Architecture News, 1998 - dl.acm.org
For maximum performance, an out-of-order processor must issue load instructions as early
as possible, while avoiding memory-order violations with prior store instructions that write to …

Forming beams with nulls directed at noise sources

WV Oxford - US Patent 7,991,167, 2011 - Google Patents
A communication system (eg, a speakerphone) includes an array of microphones, a
speaker, memory and a processor. The processor may perform a virtual broadside scan on …

Maximum likelihood sound source localization and beamforming for directional microphone arrays in distributed meetings

C Zhang, D Florêncio, DE Ba… - IEEE Transactions on …, 2008 - ieeexplore.ieee.org
In distributed meeting applications, microphone arrays have been widely used to capture
superior speech sound and perform speaker localization through sound source localization …