Decomposing a scene into geometric and semantically consistent regions
High-level, or holistic, scene understanding involves reasoning about objects, regions, and
the 3D relationships between them. This requires a representation above the level of pixels …
the 3D relationships between them. This requires a representation above the level of pixels …
The ICSI meeting corpus
We have collected a corpus of data from natural meetings that occurred at the International
Computer Science Institute (ICSI) in Berkeley, California over the last three years. The …
Computer Science Institute (ICSI) in Berkeley, California over the last three years. The …
Smart meeting systems: A survey of state-of-the-art and open issues
Z Yu, Y Nakamura - ACM Computing Surveys (CSUR), 2010 - dl.acm.org
Smart meeting systems, which record meetings and analyze the generated audio--visual
content for future viewing, have been a topic of great interest in recent years. A successful …
content for future viewing, have been a topic of great interest in recent years. A successful …
Data quality: The other face of big data
In our Big Data era, data is being generated, collected and analyzed at an unprecedented
scale, and data-driven decision making is swee** through all aspects of society. Recent …
scale, and data-driven decision making is swee** through all aspects of society. Recent …
Apparatus and method performing audio-video sensor fusion for object localization, tracking, and separation
An apparatus for tracking and identifying objects includes an audio likelihood module which
determines corresponding audio likelihoods for each of a plurality of sounds received from …
determines corresponding audio likelihoods for each of a plurality of sounds received from …
Automatic analysis of multimodal group actions in meetings
This paper investigates the recognition of group actions in meetings. A framework is
employed in which group actions result from the interactions of the individual participants …
employed in which group actions result from the interactions of the individual participants …
Audio user interaction recognition and application interface
Disclosed is an application interface that takes into account the user's gaze direction relative
to who is speaking in an interactive multi-participant environment where audio-based …
to who is speaking in an interactive multi-participant environment where audio-based …
Memory dependence prediction using store sets
GZ Chrysos, JS Emer - ACM SIGARCH Computer Architecture News, 1998 - dl.acm.org
For maximum performance, an out-of-order processor must issue load instructions as early
as possible, while avoiding memory-order violations with prior store instructions that write to …
as possible, while avoiding memory-order violations with prior store instructions that write to …
Forming beams with nulls directed at noise sources
WV Oxford - US Patent 7,991,167, 2011 - Google Patents
A communication system (eg, a speakerphone) includes an array of microphones, a
speaker, memory and a processor. The processor may perform a virtual broadside scan on …
speaker, memory and a processor. The processor may perform a virtual broadside scan on …
Maximum likelihood sound source localization and beamforming for directional microphone arrays in distributed meetings
In distributed meeting applications, microphone arrays have been widely used to capture
superior speech sound and perform speaker localization through sound source localization …
superior speech sound and perform speaker localization through sound source localization …