Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Robotic vision for human-robot interaction and collaboration: A survey and systematic review
Robotic vision, otherwise known as computer vision for robots, is a critical process for robots
to collect and interpret detailed information related to human actions, goals, and …
to collect and interpret detailed information related to human actions, goals, and …
A dataset of dynamic reverberant sound scenes with directional interferers for sound event localization and detection
This report presents the dataset and baseline of Task 3 of the DCASE2021 Challenge on
Sound Event Localization and Detection (SELD). The dataset is based on emulation of real …
Sound Event Localization and Detection (SELD). The dataset is based on emulation of real …
Audiovisual fusion: Challenges and new approaches
In this paper, we review recent results on audiovisual (AV) fusion. We also discuss some of
the challenges and report on approaches to address them. One important issue in AV fusion …
the challenges and report on approaches to address them. One important issue in AV fusion …
Unicon: Unified context network for robust active speaker detection
We propose a new efficient framework, the Unified Context Network (UniCon), for robust
active speaker detection (ASD). Traditional methods for ASD usually operate on each …
active speaker detection (ASD). Traditional methods for ASD usually operate on each …
Co-localization of audio sources in images using binaural features and locally-linear regression
This paper addresses the problem of localizing audio sources using binaural
measurements. We propose a supervised formulation that simultaneously localizes multiple …
measurements. We propose a supervised formulation that simultaneously localizes multiple …
ChildBot: Multi-robot perception and interaction with children
In this paper, we present an integrated robotic system capable of participating in and
performing a wide range of educational and entertainment tasks collaborating with one or …
performing a wide range of educational and entertainment tasks collaborating with one or …
Who's speaking? Audio-supervised classification of active speakers in video
Active speakers have traditionally been identified in video by detecting their moving lips.
This paper demonstrates the same using spatio-temporal features that aim to capture other …
This paper demonstrates the same using spatio-temporal features that aim to capture other …
Mixture of inference networks for VAE-based audio-visual speech enhancement
We address unsupervised audio-visual speech enhancement based on variational
autoencoders (VAEs), where the prior distribution of clean speech spectrogram is simulated …
autoencoders (VAEs), where the prior distribution of clean speech spectrogram is simulated …
[HTML][HTML] Prediction of who will be next speaker and when using mouth-opening pattern in multi-party conversation
We investigated the mouth-opening transition pattern (MOTP), which represents the change
of mouth-opening degree during the end of an utterance, and used it to predict the next …
of mouth-opening degree during the end of an utterance, and used it to predict the next …
Ava (a social robot): Design and performance of a robotic hearing apparatus
Socially cognitive robots are supposed to communicate and interact with humans and other
robots in the most natural way. Listeners turn their heads to-ward speakers to enhance …
robots in the most natural way. Listeners turn their heads to-ward speakers to enhance …