Temporal sentiment localization: Listen and look in untrimmed videos
Video sentiment analysis aims to uncover the underlying attitudes of viewers, which has a
wide range of applications in real world. Existing works simply classify a video into a single …
wide range of applications in real world. Existing works simply classify a video into a single …
SoundDet: Polyphonic moving sound event detection and localization from raw waveform
We present a new framework SoundDet, which is an end-to-end trainable and light-weight
framework, for polyphonic moving sound event detection and localization. Prior methods …
framework, for polyphonic moving sound event detection and localization. Prior methods …
Visual object detector for cow sound event detection
Sound event detection (SED) is a reasonable choice in a number of application domains
including cattle sheds, dense forests, or any dark environments where visual objects are …
including cattle sheds, dense forests, or any dark environments where visual objects are …
Proposal-based few-shot sound event detection for speech and environmental sounds with perceivers
Many applications involve detecting and localizing specific sound events within long,
untrimmed documents, including keyword spotting, medical observation, and bioacoustic …
untrimmed documents, including keyword spotting, medical observation, and bioacoustic …
Human and Machine Performance in Counting Sound Classes in Single-Channel Soundscapes
Individual sounds are difficult to detect in complex soundscapes because of a strong
overlap. This article explores the task of estimating sound polyphony, which is defined here …
overlap. This article explores the task of estimating sound polyphony, which is defined here …
Internet of things (IoT) discovery using deep neural networks
E Lo, JH Kohl - Proceedings of the IEEE/CVF Winter …, 2020 - openaccess.thecvf.com
We present a novel approach to Internet of Things (IoT) discovery using Deep Neural
Network (DNN) based object detection. Traditional methods of IoT discovery are based on …
Network (DNN) based object detection. Traditional methods of IoT discovery are based on …
[HTML][HTML] A deep learning model for detecting and classifying multiple marine mammal species from passive acoustic data
Underwater passive acoustics is used worldwide for multi-year monitoring of marine
mammals. Yet, the large amount of audio recordings raises the need to automate the …
mammals. Yet, the large amount of audio recordings raises the need to automate the …
Musicyolo: A sight-singing onset/offset detection framework based on object detection instead of spectrum frames
X Wang, W Xu, W Yang… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
In this paper, we propose MusicYOLO based on object detection to detect the onset and
offset in singing for the first time. The onset of the vocal is not as stable and clear as that of …
offset in singing for the first time. The onset of the vocal is not as stable and clear as that of …
Audios Don't Lie: Multi-Frequency Channel Attention Mechanism for Audio Deepfake Detection
Y Feng - arxiv preprint arxiv:2412.09467, 2024 - arxiv.org
With the rapid development of artificial intelligence technology, the application of deepfake
technology in the audio field has gradually increased, resulting in a wide range of security …
technology in the audio field has gradually increased, resulting in a wide range of security …
[HTML][HTML] A monophonic cow sound annotation tool using a semi-automatic method on audio/video data
In this paper, we present a semi-automatic tool for labeling monophonic sound events with
specific reference to cow sounds. The proposed system takes as input audio or video data …
specific reference to cow sounds. The proposed system takes as input audio or video data …