- Academic Search

Usage of Unmanned Aerial Vehicles (UAVs) is growing rapidly in a wide range of consumer
applications, as they prove to be both autonomous and flexible in a variety of environments …

Cited by 221

[PDF] springer.com

Automated audio captioning: An overview of recent progress and new challenges

X Mei, X Liu, MD Plumbley, W Wang - … journal on audio, speech, and music …, 2022 - Springer

Automated audio captioning is a cross-modal translation task that aims to generate natural
language descriptions for given audio clips. This task has received increasing attention with …

Cited by 60

[PDF] arxiv.org

Wavcaps: A chatgpt-assisted weakly-labelled audio captioning dataset for audio-language multimodal research

X Mei, C Meng, H Liu, Q Kong, T Ko… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org

The advancement of audio-language (AL) multimodal learning tasks has been significant in
recent years, yet the limited size of existing audio-language datasets poses challenges for …

Cited by 155

[PDF] hal.science

DCASE 2017 challenge setup: Tasks, datasets and baseline system

A Mesaros, T Heittola, A Diment, B Elizalde… - … 2017-workshop on …, 2017 - inria.hal.science

DCASE 2017 Challenge consists of four tasks: acoustic scene classification, detection of
rare sound events, sound event detection in real-life audio, and large-scale weakly …

Cited by 600

[PDF] tuni.fi

Detection and classification of acoustic scenes and events: Outcome of the DCASE 2016 challenge

A Mesaros, T Heittola, E Benetos… - … on Audio, Speech …, 2017 - ieeexplore.ieee.org

Public evaluation campaigns and datasets promote active development in target research
areas, allowing direct comparison of algorithms. The second edition of the challenge on …

Cited by 385

[PDF] arxiv.org

Large-scale weakly supervised audio classification using gated convolutional neural network

Y Xu, Q Kong, W Wang… - 2018 IEEE international …, 2018 - ieeexplore.ieee.org

In this paper, we present a gated convolutional neural network and a temporal attention-
based localization method for audio classification, which won the 1st place in the large-scale …

Cited by 265

[PDF] arxiv.org

Unsupervised learning of semantic audio representations

A Jansen, M Plakal, R Pandya… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org

Even in the absence of any explicit semantic annotation, vast collections of audio recordings
provide valuable information for learning the categorical structure of sounds. We consider …

Cited by 185

[PDF] arxiv.org

Audio set classification with attention model: A probabilistic perspective

Q Kong, Y Xu, W Wang… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org

This paper investigates the Audio Set classification. Audio Set is a large scale weakly
labelled dataset (WLD) of audio clips. In WLD only the presence of a label is known, without …

Cited by 124

Environmental audio scene and sound event recognition for autonomous surveillance: A survey and comparative studies

S Chandrakala, SL Jayalakshmi - ACM Computing Surveys (CSUR), 2019 - dl.acm.org

Monitoring of human and social activities is becoming increasingly pervasive in our living
environment for public security and safety applications. The recognition of suspicious events …

Cited by 102

[PDF] arxiv.org

Convolutional gated recurrent neural network incorporating spatial features for audio tagging

Y Xu, Q Kong, Q Huang, W Wang… - … Joint Conference on …, 2017 - ieeexplore.ieee.org

Environmental audio tagging is a newly proposed task to predict the presence or absence of
a specific audio event in a chunk. Deep neural network (DNN) based methods have been …

Cited by 121

Unsupervised feature learning based on deep models for environmental audio tagging

Deep learning on multi sensor data for counter UAV applications—A systematic review
