Strong labeling of sound events using crowdsourced weak labels and annotator competence estimation

I Martín-Morató, A Mesaros - IEEE/ACM transactions on audio …, 2023 - ieeexplore.ieee.org
Crowdsourcing is a popular tool for collecting large amounts of annotated data, but the
specific format of the strong labels necessary for sound event detection is not easily …

Characterising soundscape research in human-computer interaction

SS Johansen, N Van Berkel, J Fritsch - Proceedings of the 2022 ACM …, 2022 - dl.acm.org
'Soundscapes' are an increasingly active topic in Human-Computer Interaction (HCI) and
interaction design. From map** acoustic environments through sound recordings to …

Quality aspects of annotated data: A research synthesis

J Beck - AStA Wirtschafts-und Sozialstatistisches Archiv, 2023 - Springer
Abstract The quality of Machine Learning (ML) applications is commonly assessed by
quantifying how well an algorithm fits its respective training data. Yet, a perfect model that …

Training sound event detection with soft labels from crowdsourced annotations

I Martín-Morató, M Harju, P Ahokas… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
In this paper, we study the use of soft labels to train a system for sound event detection
(SED). Soft labels can result from annotations which account for human uncertainty about …

Leveraging hierarchical structures for few-shot musical instrument recognition

HF Garcia, A Aguilar, E Manilow, B Pardo - arxiv preprint arxiv …, 2021 - arxiv.org
Deep learning work on musical instrument recognition has generally focused on instrument
classes for which we have abundant data. In this work, we exploit hierarchical relationships …

SONYC urban sound tagging (SONYC-UST): A multilabel dataset from an urban acoustic sensor network

M Cartwright, AEM Mendez, A Cramer, V Lostanlen… - 2019 - archive.nyu.edu
SONYC Urban Sound Tagging (SONYC-UST) is a dataset for the development and
evaluation of machine listening systems for real-world urban noise monitoring. It consists of …

What is the ground truth? reliability of multi-annotator data for audio tagging

I Martín-Morató, A Mesaros - 2021 29th European Signal …, 2021 - ieeexplore.ieee.org
Crowdsourcing has become a common approach for annotating large amounts of data. It
has the advantage of harnessing a large workforce to produce large amounts of data in a …

[HTML][HTML] The life of a New York City noise sensor network

C Mydlarz, M Sharma, Y Lockerman, B Steers, C Silva… - Sensors, 2019 - mdpi.com
Noise pollution is one of the topmost quality of life issues for urban residents in the United
States. Continued exposure to high levels of noise has proven effects on health, including …

Closing the Knowledge Gap in Designing Data Annotation Interfaces for AI-powered Disaster Management Analytic Systems

Z Ara, H Salemi, SR Hong, Y Senarath… - Proceedings of the 29th …, 2024 - dl.acm.org
Data annotation interfaces predominantly leverage ground truth labels to guide annotators
toward accurate responses. With the growing adoption of Artificial Intelligence (AI) in domain …

SONYC-UST-V2: An urban sound tagging dataset with spatiotemporal context

M Cartwright, J Cramer, AEM Mendez, Y Wang… - arxiv preprint arxiv …, 2020 - arxiv.org
We present SONYC-UST-V2, a dataset for urban sound tagging with spatiotemporal
information. This dataset is aimed for the development and evaluation of machine listening …