[HTML][HTML] Video and audio deepfake datasets and open issues in deepfake technology: being ahead of the curve

Z Akhtar, TL Pendyala, VS Athmakuri - Forensic Sciences, 2024 - mdpi.com
The revolutionary breakthroughs in Machine Learning (ML) and Artificial Intelligence (AI) are
extensively being harnessed across a diverse range of domains, eg, forensic science …

TIMIT-TTS: A text-to-speech dataset for multimodal synthetic media detection

D Salvi, B Hosler, P Bestagini, MC Stamm… - IEEE …, 2023 - ieeexplore.ieee.org
With the rapid development of deep learning techniques, the generation and counterfeiting
of multimedia material has become increasingly simple. Current technology enables the …

Hierarchical classification for instrument activity detection in orchestral music recordings

M Krause, M Müller - IEEE/ACM Transactions on Audio, Speech …, 2023 - ieeexplore.ieee.org
Instrument activity detection is a fundamental task in music information retrieval, serving as a
basis for many applications, such as music recommendation, music tagging, or remixing …

[HTML][HTML] Wagner Ring Dataset: A complex opera scenario for music processing and computational musicology

C Weiß, V Arifi-Müller, M Krause… - Transactions of the …, 2023 - transactions.ismir.net
This paper introduces the Wagner Ring Dataset (WRD), a multi-modal and multi-version
resource on the large-scale opera cycle Der Ring des Nibelungen by Richard Wagner. The …

PiCoGen2: Piano cover generation with transfer learning approach and weakly aligned data

CP Tan, H Ai, YH Chang, SH Guan… - arxiv preprint arxiv …, 2024 - arxiv.org
Piano cover generation aims to create a piano cover from a pop song. Existing approaches
mainly employ supervised learning and the training demands strongly-aligned and paired …

[PDF][PDF] Automatic Note-Level Score-to-Performance Alignments in the ASAP Dataset.

SD Peter, CEC Chacón, F Foscarin… - Trans. Int. Soc …, 2023 - pdfs.semanticscholar.org
Several MIR applications require fine-grained note alignments between MIDI performances
and their musical scores for training and evaluation. However, large and high-quality …

Source Separation of Piano Concertos Using Musically Motivated Augmentation Techniques

Y Özer, M Müller - IEEE/ACM Transactions on Audio, Speech …, 2024 - ieeexplore.ieee.org
In this work, we address the novel and rarely considered source separation task of
decomposing piano concerto recordings into separate piano and orchestral tracks. Being a …

[PDF][PDF] Weakly Supervised Multi-Pitch Estimation Using Cross-Version Alignment.

M Krause, S Strahl, M Müller - ISMIR, 2023 - audiolabs-erlangen.de
ABSTRACT Multi-pitch estimation (MPE), the task of detecting active pitches within a
polyphonic music recording, has garnered significant research interest in recent years. Most …

[PDF][PDF] Notewise evaluation for music source separation: A case study for separated piano tracks

Y Özer, HU Berendes, V Arifi-Müller… - Submitted for …, 2024 - audiolabs-erlangen.de
Deep learning has significantly advanced music source separation (MSS), aiming to
decompose music recordings into individual tracks corresponding to singing or specific …

[HTML][HTML] BPSD: A coherent multi-version dataset for analyzing the first movements of beethoven's piano sonatas

J Zeitler, C Weiß, V Arifi-Müller… - Transactions of the …, 2024 - transactions.ismir.net
This paper introduces the Beethoven Piano Sonata Dataset (BPSD), a multi-version dataset
focusing on the first movements of Beethoven's 32 piano sonatas. Recognized as pivotal …