- Academic Search

Deep audio-visual learning: A survey

H Zhu, MD Luo, R Wang, AH Zheng, R He - International Journal of …, 2021 - Springer

Audio-visual learning, aimed at exploiting the relationship between audio and visual
modalities, has drawn considerable attention since deep learning started to be used …

Cited by 191 · All 12 versions

[PDF] arxiv.org

Foundation models for music: A survey

Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis… - arXiv preprint arXiv …, 2024 - arxiv.org

In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …

Cited by 12 · All 4 versions

[PDF] arxiv.org

Auto-regressive image synthesis with integrated quantization

F Zhan, Y Yu, R Wu, J Zhang, K Cui, C Zhang… - European Conference on …, 2022 - Springer

Deep generative models have achieved conspicuous progress in realistic image synthesis
with multifarious conditional inputs, while generating diverse yet high-fidelity images …

Cited by 126 · All 8 versions

[PDF] thecvf.com

Music gesture for visual sound separation

C Gan, D Huang, H Zhao… - Proceedings of the …, 2020 - openaccess.thecvf.com

Recent deep learning approaches have achieved impressive performance on visual sound
separation tasks. However, these approaches are mostly built on appearance and optical …

Cited by 228 · All 9 versions

[PDF] thecvf.com

The sound of motions

H Zhao, C Gan, WC Ma… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

Sounds originate from object motions and vibrations of surrounding air. Inspired by the fact
that humans are capable of interpreting sound sources from how objects move visually, we …

Cited by 294 · All 8 versions

[PDF] arxiv.org

Foley music: Learning to generate music from videos

C Gan, D Huang, P Chen, JB Tenenbaum… - Computer Vision–ECCV …, 2020 - Springer

In this paper, we introduce Foley Music, a system that can synthesize plausible music for a
silent video clip about people playing musical instruments. We first identify two key …

Cited by 154 · All 8 versions

[PDF] arxiv.org

MT3: Multi-task multitrack music transcription

J Gardner, I Simon, E Manilow, C Hawthorne… - arXiv preprint arXiv …, 2021 - arxiv.org

Automatic Music Transcription (AMT), inferring musical notes from raw audio, is a
challenging task at the core of music understanding. Unlike Automatic Speech Recognition …

Cited by 107 · All 5 versions

[PDF] arxiv.org

Taming visually guided sound generation

V Iashin, E Rahtu - arXiv preprint arXiv:2110.08791, 2021 - arxiv.org

Recent advances in visually-induced audio generation are based on sampling short, low-
fidelity, and one-class sounds. Moreover, sampling 1 second of audio from the state-of-the …

Cited by 108 · All 6 versions

[PDF] arxiv.org

Multi-instrument music synthesis with spectrogram diffusion

C Hawthorne, I Simon, A Roberts, N Zeghidour… - arXiv preprint arXiv …, 2022 - arxiv.org

An ideal music synthesizer should be both interactive and expressive, generating high-
fidelity audio in realtime for arbitrary combinations of instruments and notes. Recent neural …

Cited by 67 · All 4 versions

[PDF] arxiv.org

GiantMIDI-Piano: A large-scale MIDI dataset for classical piano music

Q Kong, B Li, J Chen, Y Wang - arXiv preprint arXiv:2010.07061, 2020 - arxiv.org

Symbolic music datasets are important for music information retrieval and musical analysis.
However, there is a lack of large-scale symbolic datasets for classical piano music. In this …

Cited by 118 · All 7 versions

Creating a multitrack classical music performance dataset for multimodal music analysis:...

Deep audio-visual learning: A survey
