A survey on deep learning for symbolic music generation: Representations, algorithms, evaluations, and challenges

S Ji, X Yang, J Luo - ACM Computing Surveys, 2023 - dl.acm.org
Significant progress has been made in symbolic music generation with the help of deep
learning techniques. However, the tasks covered by symbolic music generation have not …

Sparks of large audio models: A survey and outlook

S Latif, M Shoukat, F Shamshad, M Usama… - arxiv preprint arxiv …, 2023 - arxiv.org
This survey paper provides a comprehensive overview of the recent advancements and
challenges in applying large language models to the field of audio signal processing. Audio …

Musecoco: Generating symbolic music from text

P Lu, X Xu, C Kang, B Yu, C **ng, X Tan… - arxiv preprint arxiv …, 2023 - arxiv.org
Generating music from text descriptions is a user-friendly mode since the text is a relatively
easy interface for user engagement. While some approaches utilize texts to control music …

A review of intelligent music generation systems

L Wang, Z Zhao, H Liu, J Pang, Y Qin, Q Wu - Neural Computing and …, 2024 - Springer
With the introduction of ChatGPT, the public's perception of AI-generated content has begun
to reshape. Artificial intelligence has significantly reduced the barrier to entry for non …

Diff-bgm: A diffusion model for video background music generation

S Li, Y Qin, M Zheng, X **… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
When editing a video a piece of attractive background music is indispensable. However
video background music generation tasks face several challenges for example the lack of …

Foundation models for music: A survey

Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis… - arxiv preprint arxiv …, 2024 - arxiv.org
In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …

Pit: Optimization of dynamic sparse deep learning models via permutation invariant transformation

N Zheng, H Jiang, Q Zhang, Z Han, L Ma… - Proceedings of the 29th …, 2023 - dl.acm.org
Dynamic sparsity, where the sparsity patterns are unknown until runtime, poses a significant
challenge to deep learning. The state-of-the-art sparsity-aware deep learning solutions are …

Musicagent: An ai agent for music understanding and generation with large language models

D Yu, K Song, P Lu, T He, X Tan, W Ye, S Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
AI-empowered music processing is a diverse field that encompasses dozens of tasks,
ranging from generation tasks (eg, timbre synthesis) to comprehension tasks (eg, music …

Virtual instrument performances (vip): A comprehensive review

T Kyriakou, MÁ de la Campa Crespo… - Computer Graphics …, 2024 - Wiley Online Library
Driven by recent advancements in Extended Reality (XR), the hype around the Metaverse,
and real‐time computer graphics, the transformation of the performing arts, particularly in …

Natural language processing methods for symbolic music generation and information retrieval: a survey

DVT Le, L Bigo, D Herremans, M Keller - ACM Computing Surveys, 2024 - dl.acm.org
Music is frequently associated with the notion of language as both domains share several
similarities, including the ability for their content to be represented as sequences of symbols …