A comprehensive survey on pretrained foundation models: A history from bert to chatgpt

C Zhou, Q Li, C Li, J Yu, Y Liu, G Wang… - International Journal of …, 2024 - Springer
Abstract Pretrained Foundation Models (PFMs) are regarded as the foundation for various
downstream tasks across different data modalities. A PFM (eg, BERT, ChatGPT, GPT-4) is …

A survey on deep learning for symbolic music generation: Representations, algorithms, evaluations, and challenges

S Ji, X Yang, J Luo - ACM Computing Surveys, 2023 - dl.acm.org
Significant progress has been made in symbolic music generation with the help of deep
learning techniques. However, the tasks covered by symbolic music generation have not …

Museformer: Transformer with fine-and coarse-grained attention for music generation

B Yu, P Lu, R Wang, W Hu, X Tan… - Advances in …, 2022 - proceedings.neurips.cc
Symbolic music generation aims to generate music scores automatically. A recent trend is to
use Transformer or its variants in music generation, which is, however, suboptimal, because …

Multimodal pretraining, adaptation, and generation for recommendation: A survey

Q Liu, J Zhu, Y Yang, Q Dai, Z Du, XM Wu… - Proceedings of the 30th …, 2024 - dl.acm.org
Personalized recommendation serves as a ubiquitous channel for users to discover
information tailored to their interests. However, traditional recommendation models primarily …

Sparks of large audio models: A survey and outlook

S Latif, M Shoukat, F Shamshad, M Usama… - arxiv preprint arxiv …, 2023 - arxiv.org
This survey paper provides a comprehensive overview of the recent advancements and
challenges in applying large language models to the field of audio signal processing. Audio …

MidiBERT-piano: large-scale pre-training for symbolic music understanding

YH Chou, I Chen, CJ Chang, J Ching… - arxiv preprint arxiv …, 2021 - arxiv.org
This paper presents an attempt to employ the mask language modeling approach of BERT
to pre-train a 12-layer Transformer model over 4,166 pieces of polyphonic piano MIDI files …

Musecoco: Generating symbolic music from text

P Lu, X Xu, C Kang, B Yu, C **ng, X Tan… - arxiv preprint arxiv …, 2023 - arxiv.org
Generating music from text descriptions is a user-friendly mode since the text is a relatively
easy interface for user engagement. While some approaches utilize texts to control music …

Foundation models for music: A survey

Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis… - arxiv preprint arxiv …, 2024 - arxiv.org
In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …

Natural language processing methods for symbolic music generation and information retrieval: A survey

DVT Le, L Bigo, D Herremans, M Keller - ACM Computing Surveys, 2024 - dl.acm.org
Music is frequently associated with the notion of language as both domains share several
similarities, including the ability for their content to be represented as sequences of symbols …

Multitrack music transformer

HW Dong, K Chen, S Dubnov… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Existing approaches for generating multitrack music with transformer models have been
limited in terms of the number of instruments, the length of the music segments and slow …