Mert: Acoustic music understanding model with large-scale self-supervised training

Y Li, R Yuan, G Zhang, Y Ma, X Chen, H Yin… - arxiv preprint arxiv …, 2023 - arxiv.org
Self-supervised learning (SSL) has recently emerged as a promising paradigm for training
generalisable models on large-scale data in the fields of vision, text, and speech. Although …

[HTML][HTML] Language experience predicts music processing in a half-million speakers of fifty-four languages

J Liu, CB Hilton, E Bergelson, SA Mehr - Current Biology, 2023 - cell.com
Tonal languages differ from other languages in their use of pitch (tones) to distinguish
words. Lifelong experience speaking and hearing tonal languages has been argued to …

Human genomics and the biocultural origin of music

L Beccacece, P Abondio, E Cilli, D Restani… - International Journal of …, 2021 - mdpi.com
Music is an exclusive feature of humankind. It can be considered as a form of universal
communication, only partly comparable to the vocalizations of songbirds. Many trends of …

Bottom-up and top-down neural signatures of disordered multi-talker speech perception in adults with normal hearing

A Parthasarathy, KE Hancock, K Bennett, V DeGruttola… - Elife, 2020 - elifesciences.org
In social settings, speech waveforms from nearby speakers mix together in our ear canals.
Normally, the brain unmixes the attended speech stream from the chorus of background …

Individual differences in perception of the speech-to-song illusion are linked to musical aptitude but not musical training.

A Tierney, AD Patel, K Jasmin… - Journal of Experimental …, 2021 - psycnet.apa.org
In the speech-to-song illusion, certain spoken phrases are perceived as sung after
repetition. One possible explanation for this increase in musicality is that, as phrases are …

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training

LI Yizhi, R Yuan, G Zhang, Y Ma, X Chen… - The Twelfth …, 2023 - openreview.net
Self-supervised learning (SSL) has recently emerged as a promising paradigm for training
generalisable models on large-scale data in the fields of vision, text, and speech. Although …

Effects of language experience on domain-general perceptual strategies

K Jasmin, H Sun, AT Tierney - Cognition, 2021 - Elsevier
Speech and music are highly redundant communication systems, with multiple acoustic
cues signaling the existence of perceptual categories. This redundancy makes these …

[HTML][HTML] Auditory precision hypothesis-L2: Dimension-specific relationships between auditory processing and second language segmental learning

K Saito, M Kachlicka, Y Suzukida, K Petrova, BJ Lee… - Cognition, 2022 - Elsevier
Growing evidence suggests a broad relationship between individual differences in auditory
processing ability and the rate and ultimate attainment of language acquisition throughout …

Domain-general auditory processing as an anchor of post-pubertal second language pronunciation learning: Behavioural and neurophysiological investigations of …

K Saito, M Kachlicka, H Sun, A Tierney - Journal of Memory and Language, 2020 - Elsevier
In the cognitive psychology literature, auditory processing has been extensively researched
and suggested as a foundation of first language acquisition in childhood. This study tests an …

Domain-general auditory processing explains multiple dimensions of L2 acquisition in adulthood

K Saito, H Sun, M Kachlicka, JRC Alayo… - Studies in Second …, 2022 - cambridge.org
In this study, we propose a hypothesis that domain-general auditory processing, a
perceptual anchor of L1 acquisition, can serve as the foundation of successful post-pubertal …