One deep music representation to rule them all? A comparative analysis of different representation learning strategies

J Kim, J Urbano, CCS Liem, A Hanjalic - Neural Computing and …, 2020 - Springer
Inspired by the success of deploying deep learning in the fields of Computer Vision and
Natural Language Processing, this learning paradigm has also found its way into the field of …

Automatic Assessment of Piano Performances Using Timbre and Pitch Features

V Phanichraksaphong, WH Tsai - Electronics, 2023 - mdpi.com
To assist piano learners with the improvement of their skills, this study investigates
techniques for automatically assessing piano performances based on timbre and pitch …

[HTML][HTML] Disruptive situation detection on public transport through speech emotion recognition

E Mancini, A Galassi, F Ruggeri, P Torroni - Intelligent Systems with …, 2024 - Elsevier
Disruptive situations are emotionally-charged events diverging from ordinary behavior, like
people fighting or screaming. Public transports are one type of social environment where …

Cell dynamic morphology classification using deep convolutional neural networks

H Li, F Pang, Y Shi, Z Liu - Cytometry Part A, 2018 - Wiley Online Library
Cell morphology is often used as a proxy measurement of cell status to understand cell
physiology. Hence, interpretation of cell dynamic morphology is a meaningful task in …

[HTML][HTML] OneBitPitch (OBP): Ultra-High-Speed Pitch Detection Algorithm Based on One-Bit Quantization and Modified Autocorrelation

D Coccoluto, V Cesarini, G Costantini - Applied Sciences, 2023 - mdpi.com
Featured Application Fast pitch detection algorithm for the real-time estimation of the
fundamental frequency, optimized for hardware implementation. Abstract This paper …

A hybrid neural network based on the duplex model of pitch perception for singing melody extraction

H Chou, MT Chen, TS Chi - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
In this paper, we build up a hybrid neural network (NN) for singing melody extraction from
polyphonic music by imitating human pitch perception. For human hearing, there are two …

[PDF][PDF] Estimation of Fundamental Frequency from Singing Voice Using Harmonics of Impulse-like Excitation Source.

SR Kadiri, B Yegnanarayana - Interspeech, 2018 - isca-archive.org
This paper focuses on the problem of estimating fundamental frequency from singing voice.
Estimation of fundamental frequency is a well studied topic in the speech research …

A novel pitch extraction based on jointly trained deep BLSTM recurrent neural networks with bottleneck features

B Liu, J Tao, D Zhang, Y Zheng - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Pitch is an important characteristic of speech and is useful for many applications. However, it
is still challenging to estimate pitch in strong noise. In this paper, we propose a joint training …

F0 Estimation From Telephone Speech Using Deep Feature Loss

SM Shetty, S Revankar, NC Iyer… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Accurate pitch estimation in speech signal plays a vital role in several applications. Robust
pitch estimation in telephone speech is still a challenge due to the narrow bandwidth of the …

Improving deep neural network based speech synthesis through contextual feature parametrization and multi-task learning

Z Wen, K Li, Z Huang, CH Lee, J Tao - Journal of Signal Processing …, 2018 - Springer
We propose three techniques to improve speech synthesis based on deep neural network
(DNN). First, at the DNN input we use real-valued contextual feature vector to represent …