APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding Y Ai, XH Jiang, YX Lu, HP Du, ZH Ling arXiv preprint arXiv:2402.10533, 2024 | 20 | 2024 |
APNet2: High-Quality and High-Efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra HP Du, YX Lu, Y Ai, ZH Ling National Conference on Man-Machine Speech Communication, 66-80, 2023 | 8 | 2023 |
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction YX Lu, Y Ai, HP Du, ZH Ling arXiv preprint arXiv:2401.06387, 2024 | 6 | 2024 |
BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation HP Du, YX Lu, Y Ai, ZH Ling arXiv preprint arXiv:2406.02162, 2024 | 2 | 2024 |
Considering Temporal Connection between Turns for Conversational Speech Synthesis K Mei, Z Liu, H Du, H Li, Y Ai, L Chen, Z Ling ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 2 | 2024 |
APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm HP Du, Y Ai, RC Zheng, ZH Ling arXiv preprint arXiv:2410.22807, 2024 | 1 | 2024 |
ERVQ: Enhanced Residual Vector Quantization with Intra-and-Inter-Codebook Optimization for Neural Audio Codecs RC Zheng, HP Du, XH Jiang, Y Ai, ZH Ling arXiv preprint arXiv:2410.12359, 2024 | 1 | 2024 |
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis YX Lu, HP Du, ZY Sheng, Y Ai, ZH Ling arXiv preprint arXiv:2412.16977, 2024 | | 2024 |
A Neural Denoising Vocoder for Clean Waveform Generation from Noisy Mel-Spectrogram based on Amplitude and Phase Predictions HP Du, YX Lu, Y Ai, ZH Ling arXiv preprint arXiv:2411.12268, 2024 | | 2024 |
SAMOS: A Neural MOS Prediction Model Leveraging Semantic Representations and Acoustic Features YF Shi, Y Ai, YX Lu, HP Du, ZH Ling arXiv preprint arXiv:2411.11232, 2024 | | 2024 |
ESTVocoder: An Excitation-Spectral-Transformed Neural Vocoder Conditioned on Mel Spectrogram XH Jiang, HP Du, Y Ai, YX Lu, ZH Ling arXiv preprint arXiv:2411.11258, 2024 | | 2024 |
Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion YF Shi, Y Ai, YX Lu, HP Du, ZH Ling arXiv preprint arXiv:2411.11123, 2024 | | 2024 |
MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios XH Jiang, Y Ai, RC Zheng, HP Du, YX Lu, ZH Ling arXiv preprint arXiv:2411.00464, 2024 | | 2024 |
Stage-Wise and Prior-Aware Neural Speech Phase Prediction F Liu, Y Ai, HP Du, YX Lu, RC Zheng, ZH Ling arXiv preprint arXiv:2410.04990, 2024 | | 2024 |
A Neural Denoising Vocoder for Clean Waveform Generation from Noisy HP Du, YX Lu, Y Ai, ZH Ling Man-Machine Speech Communication: 19th National Conference, NCMMSC 2024 …, 0 | | |
Excitation-Spectral-Transformed Neural Vocoder Conditioned on Mel Spectrogram XH Jiang, HP Du, Y Ai, YX Lu, ZH Ling Man-Machine Speech Communication: 19th National Conference, NCMMSC 2024 …, 0 | | |