- Academic Search

Y Lee, K Jang, J Goo, Y Jung, H Kim - ar…

… human in the loop of drone-assisted inspection

Y Li, A Parsan, B Wang, P Dong, S Yao… - Engineering Applications of …, 2023 - Elsevier

Audio commands are a preferred communication medium to keep inspectors in the loop of
civil infrastructure inspection performed by a semi-autonomous drone. To understand job …

Cited by 7, all 6 versions

[PDF] arxiv.org

Adapting TTS models for new speakers using transfer learning

P Neekhara, J Li, B Ginsburg - arXiv preprint arXiv:2110.05798, 2021 - arxiv.org

Training neural text-to-speech (TTS) models for a new speaker typically requires several
hours of high quality speech data. Prior works on voice cloning attempt to address this …

Cited by 12, all 2 versions

[PDF] mdpi.com

Automatic Fluency Assessment Method for Spontaneous Speech without Reference Text

J Liu, A Wumaier, C Fan, S Guo - Electronics, 2023 - mdpi.com

The automatic fluency assessment of spontaneous speech without reference text is a
challenging task that heavily depends on the accuracy of automatic speech recognition …

Cited by 5, all 3 versions

[PDF] arxiv.org

One-Step Knowledge Distillation and Fine-Tuning in Using Large Pre-Trained Self-Supervised Learning Models for Speaker Verification

J Heo, C Lim, J Kim, H Shin, HJ Yu - arXiv preprint arXiv:2305.17394, 2023 - arxiv.org

The application of speech self-supervised learning (SSL) models has achieved remarkable
performance in speaker verification (SV). However, there is a computational cost hurdle in …

Cited by 2, all 5 versions

[PDF] aalto.fi

[PDF] Multi-task wav2vec2 serving as a pronunciation training system for children

Y Getman, R Al-Ghezi, T Grosz… - 9th Workshop on Speech …, 2023 - research.aalto.fi

Computer-assisted learning tools (CAPT) are increasingly reliant on AI tools. Recent studies
demonstrated how neural systems pre-trained in a self-supervised fashion, such as …

Cited by 4, all 6 versions

[PDF] arxiv.org

SKILL: Similarity-aware Knowledge distILLation for Speech Self-Supervised Learning

L Zampierin, GB Hacene, B Nguyen… - arXiv preprint arXiv …, 2024 - arxiv.org

Self-supervised learning (SSL) has achieved remarkable success across various speech-
processing tasks. To enhance its efficiency, previous works often leverage the use of …

Cited by 3, all 2 versions

[PDF] arxiv.org

SelfVC: Voice Conversion With Iterative Refinement using Self Transformations

P Neekhara, S Hussain, R Valle, B Ginsburg… - arXiv preprint arXiv …, 2023 - arxiv.org

We propose SelfVC, a training strategy to iteratively improve a voice conversion model with
self-synthesized examples. Previous efforts on voice conversion focus on explicitly …

Cited by 3, all 5 versions

Multi-task voice activated framework using self-supervised learning

Fithubert: Going thinner and deeper for knowledge distillation of speech self-supervised learning
