- Academic Search

R Zhen, W Song, Q He, J Cao, L Shi, J Luo - Electronics, 2023 - mdpi.com

Virtual human is widely employed in various industries, including personal assistance,
intelligent customer service, and online education, thanks to the rapid development of …

保存引用被引用数: 51 関連記事全 3 バージョンキャッシュ

[Free GPT-4]

[PDF] arxiv.org

Funasr: A fundamental end-to-end speech recognition toolkit

Z Gao, Z Li, J Wang, H Luo, X Shi, M Chen, Y Li… - ar** text-to-speech (TTS) systems for a variety of real-world …

保存引用被引用数: 2 関連記事全 4 バージョン

CNAMD Corpus: A Chinese Natural Audiovisual Multimodal Database of Conversations for Social Interactive Agents

J Wu, S Chen, W ** companion Socially Interactive Agents
(SIAs) that provide companionship and reduce loneliness. However, recent works focus on …

保存引用被引用数: 3 関連記事全 3 バージョン

Alchemy: Data-Free Adversarial Training

Y Bai, Z Ma, Y Chen, J Deng, S Pang, Y Liu… - Proceedings of the 2024 …, 2024 - dl.acm.org

Machine learning models have become integral to various aspects of daily life, prompting
increased vulnerability to adversarial attacks. Adversarial training is one of the most …

保存引用関連記事

[Free GPT-4]

[PDF] everyvoice.ca

[PDF][PDF] Speech generation for indigenous language education

RK Kazantsevaa, R Kuhna, S Larkina… - Computer Speech & …, 2024 - docs.everyvoice.ca

The vast majority of the world's languages are unable to follow in the footsteps of existing
resource-intensive pathways to building text-to-speech (TTS) systems. But, as the quality of …

保存引用被引用数: 1 関連記事全 2 バージョン HTMLバージョン

[Free GPT-4]

[PDF] mdpi.com

Grammar-supervised end-to-end speech recognition with part-of-speech tagging and dependency parsing

G Wan, T Mao, J Zhang, H Chen, J Gao, Z Ye - Applied Sciences, 2023 - mdpi.com

For most automatic speech recognition systems, many unacceptable hypothesis errors still
make the recognition results absurd and difficult to understand. In this paper, we introduce …

保存引用被引用数: 3 関連記事全 3 バージョンキャッシュ

[Free GPT-4]

[PDF] arxiv.org

Looking and listening: Audio guided text recognition

W Yu, M Liu, B Yang, E Zhang, D Jiang, X Sun… - arxiv preprint arxiv …, 2023 - arxiv.org

Text recognition in the wild is a long-standing problem in computer vision. Driven by end-to-
end deep learning, recent studies suggest vision and language processing are effective for …

保存引用被引用数: 2 関連記事全 2 バージョン HTMLバージョン

アラートを作成

引用

検索オプション

マイライブラリに保存しました

Paddlespeech: An easy-to-use all-in-one speech toolkit

Human-computer interaction system: A survey of talking-head generation

Funasr: A fundamental end-to-end speech recognition toolkit

CNAMD Corpus: A Chinese Natural Audiovisual Multimodal Database of Conversations for Social Interactive Agents

Alchemy: Data-Free Adversarial Training

[PDF][PDF] Speech generation for indigenous language education

Grammar-supervised end-to-end speech recognition with part-of-speech tagging and dependency parsing

Looking and listening: Audio guided text recognition