- Academic Search

M Benzeghiba, R De Mori, O Deroo, S Dupont… - Speech …, 2007 - Elsevier

Major progress is being recorded regularly on both the technology and exploitation of
automatic speech recognition (ASR) and spoken language systems. However, there are still …

Cited by 777 · All 24 versions

[PDF] arxiv.org

Query expansion with locally-trained word embeddings

F Diaz, B Mitra, N Craswell - ar…

… a system for automatically
transcribing and indexing audio-visual academic lectures for audio information retrieval. We …

Cited by 119 · All 13 versions

[PDF] aclanthology.org

[PDF] Style & topic language model adaptation using HMM-LDA

BJP Hsu, J Glass - Proceedings of the 2006 Conference on …, 2006 - aclanthology.org

Adapting language models across styles and topics, such as for lecture transcription,
involves combining generic style models with topic-specific content relevant to the target …

Cited by 83 · All 11 versions · HTML version

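A speech recognition system for National Diet deliberations to support the creation of meeting records

秋田祐哉，三村**人，河原達也 - 電子情報通信学会論文誌 D, 2010 - search.ieice.org

We are researching and developing a speech recognition system intended to support the creation of meeting records for National Diet deliberations.
Because meeting records in principle transcribe and record every utterance, applying speech recognition calls for high …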

Cited by 17 · All 4 versions

Do smart speaker skills support diverse audiences?

HA Shafei, CC Tan - Pervasive and Mobile Computing, 2022 - Elsevier

Smart speakers with voice assistants like Google Home or Amazon Alexa are increasingly
popular and essential in our daily lives due to their convenience of issuing voice commands …

Cited by 3 · All 3 versions

[PDF] researchgate.net

Automatic lecture transcription by exploiting presentation slide information for language model adaptation

T Kawahara, Y Nemoto, Y Akita - 2008 IEEE International …, 2008 - ieeexplore.ieee.org

The paper addresses language model adaptation for automatic lecture transcription by fully
exploiting presentation slide information used in the lecture. As the text in the presentation …

Cited by 51 · All 8 versions

[PDF] kyoto-u.ac.jp

Exploiting automatic speech recognition errors to enhance partial and synchronized caption for facilitating second language listening

MS Mirzaei, K Meshgi, T Kawahara - Computer Speech & Language, 2018 - Elsevier

This paper addresses the viability of using Automatic Speech Recognition (ASR) errors as
the predictor of difficulties in speech segments, thereby exploiting them to improve Partial …

Cited by 21 · All 6 versions

[PDF] researchgate.net

A new ASR evaluation measure and minimum Bayes-risk decoding for open-domain speech understanding

H Nanjo, T Kawahara - Proceedings.(ICASSP'05). IEEE …, 2005 - ieeexplore.ieee.org

A new evaluation measure of speech recognition and a decoding strategy for keyword-
based open-domain speech understanding are presented. Conventionally, WER (word error …

Cited by 40 · All 8 versions

[PDF] nycu.edu.tw

Modeling of speaking rate influences on Mandarin speech prosody and its application to speaking rate-controlled TTS

SH Chen, CH Hsieh, CY Chiang… - … ACM transactions on …, 2014 - ieeexplore.ieee.org

A new data-driven approach to building a speaking rate-dependent hierarchical prosodic
model (SR-HPM), directly from a large prosody-unlabeled speech database containing …

Cited by 27 · All 7 versions

Language model and speaking rate adaptation for spontaneous presentation speech recognition

Automatic speech recognition and speech variability: A review
