Enriching speech recognition with automatic detection of sentence boundaries and disfluencies

Y Liu, E Shriberg, A Stolcke, D Hillard… - … on audio, speech …, 2006 - ieeexplore.ieee.org
Effective human and automatic processing of speech requires recovery of more than just the
words. It also involves recovering phenomena such as sentence boundaries, filler words …

[PDF][PDF] Direct modeling of prosody: An overview of applications in automatic speech processing

E Shriberg, A Stolcke - … of the International Conference on Speech …, 2004 - isca-archive.org
We describe a “direct modeling” approach to using prosody in various speech technology
tasks. The approach does not involve any hand-labeling or modeling of prosodic events …

A study in machine learning from imbalanced data for sentence boundary detection in speech

Y Liu, NV Chawla, MP Harper, E Shriberg… - Computer Speech & …, 2006 - Elsevier
Enriching speech recognition output with sentence boundaries improves its human
readability and enables further processing by downstream language processing modules …

[PDF][PDF] Noisy BiLSTM-Based Models for Disfluency Detection.

N Bach, F Huang - Interspeech, 2019 - isca-archive.org
This paper describes BiLSTM-based models to disfluency detection in speech transcripts
using residual BiLSTM blocks, self-attention, and noisy training approach. Our best model …

[PDF][PDF] Disfluency detection with a semi-markov model and prosodic features

J Ferguson, G Durrett, D Klein - … of the 2015 Conference of the …, 2015 - aclanthology.org
We present a discriminative model for detecting disfluencies in spoken language transcripts.
Structurally, our model is a semi-Markov conditional random field with features targeting …

Improving meeting inclusiveness using speech interruption analysis

SW Fu, Y Fan, Y Hosseinkashi, J Gupchup… - Proceedings of the 30th …, 2022 - dl.acm.org
Meetings are a pervasive method of communication within all types of companies and
organizations, and using remote collaboration systems to conduct meetings has increased …

[PDF][PDF] Joint transition-based dependency parsing and disfluency detection for automatic speech recognition texts

M Yoshikawa - 2017 - naist.repo.nii.ac.jp
Joint dependency parsing with disfluency detection is an important task in speech language
processing. Recent methods show high performance for this task, although most authors …

[PDF][PDF] Comparing HMM, maximum entropy, and conditional random fields for disfluency detection.

Y Liu, E Shriberg, A Stolcke, MP Harper - Interspeech, 2005 - academia.edu
Automatic detection of disfluencies in spoken language is important for making speech
recognition output more readable, and for aiding downstream language processing …

[PDF][PDF] A lexically-driven algorithm for disfluency detection

M Snover, B Dorr, R Schwartz - … of HLT-NAACL 2004: Short Papers, 2004 - aclanthology.org
This paper describes a transformation-based learning approach to disfluency detection in
speech transcripts using primarily lexical features. Our method produces comparable results …

Combining lexical, syntactic and prosodic cues for improved online dialog act tagging

VKR Sridhar, S Bangalore, S Narayanan - Computer Speech & Language, 2009 - Elsevier
Prosody is an important cue for identifying dialog acts. In this paper, we show that modeling
the sequence of acoustic–prosodic values as n-gram features with a maximum entropy …