Spoken document understanding and organization

L Lee, B Chen - IEEE Signal Processing Magazine, 2005 - ieeexplore.ieee.org
Spoken documents (or associated multimedia content) are in fact better understood and
reorganized in a way that retrieval/browsing can be performed easily. For example, they are …

MATBN: A Mandarin Chinese broadcast news corpus

HM Wang, B Chen, JW Kuo… - International Journal of …, 2005 - aclanthology.org
The MATBN Mandarin Chinese broadcast news corpus contains a total of 198
hours of broadcast news from the Public Television Service Foundation (Taiwan) with …

Live streaming speech recognition using deep bidirectional LSTM acoustic models and interpolated language models

J Jorge, A Giménez, JA Silvestre-Cerdà… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Although Long Short-Term Memory (LSTM) networks and deep Transformers are now
extensively used in offline ASR, it is unclear how best offline systems can be adapted to …

Extractive broadcast news summarization leveraging recurrent neural network language modeling techniques

KY Chen, SH Liu, B Chen, HM Wang… - … on Audio, Speech …, 2015 - ieeexplore.ieee.org
Extractive text or speech summarization manages to select a set of salient sentences from
an original document and concatenate them to form a summary, enabling users to better …

Combining relevance language modeling and clarity measure for extractive speech summarization

SH Liu, KY Chen, B Chen, HM Wang… - … on Audio, Speech …, 2015 - ieeexplore.ieee.org
Extractive speech summarization, which purports to select an indicative set of sentences
from a spoken document so as to succinctly represent the most important aspects of the …

Word topic models for spoken document retrieval and transcription

B Chen - ACM Transactions on Asian Language Information …, 2009 - dl.acm.org
Statistical language modeling (LM), which aims to capture the regularities in human natural
language and quantify the acceptability of a given word sequence, has long been an …

A probabilistic generative framework for extractive broadcast news speech summarization

YT Chen, B Chen, HM Wang - IEEE Transactions on Audio …, 2008 - ieeexplore.ieee.org
In this paper, we consider extractive summarization of broadcast news speech and propose
a unified probabilistic generative framework that combines the sentence generative …

Leveraging Kullback–Leibler divergence measures and information-rich cues for speech summarization

SH Lin, YM Yeh, B Chen - IEEE Transactions on Audio, Speech …, 2010 - ieeexplore.ieee.org
Imperfect speech recognition often leads to degraded performance when exploiting
conventional text-based methods for speech summarization. To alleviate this problem, this …

Enhanced language modeling with proximity and sentence relatedness information for extractive broadcast news summarization

SH Liu, KY Chen, B Chen - ACM Transactions on Asian and Low …, 2020 - dl.acm.org
The primary task of extractive summarization is to automatically select a set of representative
sentences from a text or spoken document that can concisely express the most important …

Exploring the use of unsupervised query modeling techniques for speech recognition and summarization

KY Chen, SH Liu, B Chen, HM Wang, HH Chen - Speech Communication, 2016 - Elsevier
Statistical language modeling (LM) that intends to quantify the acceptability of a given piece
of text has long been an interesting yet challenging research area. In particular, language …