Optimizing statistical machine translation for text simplification
Most recent sentence simplification systems use basic machine translation models to learn
lexical and syntactic paraphrases from a manually simplified parallel corpus. These methods …
lexical and syntactic paraphrases from a manually simplified parallel corpus. These methods …
[PDF][PDF] PPDB: The paraphrase database
We present the 1.0 release of our paraphrase database, PPDB. Its English portion, PPDB:
Eng, contains over 220 million paraphrase pairs, consisting of 73 million phrasal and 8 …
Eng, contains over 220 million paraphrase pairs, consisting of 73 million phrasal and 8 …
[LIBRO][B] Recognizing textual entailment: Models and applications
In the last few years, a number of NLP researchers have developed and participated in the
task of Recognizing Textual Entailment (RTE). This task encapsulates Natural Language …
task of Recognizing Textual Entailment (RTE). This task encapsulates Natural Language …
[PDF][PDF] Annotated gigaword
We have created layers of annotation on the English Gigaword v. 5 corpus to render it useful
as a standardized corpus for knowledge extraction and distributional semantics. Most …
as a standardized corpus for knowledge extraction and distributional semantics. Most …
[PDF][PDF] The Multilingual Paraphrase Database.
We release a massive expansion of the paraphrase database (PPDB) that now includes a
collection of paraphrases in 23 different languages. The resource is derived from large …
collection of paraphrases in 23 different languages. The resource is derived from large …
[PDF][PDF] Unsupervised sentence enhancement for automatic summarization
We present sentence enhancement as a novel technique for text-to-text generation in
abstractive summarization. Compared to extraction or previous approaches to sentence …
abstractive summarization. Compared to extraction or previous approaches to sentence …
[PDF][PDF] Learning verb inference rules from linguistically-motivated evidence
Learning inference relations between verbs is at the heart of many semantic applications.
However, most prior work on learning such rules focused on a rather narrow set of …
However, most prior work on learning such rules focused on a rather narrow set of …
[PDF][PDF] Global Learning of Textual Entailment Graphs
J Berant - 2012 - nlp.stanford.edu
Two decades after the introduction of the World Wide Web, the world is experiencing an
“information explosion”, that is, a rapid increase in the amount of available textual …
“information explosion”, that is, a rapid increase in the amount of available textual …
[PDF][PDF] Jerboa: A toolkit for randomized and streaming algorithms
B Van Durme - 2012 - academia.edu
Recent studies have shown the applicability of streaming and randomized algorithms in a
variety of large-scale language mining tasks. However, lack of many publicly available …
variety of large-scale language mining tasks. However, lack of many publicly available …
Large-Scale Paraphrasing for Text-to-Text Generation
J Ganitkevic - 2018 - jscholarship.library.jhu.edu
We present our work on the extraction and estimation of syntactic paraphrases using
commodity text data and automated linguistic annotation. Our initial approach leverages …
commodity text data and automated linguistic annotation. Our initial approach leverages …