Text preprocessing for text mining in organizational research: Review and recommendations

L Hickman, S Thapa, L Tay, M Cao… - Organizational …, 2022 - journals.sagepub.com
Recent advances in text mining have provided new methods for capitalizing on the
voluminous natural language text data created by organizations, their employees, and their …

Multiword expression processing: A survey

M Constant, G Eryiğit, J Monti, L Van Der Plas… - Computational …, 2017 - direct.mit.edu
Multiword expressions (MWEs) are a class of linguistic forms spanning conventional word
boundaries that are both idiosyncratic and pervasive across different languages. The …

Still a pain in the neck: Evaluating text representations on lexical composition

V Shwartz, I Dagan - … of the Association for Computational Linguistics, 2019 - direct.mit.edu
Building meaningful phrase representations is challenging because phrase meanings are
not simply the sum of their constituent meanings. Lexical composition can shift the meanings …

Idiomatic expression identification using semantic compatibility

Z Zeng, S Bhat - Transactions of the Association for Computational …, 2021 - direct.mit.edu
Idiomatic expressions are an integral part of natural language and constantly being added to
a language. Owing to their non-compositionality and their ability to take on a figurative or …

Unsupervised compositionality prediction of nominal compounds

S Cordeiro, A Villavicencio, M Idiart… - Computational …, 2019 - direct.mit.edu
Nominal compounds such as red wine and nut case display a continuum of compositionality,
with varying contributions from the components of the compound to its semantics. This article …

Mark my word: A sequence-to-sequence approach to definition modeling

T Mickus, D Paperno, M Constant - arxiv preprint arxiv:1911.05715, 2019 - arxiv.org
Defining words in a textual context is a useful task both for practical purposes and for
gaining insight into distributed word representations. Building on the distributional …

Getting BART to ride the idiomatic train: Learning to represent idiomatic expressions

Z Zeng, S Bhat - Transactions of the Association for Computational …, 2022 - direct.mit.edu
Idiomatic expressions (IEs), characterized by their non-compositionality, are an important
part of natural language. They have been a classical challenge to NLP, including pre-trained …

Leveraging contextual embeddings and idiom principle for detecting idiomaticity in potentially idiomatic expressions

R Hashempour, A Villavicencio - … of the Workshop on the Cognitive …, 2020 - aclanthology.org
The majority of studies on detecting idiomatic expressions have focused on discovering
potentially idiomatic expressions overlooking the context. However, many idioms like blow …

Multiword expressions: between lexicography and NLP

P Gantar, L Colman, C Parra Escartín… - International Journal …, 2019 - academic.oup.com
The paper aims to establish a synergy between the lexicographic and natural language
processing (NLP) communities in relation to concepts and classifications of multiword …

Idiomatic expression paraphrasing without strong supervision

J Zhou, Z Zeng, H Gong, S Bhat - … of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org
Idiomatic expressions (IEs) play an essential role in natural language. In this paper, we
study the task of idiomatic sentence paraphrasing (ISP), which aims to paraphrase a …