Statistical measures for usage‐based linguistics

ST Gries, NC Ellis - Language Learning, 2015‏ - Wiley Online Library
The advent of usage‐/exemplar‐based approaches has resulted in a major change in the
theoretical landscape of linguistics, but also in the range of methodologies that are brought …

Collocations in context: A new perspective on collocation networks

V Brezina, T McEnery, S Wattam - International journal of corpus …, 2015‏ - jbe-platform.com
The idea that text in a particular field of discourse is organized into lexical patterns, which
can be visualized as networks of words that collocate with each other, was originally …

Multiword expression processing: A survey

M Constant, G Eryiğit, J Monti, L Van Der Plas… - Computational …, 2017‏ - direct.mit.edu
Multiword expressions (MWEs) are a class of linguistic forms spanning conventional word
boundaries that are both idiosyncratic and pervasive across different languages. The …

[PDF][PDF] Automatic evaluation of topic coherence

D Newman, JH Lau, K Grieser… - … technologies: The 2010 …, 2010‏ - aclanthology.org
This paper introduces the novel task of topic coherence evaluation, whereby a set of words,
as generated by a topic model, is rated for coherence or interpretability. We apply a range of …

50-something years of work on collocations: What is or should be next…

ST Gries - International Journal of Corpus Linguistics, 2013‏ - jbe-platform.com
This paper explores ways in which research into collocation should be improved. After a
discussion of the parameters underlying the notion of collocation, the paper has three main …

[ספר][B] Quantitative corpus linguistics with R: A practical introduction

ST Gries - 2016‏ - taylorfrancis.com
As in its first edition, the new edition of Quantitative Corpus Linguistics with R demonstrates
how to process corpus-linguistic data with the open-source programming language and …

[ספר][B] Corpus linguistics and statistics with R

G Desagulier, G Desagulier, Amboy - 2017‏ - Springer
In the summer of 2008, I gave a talk at an international conference in Brighton. The talk was
about constructions involving multiple hedging in American English (eg, I'm gonna have to …

[PDF][PDF] Automatic labelling of topic models

JH Lau, K Grieser, D Newman… - Proceedings of the 49th …, 2011‏ - aclanthology.org
We propose a method for automatically labelling topics learned via LDA topic models. We
generate our label candidate set from the top-ranking topic terms, titles of Wikipedia articles …

[ספר][B] Syntax-based collocation extraction

V Seretan - 2011‏ - direct.mit.edu
Collocation is a common language phenomenon which has attracted the interest of
researchers in many subfields of both theoretical and computational linguistics. Although …

Beyond single-word measures: L2 writing assessment, lexical richness and formulaic competence

Y Bestgen - System, 2017‏ - Elsevier
Formulaic competence, the native-like use of ready-made sequences of words, is a key
aspect in the development of L2 writing proficiency. Becoming increasingly important in …