Linguistic annotation in/for corpus linguistics

ST Gries, AL Berez - Handbook of linguistic annotation, 2017 - Springer
This article surveys linguistic annotation in corpora and corpus linguistics. We first define the
concept of 'corpus' as a radial category and then, in Sect. 2, discuss a variety of kinds of …

[KSIĄŻKA][B] Handbook of natural language processing

N Indurkhya, FJ Damerau - 2010 - taylorfrancis.com
The Handbook of Natural Language Processing, Second Edition presents practical tools
and techniques for implementing natural language processing in computer systems. Along …

[KSIĄŻKA][B] Introduction: The handbook of linguistic annotation

N Ide - 2017 - Springer
Abstract The Handbook of Linguistic Annotation provides a comprehensive survey of the
development and state-of-the-art for linguistic annotation of language resources, including …

[KSIĄŻKA][B] Spoken corpus linguistics: From monomodal to multimodal

S Adolphs, R Carter - 2013 - taylorfrancis.com
In this book, Adolphs and Carter explore key approaches to work in spoken corpus
linguistics. The book discusses some of the pioneering challenges faced in designing …

[KSIĄŻKA][B] Matrix: A statistical method and software tool for linguistic analysis through corpus comparison

PE Rayson - 2003 - search.proquest.com
This thesis reports the development of a new kind of method and tool (Matrix) for advancing
the statistical analysis of electronic corpora of linguistic data. First, we describe the standard …

[PDF][PDF] The UAM CorpusTool: Software for corpus annotation and exploration

M O'Donnell - Proceedings of the XXVI Congreso de AESLA, 2008 - Citeseer
This paper describes the capabilities of the UAM CorpusTool, software for the annotation of
text corpora. The software allows the user to annotate a corpus of text files at a number of …

[PDF][PDF] MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora.

T Erjavec - LREC, 2004 - Citeseer
The paper presents the third edition of the MULTEXT-East language resources, a
multilingual dataset for language engineering research and development. This standardised …

[PDF][PDF] The Hungarian National Corpus.

T Váradi - LREC, 2002 - researchgate.net
The paper reports on the development of the Hungarian National Corpus, which was
completed at the end of 2001 after four years' effort. The HNC is designed to be a balanced …

[KSIĄŻKA][B] Corpus linguistics for online communication: A guide for research

L Collins - 2019 - taylorfrancis.com
Corpus Linguistics for Online Communication provides an instructive and practical guide to
conducting research using methods in corpus linguistics in studies of various forms of online …

MULTEXT-East: morphosyntactic resources for Central and Eastern European languages

T Erjavec - Language resources and evaluation, 2012 - Springer
The paper presents the MULTEXT-East language resources, a multilingual dataset for
language engineering research, focused on the morphosyntactic level of linguistic …