Theo dõi
Tomaž Erjavec
Tiêu đề
Trích dẫn bởi
Trích dẫn bởi
Năm
The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages
R Steinberger, B Pouliquen, A Widiger, C Ignat, T Erjavec, D Tufis, ...
arXiv preprint cs/0609058, 2006
7972006
MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora.
T Erjavec
LREC, 2004
3342004
Universal Dependencies 2.2
J Nivre, M Abrams, Ž Agić, L Ahrenberg, L Antonsen, MJ Aranzabe, ...
2532018
TEI P5: Guidelines for electronic text encoding and interchange
TEI Consortium, L Burnard, CM Sperberg-Mac Queen
1992008
Sense discrimination with parallel corpora
N Ide, T Erjavec, D Tufiş
Proceedings of the ACL-02 workshop on Word sense disambiguation: recent …, 2002
1752002
hrWaC and slWaC: Compiling web corpora for Croatian and Slovene
N Ljubešić, T Erjavec
Text, Speech and Dialogue: 14th International Conference, TSD 2011, Pilsen …, 2011
1532011
MULTEXT-East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages
L Dimitrova, N Ide, V Petkevic, T Erjavec, HJ Kaalep, D Tufis
Proceedings of the 36th Annual Meeting of the Association for Computational …, 1998
1521998
MULTEXT-East: morphosyntactic resources for Central and Eastern European languages
T Erjavec
Language resources and evaluation 46, 131-142, 2012
1372012
Towards a Slovene Dependency Treebank
S Džeroski, T Erjavec, N Ledinek, P Pajas, Ž Zdenek, A Žele
Proceedings of the 5th International Conference on Language Resources and …, 2006
1332006
Machine learning of morphosyntactic structure: Lemmatizing unknown Slovene words
T Erjavec, S Džeroski
Applied Artificial Intelligence 18 (1), 17-41, 2004
1142004
The ParlaMint corpora of parliamentary proceedings
T Erjavec, M Ogrodniczuk, P Osenova, N Ljubešić, K Simov, A Pančur, ...
Language resources and evaluation 57 (1), 415-448, 2023
1132023
Lemmagen: Multilingual lemmatisation with induced ripple-down rules
M Juršic, I Mozetic, T Erjavec, N Lavrac
Journal of Universal Computer Science 16 (9), 1190-1214, 2010
1132010
Legal framework, dataset and annotation schema for socially unacceptable online discourse practices in Slovene
D Fišer, T Erjavec, N Ljubešić
Proceedings of the first workshop on abusive language online, 46-51, 2017
1122017
Massive multi lingual corpus compilation: Acquis Communautaire and ToTaLe
T Erjavec, C Ignat, B Pouliquen, R Steinberger
ARCHIVES OF CONTROL SCIENCE 15 (4), 529, 2005
902005
Uvod v korpusno jezikoslovje
V Gorjanc, M Stabej, T Erjavec, I Saksida, S Kranjc
Izolit, 2005
842005
Designing and evaluating a Russian tagset
S Sharoff, M Kopotev, T Erjavec, A Feldman, D Divjak
832008
Korpusi slovenskega jezika Gigafida, KRES, ccGigafida in ccKRES: gradnja, vsebina, uporaba
N Logar, M Grčar, M Brakus, T Erjavec, ŠA Holdt, S Krek
Znanstvena založba Filozofske fakultete, 2020
752020
The JOS Linguistically Tagged Corpus of Slovene.
T Erjavec, D Fišer, S Krek, N Ledinek
LREC, 2010
702010
The MULTEXT East corpus.
T Erjavec, N Ide
LREC, 971-974, 1998
681998
Normalising Slovene data: historical texts vs. user-generated content
N Ljubešic, K Zupan, D Fišer, T Erjavec
Proceedings of the 13th Conference on Natural Language Processing (KONVENS …, 2016
632016
Hệ thống không thể thực hiện thao tác ngay bây giờ. Hãy thử lại sau.
Bài viết 1–20