Taja Kuzman

Citations

	All	Since 2020
Citations	417	380
h-index	8	8
i10-index	8	7

Citations per year
Year	2016	2017	2018	2019	2020	2021	2022	2023	2024	2025
Citations	8	8	7	11	6	13	23	124	188	20

Public access

15 articles available, 1 article not available

Based on funding mandates

Co-authors

Nikola Ljubešić	Researcher at Jožef Stefan Institute	Verified email at ijs.si
Peter Rupnik	Jožef Stefan Institute	Verified email at ijs.si
Rik van Noord	Assistant professor in Humane AI and NLP, University of Groningen	Verified email at rug.nl
Antonio Toral	Assistant Professor, University of Groningen	Verified email at rug.nl
Gema Ramírez-Sánchez	CEO at Prompsit Language Engineering, computational linguist	Verified email at prompsit.com
Vít Suchomel	Masaryk University and Lexical Computing Ltd.	Verified email at mail.muni.cz
Mikel L. Forcada (ORCID 0000-0003-0...	Professor of Computer Languages and Systems, Universitat d'Alacant	Verified email at ua.es
Leopoldo Pla Sempere	Universidad de Alicante	Verified email at dlsi.ua.es
Marta Bañón	Prompsit Language Engineering	Verified email at prompsit.com
Simon Krek	Researcher at Jožef Stefan Institute	Verified email at ijs.si
Tomaž Erjavec	Jožef Stefan Institute	Verified email at ijs.si
Miquel Esplà-Gomis	Universitat d'Alacant	Verified email at dlsi.ua.es
Jaka Čibej	Researcher, University of Ljubljana	Verified email at ff.uni-lj.si
Polona Gantar	Researcher, Faculty of Arts, Ljubljana, Slovenia	Verified email at guest.arnes.si
Kaja Dobrovoljc	Research Associate, University of Ljubljana & Jozef Stefan Institute	Verified email at ijs.si
Špela Vintar	Full Professor, University of Ljubljana	Verified email at ff.uni-lj.si
Špela Arhar Holdt	Research Associate at University of Ljubljana, Slovenia	Verified email at cjvt.si
Mihael Arcan	Co-Founder and Chief Scientific Officer (CSO) @ Lua Health	Verified email at luahealth.io
Darja Fišer	Assistant Professor, University of Ljubljana	Verified email at ff.uni-lj.si
Mojca Brglez	Junior Researcher / PhD student, University of Ljubljana	Verified email at ff.uni-lj.si


Taja Kuzman

Department of Knowledge Technologies, Jožef Stefan Institute

Verified email at ijs.si

computational linguistics, language technology, natural language processing, web corpora, genre identification


Title	Authors	Venue	Cited by	Year
Automatic genre identification: a survey	T Kuzman, N Ljubešić	Language Resources and Evaluation, 1-34, 2023	119	2023
Neural machine translation of literary texts from English to Slovene	T Kuzman, Š Vintar, M Arcan	Proceedings of the qualities of literary machine translation, 1-9, 2019	39	2019
Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.0	T Erjavec, M Ogrodniczuk, P Osenova, N Ljubešić, K Simov, V Grigorova, ...	CLARIN ERIC, 2021	35*	2021
ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification	T Kuzman, I Mozetič, N Ljubešić	arXiv preprint arXiv:2303.03953, 2023	30*	2023
MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages	M Bañón, M Esplà-Gomis, ML Forcada, C García-Romero, T Kuzman, ...	23rd Annual Conference of the European Association for Machine Translation …, 2022	22	2022
Training corpus ssj500k 1.3. Slovenian language resource repository CLARIN.SI	S Krek, T Erjavec, K Dobrovoljc, S Može, N Ledinek, N Holz		20	2013
Automatic genre identification for robust enrichment of massive text collections: Investigation of classification methods in the era of large language models	T Kuzman, I Mozetič, N Ljubešić	Machine Learning and Knowledge Extraction 5 (3), 1149-1175, 2023	15	2023
The GINCO training dataset for web genre identification of documents out in the wild	T Kuzman, P Rupnik, N Ljubešić	arXiv preprint arXiv:2201.03857, 2022	15	2022
CLASSLA-web: Comparable Web Corpora of South Slavic Languages Enriched with Linguistic and Genre Annotation	N Ljubešić, T Kuzman	arXiv preprint arXiv:2403.12721, 2024	8	2024
BENCHić-lang: A benchmark for discriminating between Bosnian, Croatian, Montenegrin and Serbian	P Rupnik, T Kuzman, N Ljubešić	Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial …, 2023	8	2023
Training corpus ssj500k 1.4	S Krek, K Dobrovoljc, T Erjavec, S Može, N Ledinek, N Holz	Centre for Language Resources and Technologies, University of Ljubljana, 2015	8	2015
Get to know your parallel data: Performing English variety and genre classification over MaCoCu corpora	T Kuzman, P Rupnik, N Ljubešić	Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial …, 2023	7	2023
Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages	R van Noord, T Kuzman, P Rupnik, N Ljubešić, M Esplà-Gomis, ...	arXiv preprint arXiv:2403.08693, 2024	6	2024
Assessing comparability of genre datasets via cross-lingual and cross-dataset experiments	T Kuzman, N Ljubešić, S Pollak	Jezikovne tehnologije in digitalna humanistika: zbornik konference …, 2022	6	2022
Verbal multiword expressions in Slovene	P Gantar, S Krek, T Kuzman	Computational and Corpus-Based Phraseology: Second International Conference …, 2017	6	2017
ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification. March 7, 2023	T Kuzman, I Mozetič, N Ljubešić	Reference Source, 0	6	
Slovene-English parallel corpus MaCoCu-sl-en 2.0	M Bañón, M Chichirau, M Esplà-Gomis, ML Forcada, A Galiano-Jiménez, ...	Jožef Stefan Institute, 2023	5	2023
Choice of plausible alternatives dataset in Serbian COPA-SR	N Ljubešić, M Starović, T Kuzman, T Samardžić	Jožef Stefan Institute, 2022	5	2022
Exploring the Impact of Lexical and Grammatical Features on Automatic Genre Identification	T Kuzman, N Ljubešić	Proceedings of the Odkrivanje Znanja in Podatkovna Skladišča - SiKDD …, 2022	5	2022
Annotated Corpora and Tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions	C Ramisch, SR Cordeiro, A Savary, V Vincze, V Barbu Mititelu, A Bhatia, ...	LINDAT/CLARIAH-CZ Digital Library at the Institute of Formal and Applied …, 2018	5	2018
