A survey of semantic relatedness evaluation datasets and procedures

MA Hadj Taieb, T Zesch, M Ben Aouicha - Artificial Intelligence Review, 2020 - Springer
Semantic relatedness between words is a core concept in natural language processing.
While countless approaches have been proposed, measuring which one works best is still a …

SYN2015: representative corpus of written Czech

M Křen, V Cvrček, T Čapka, A Čermáková, M Hnátková… - 2015 - lindat.mff.cuni.cz
3. Prepis muze být rozdelen do vıce souboru. Soubory pak musı být vzestupne ocıslovány a
název souboru pak musı koncit cıslem souboru.(Naprıklad: Rozhovor s Janem Sokolem 1 …

Antonyms are similar: Towards paradigmatic association approach to rating similarity in SimLex-999 and WordSim-353

T Kliegr, O Zamazal - Data & Knowledge Engineering, 2018 - Elsevier
SimLex-999 is a widely used lexical resource for tracking progress in word similarity
computation. It anchors similarity in synonymy, while other researchers such as Agirre et …

Czech dataset for semantic textual similarity

L Svoboda, T Brychcín - International conference on text, speech, and …, 2018 - Springer
Semantic textual similarity is the core shared task at the International Workshop on Semantic
Evaluation (SemEval). It focuses on sentence meaning comparison. So far, most of the …

[PDF][PDF] Finely tuned, 2 billion token based word embeddings for portuguese

J Rodrigues, A Branco - Proceedings of the eleventh international …, 2018 - aclanthology.org
A distributional semantics model—also known as word embeddings—is a major asset for
any language as the research results reported in the literature have consistently shown that …

Enriching word embeddings with global information and testing on highly inflected language

L Svoboda, T Brychcín - Computación y Sistemas, 2019 - scielo.org.mx
In this paper we evaluate our new approach based on the Continuous Bag-of-Words and
Skip-gram models enriched with global context information on highly inflected Czech …

An evaluation of Czech word embeddings

K Hořeňovská - Proceedings of the 22nd Nordic Conference on …, 2019 - aclanthology.org
We present an evaluation of Czech low-dimensional distributed word representations, also
known as word embeddings. We describe five different approaches to training the models …

Evaluating Quality of Word Embeddings with Sentiment Polarity Identification Task

V Indurthi, SR Oota - … Web Challenges: 5th SemWebEval Challenge at …, 2018 - Springer
Neural word embeddings have been widely used in modern NLP applications as they
provide vector representation of words and capture the semantic properties of words and the …

Distribuční sémantika s využitím neuronových sítí

L Svoboda - 2020 - dspace.zcu.cz
V posledních letech vykazují metody založené na neuronových sítích zásadní zlepšení v
zachycení sémantiky a syntaxe slov nebo vět. Mnoho bylo vyzkoumáno o vnoření anglických …

[HTML][HTML] Word Meaning Representation Improvement Using Wikipedia Categories

L Svoboda, T Brychcín, V Matoušek - Slavonic Natural Language …, 2019 - books.google.com
Abstract Extension of Skip-Gram and Continuous Bag-of-Words models via global context
information is presented in this paper. Wikipedia corpus where articles are organized in a …