A survey of text representation and embedding techniques in nlp
Natural Language Processing (NLP) is a research field where a language in consideration
is processed to understand its syntactic, semantic, and sentimental aspects. The …
is processed to understand its syntactic, semantic, and sentimental aspects. The …
Skills prediction based on multi-label resume classification using CNN with model predictions explanation
Skills extraction is a critical task when creating job recommender systems. It is also useful for
building skills profiles and skills knowledge bases for organizations. The aim of skills …
building skills profiles and skills knowledge bases for organizations. The aim of skills …
A comparative analysis on question classification task based on deep learning approaches
Question classification is one of the essential tasks for automatic question answering
implementation in natural language processing (NLP). Recently, there have been several …
implementation in natural language processing (NLP). Recently, there have been several …
A word embedding-based method for unsupervised adaptation of cooking recipes
Studying food recipes is indispensable to understand the science of cooking. An essential
problem in food computing is the adaptation of recipes to user needs and preferences. The …
problem in food computing is the adaptation of recipes to user needs and preferences. The …
Urban dictionary embeddings for slang NLP applications
The choice of the corpus on which word embeddings are trained can have a sizable effect
on the learned representations, the types of analyses that can be performed with them, and …
on the learned representations, the types of analyses that can be performed with them, and …
Petro NLP: Resources for natural language processing and information extraction for the oil and gas industry
Most companies struggle to find and extract relevant information from their technical
documents. In particular, the Oil and Gas (O&G) industry faces the challenge of dealing with …
documents. In particular, the Oil and Gas (O&G) industry faces the challenge of dealing with …
A compendium and evaluation of taxonomy quality attributes
Introduction Taxonomies capture knowledge about a particular domain in a succinct manner
and establish a common understanding among peers. Researchers use taxonomies to …
and establish a common understanding among peers. Researchers use taxonomies to …
Forecasting SQL query cost at Twitter
With the advent of the Big Data era, it is usually computationally expensive to calculate the
resource usages of a SQL query with traditional DBMS approaches. Can we estimate the …
resource usages of a SQL query with traditional DBMS approaches. Can we estimate the …
Towards understanding the impacts of textual dissimilarity on duplicate bug report detection
About 40% of software bug reports are duplicates of one another, which pose a major
overhead during software maintenance. Traditional techniques often focus on detecting …
overhead during software maintenance. Traditional techniques often focus on detecting …
Embeddings for named entity recognition in geoscience Portuguese literature
This work focuses on Portuguese Named Entity Recognition (NER) in the Geology domain.
The only domain-specific dataset in the Portuguese language annotated for NER is the …
The only domain-specific dataset in the Portuguese language annotated for NER is the …