Natural language based financial forecasting: a survey
Natural language processing (NLP), or the pragmatic research perspective of computational
linguistics, has become increasingly powerful due to data availability and various …
linguistics, has become increasingly powerful due to data availability and various …
Documenting large webtext corpora: A case study on the colossal clean crawled corpus
Large language models have led to remarkable progress on many NLP tasks, and
researchers are turning to ever-larger text corpora to train them. Some of the largest corpora …
researchers are turning to ever-larger text corpora to train them. Some of the largest corpora …
Econometrics meets sentiment: An overview of methodology and applications
The advent of massive amounts of textual, audio, and visual data has spurred the
development of econometric methodology to transform qualitative sentiment data into …
development of econometric methodology to transform qualitative sentiment data into …
Beneath the tip of the iceberg: Current challenges and new directions in sentiment analysis research
Sentiment analysis as a field has come a long way since it was first introduced as a task
nearly 20 years ago. It has widespread commercial applications in various domains like …
nearly 20 years ago. It has widespread commercial applications in various domains like …
[HTML][HTML] Tracking COVID-19 discourse on Twitter in North America: infodemiology study using topic modeling and aspect-based sentiment analysis
Background Social media is a rich source where we can learn about people's reactions to
social issues. As COVID-19 has impacted people's lives, it is essential to capture how …
social issues. As COVID-19 has impacted people's lives, it is essential to capture how …
Diachronic word embeddings and semantic shifts: a survey
Recent years have witnessed a surge of publications aimed at tracing temporal changes in
lexical semantics using distributional methods, particularly prediction-based word …
lexical semantics using distributional methods, particularly prediction-based word …
Community interaction and conflict on the web
Users organize themselves into communities on web platforms. These communities can
interact with one another, often leading to conflicts and toxic interactions. However, little is …
interact with one another, often leading to conflicts and toxic interactions. However, little is …
A hierarchical model of reviews for aspect-based sentiment analysis
Opinion mining from customer reviews has become pervasive in recent years. Sentences in
reviews, however, are usually classified independently, even though they form part of a …
reviews, however, are usually classified independently, even though they form part of a …
[HTML][HTML] An emotion and cognitive based analysis of mental health disorders from social media data
Mental disorders can severely affect quality of life, constitute a major predictive factor of
suicide, and are usually underdiagnosed and undertreated. Early detection of signs of …
suicide, and are usually underdiagnosed and undertreated. Early detection of signs of …
Analyzing polarization in social media: Method and application to tweets on 21 mass shootings
We provide an NLP framework to uncover four linguistic dimensions of political polarization
in social media: topic choice, framing, affect and illocutionary force. We quantify these …
in social media: topic choice, framing, affect and illocutionary force. We quantify these …