Understandable big data: a survey

CK Emani, N Cullot, C Nicolle - Computer science review, 2015 - Elsevier
This survey presents the concept of Big Data. Firstly, a definition and the features of Big Data
are given. Secondly, the different steps for Big Data data processing and the main problems …

Location reference recognition from texts: A survey and comparison

X Hu, Z Zhou, H Li, Y Hu, F Gu, J Kersten, H Fan… - ACM Computing …, 2023 - dl.acm.org
A vast amount of location information exists in unstructured texts, such as social media
posts, news stories, scientific articles, web pages, travel blogs, and historical archives …

A replicable comparison study of NER software: StanfordNLP, NLTK, OpenNLP, SpaCy, Gate

X Schmitt, S Kubler, J Robert… - … conference on social …, 2019 - ieeexplore.ieee.org
Named Entity Recognition (NER) is a key building block of any Natural Language
Processing (NLP) system, making possible the detection and classification of entities (eg …

Exploiting context for rumour detection in social media

A Zubiaga, M Liakata, R Procter - … SocInfo 2017, Oxford, UK, September 13 …, 2017 - Springer
Tools that are able to detect unverified information posted on social media during a news
event can help to avoid the spread of rumours that turn out to be false. In this paper we …

Who cares about sarcastic tweets? investigating the impact of sarcasm on sentiment analysis

DG Maynard, MA Greenwood - LREC 2014 proceedings, 2014 - eprints.whiterose.ac.uk
Sarcasm is a common phenomenon in social media, and is inherently difficult to analyse, not
just automatically but often for humans too. It has an important effect on sentiment, but is …

Twitter as a lifeline: Human-annotated twitter corpora for NLP of crisis-related messages

M Imran, P Mitra, C Castillo - arXiv preprint arXiv:1605.05894, 2016 - arxiv.org
Microblogging platforms such as Twitter provide active communication channels during
mass convergence and emergency events such as earthquakes, typhoons. During the …

Learning reporting dynamics during breaking news for rumour detection in social media

A Zubiaga, M Liakata, R Procter - arXiv preprint arXiv:1610.07363, 2016 - arxiv.org
Breaking news leads to situations of fast-paced reporting in social media, producing all
kinds of updates related to news stories, albeit with the caveat that some of those early …

Twitter part-of-speech tagging for all: Overcoming sparse and noisy data

L Derczynski, A Ritter, S Clark… - Proceedings of the …, 2013 - aclanthology.org
Part-of-speech information is a pre-requisite in many NLP algorithms. However, Twitter text
is difficult to part-of-speech tag: it is noisy, with linguistic errors and idiosyncratic style. We …

Location extraction from social media: Geoparsing, location disambiguation, and geotagging

SE Middleton, G Kordopatis-Zilos… - ACM Transactions on …, 2018 - dl.acm.org
Location extraction, also called “toponym extraction,” is a field covering geoparsing,
extracting spatial representations from location mentions in text, and geotagging, assigning …

Discourse-aware rumour stance classification in social media using sequential classifiers

A Zubiaga, E Kochkina, M Liakata, R Procter… - Information Processing …, 2018 - Elsevier
Rumour stance classification, defined as classifying the stance of specific social media posts
into one of supporting, denying, querying or commenting on an earlier post, is becoming of …