[HTML][HTML] Strategies and principles of distributed machine learning on big data

EP **ng, Q Ho, P **e, D Wei - Engineering, 2016 - Elsevier
The rise of big data has led to new demands for machine learning (ML) systems to learn
complex models, with millions to billions of parameters, that promise adequate capacity to …

Learning causality for news events prediction

K Radinsky, S Davidovich, S Markovitch - Proceedings of the 21st …, 2012 - dl.acm.org
The problem we tackle in this work is, given a present news event, to generate a plausible
future event that can be caused by the given event. We present a new methodology for …

Dirichlet-hawkes processes with applications to clustering continuous-time document streams

N Du, M Farajtabar, A Ahmed, AJ Smola… - Proceedings of the 21th …, 2015 - dl.acm.org
Clusters in document streams, such as online news articles, can be induced by their textual
contents, as well as by the temporal dynamics of their arriving patterns. Can we leverage …

Mining the web to predict future events

K Radinsky, E Horvitz - Proceedings of the sixth ACM international …, 2013 - dl.acm.org
We describe and evaluate methods for learning to forecast forthcoming events of interest
from a corpus containing 22 years of news stories. We consider the examples of identifying …

Topic evolution based on the probabilistic topic model: a review

H Zhou, H Yu, R Hu - Frontiers of Computer Science, 2017 - Springer
Accurately representing the quantity and characteristics of users' interest in certain topics is
an important problem facing topic evolution researchers, particularly as it applies to modern …

Trains of thought: Generating information maps

D Shahaf, C Guestrin, E Horvitz - … of the 21st international conference on …, 2012 - dl.acm.org
When information is abundant, it becomes increasingly difficult to fit nuggets of knowledge
into a single coherent picture. Complex stories spaghetti into branches, side stories, and …

Hierarchical geographical modeling of user locations from social media posts

A Ahmed, L Hong, AJ Smola - … of the 22nd international conference on …, 2013 - dl.acm.org
With the availability of cheap location sensors, geotagging of messages in online social
networks is proliferating. For instance, Twitter, Facebook, Foursquare, and Google+ provide …

Using paraphrases for improving first story detection in news and Twitter

S Petrovic, M Osborne, V Lavrenko - … : Conference of the North …, 2012 - research.ed.ac.uk
First story detection (FSD) involves identifying first stories about events from a continuous
stream of documents. A major problem in this task is the high degree of lexical variation in …

Document-based topic coherence measures for news media text

D Korenčić, S Ristov, J Šnajder - Expert systems with Applications, 2018 - Elsevier
There is a rising need for automated analysis of news text, and topic models have proven to
be useful tools for this task. However, as the quality of the topics induced by topic models …

Cluster-discovery of Twitter messages for event detection and trending

SB Kaleel, A Abhari - Journal of computational science, 2015 - Elsevier
Social media data carries abundant hidden occurrences of real-time events. In this paper, a
novel methodology is proposed for detecting and trending events from tweet clusters that are …