Web page classification: Features and algorithms

X Qi, BD Davison - ACM computing surveys (CSUR), 2009 - dl.acm.org
Classification of Web page content is essential to many tasks in Web information retrieval
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …

Scholarly paper recommendation via user's recent research interests

K Sugiyama, MY Kan - Proceedings of the 10th annual joint conference …, 2010 - dl.acm.org
We examine the effect of modeling a researcher's past works in recommending scholarly
papers to the researcher. Our hypothesis is that an author's published works constitute a …

Graph based anomaly detection and description: a survey

L Akoglu, H Tong, D Koutra - Data mining and knowledge discovery, 2015 - Springer
Detecting anomalies in data is a vital task, with numerous high-impact applications in areas
such as security, finance, health care, and law enforcement. While numerous techniques …

Fakedetector: Effective fake news detection with deep diffusive neural network

J Zhang, B Dong, PS Yu - 2020 IEEE 36th international …, 2020 - ieeexplore.ieee.org
In recent years, due to the booming development of online social networks, fake news for
various commercial and political purposes has been appearing in large numbers and …

What yelp fake review filter might be doing?

A Mukherjee, V Venkataraman, B Liu… - Proceedings of the …, 2013 - ojs.aaai.org
Online reviews have become a valuable resource for decision making. However, its
usefulness brings forth a curse‒deceptive opinion spam. In recent years, fake review …

Detecting spammers on twitter

F Benevenuto, G Magno, T Rodrigues… - … messaging, anti-abuse …, 2010 - dcc.ufmg.br
With millions of users tweeting around the world, real time search systems and different
types of mining tools are emerging to allow people tracking the repercussion of events and …

Counterfactual inference for text classification debiasing

C Qian, F Feng, L Wen, C Ma, P Xie - Proceedings of the 59th …, 2021 - aclanthology.org
Today's text classifiers inevitably suffer from unintended dataset biases, especially the
document-level label bias and word-level keyword bias, which may hurt models' …

Don't follow me: Spam detection in twitter

AH Wang - 2010 international conference on security and …, 2010 - ieeexplore.ieee.org
The rapidly growing social network Twitter has been infiltrated by large amount of spam. In
this paper, a spam detection prototype system is proposed to identify suspicious users on …

Knowledge-based trust: Estimating the trustworthiness of web sources

XL Dong, E Gabrilovich, K Murphy, V Dang… - arXiv preprint arXiv …, 2015 - arxiv.org
The quality of web sources has been traditionally evaluated using exogenous signals such
as the hyperlink structure of the graph. We propose a new approach that relies on …

Understanding and combating link farming in the twitter social network

S Ghosh, B Viswanath, F Kooti, NK Sharma… - Proceedings of the 21st …, 2012 - dl.acm.org
Recently, Twitter has emerged as a popular platform for discovering real-time information on
the Web, such as news stories and people's reaction to them. Like the Web, Twitter has …