Web page classification: Features and algorithms

X Qi, BD Davison - ACM computing surveys (CSUR), 2009 - dl.acm.org
Classification of Web page content is essential to many tasks in Web information retrieval
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …

Collective data-sanitization for preventing sensitive information inference attacks in social networks

Z Cai, Z He, X Guan, Y Li - IEEE Transactions on Dependable …, 2016 - ieeexplore.ieee.org
Releasing social network data could seriously breach user privacy. User profile and
friendship relations are inherently private. Unfortunately, sensitive information may be …

Graph based anomaly detection and description: a survey

L Akoglu, H Tong, D Koutra - Data mining and knowledge discovery, 2015 - Springer
Detecting anomalies in data is a vital task, with numerous high-impact applications in areas
such as security, finance, health care, and law enforcement. While numerous techniques …

A survey of text classification algorithms

CC Aggarwal, CX Zhai - Mining text data, 2012 - Springer
The problem of classification has been widely studied in the data mining, machine learning,
database, and information retrieval communities with applications in a number of diverse …

Collective classification in network data

P Sen, G Namata, M Bilgic, L Getoor, B Galligher… - AI magazine, 2008 - ojs.aaai.org
Many real-world applications produce networked data such as the world-wide web
(hypertext documents connected via hyperlinks), social networks (for example, people …

Relational learning via latent social dimensions

L Tang, H Liu - Proceedings of the 15th ACM SIGKDD international …, 2009 - dl.acm.org
Social media such as blogs, Facebook, Flickr, etc., presents data in a network format rather
than classical IID distribution. To address the interdependency among data instances …

Link-based classification

S Bandyopadhyay, U Maulik, LB Holder… - Advanced methods for …, 2005 - Springer
A key challenge for machine learning is the problem of mining richly structured data sets,
where the objects are linked in some way due to either an explicit or implicit relationship that …

Leveraging social media networks for classification

L Tang, H Liu - Data mining and knowledge discovery, 2011 - Springer
Social media has reshaped the way in which people interact with each other. The rapid
development of participatory web and social networking sites like YouTube, Twitter, and …

[PDF][PDF] Classification in networked data: A toolkit and a univariate case study.

SA Macskassy, F Provost - Journal of machine learning research, 2007 - jmlr.org
This paper1 is about classifying entities that are interlinked with entities for which the class is
known. After surveying prior work, we present NetKit, a modular toolkit for classification in …

[PDF][PDF] Relational dependency networks.

J Neville, D Jensen - Journal of Machine Learning Research, 2007 - jmlr.org
Recent work on graphical models for relational data has demonstrated significant
improvements in classification and inference when models represent the dependencies …