Web page classification: Features and algorithms
Classification of Web page content is essential to many tasks in Web information retrieval
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …
Collective data-sanitization for preventing sensitive information inference attacks in social networks
Releasing social network data could seriously breach user privacy. User profile and
friendship relations are inherently private. Unfortunately, sensitive information may be …
friendship relations are inherently private. Unfortunately, sensitive information may be …
Graph based anomaly detection and description: a survey
Detecting anomalies in data is a vital task, with numerous high-impact applications in areas
such as security, finance, health care, and law enforcement. While numerous techniques …
such as security, finance, health care, and law enforcement. While numerous techniques …
A survey of text classification algorithms
The problem of classification has been widely studied in the data mining, machine learning,
database, and information retrieval communities with applications in a number of diverse …
database, and information retrieval communities with applications in a number of diverse …
Collective classification in network data
Many real-world applications produce networked data such as the world-wide web
(hypertext documents connected via hyperlinks), social networks (for example, people …
(hypertext documents connected via hyperlinks), social networks (for example, people …
Relational learning via latent social dimensions
Social media such as blogs, Facebook, Flickr, etc., presents data in a network format rather
than classical IID distribution. To address the interdependency among data instances …
than classical IID distribution. To address the interdependency among data instances …
Link-based classification
A key challenge for machine learning is the problem of mining richly structured data sets,
where the objects are linked in some way due to either an explicit or implicit relationship that …
where the objects are linked in some way due to either an explicit or implicit relationship that …
Leveraging social media networks for classification
Social media has reshaped the way in which people interact with each other. The rapid
development of participatory web and social networking sites like YouTube, Twitter, and …
development of participatory web and social networking sites like YouTube, Twitter, and …
[PDF][PDF] Classification in networked data: A toolkit and a univariate case study.
This paper1 is about classifying entities that are interlinked with entities for which the class is
known. After surveying prior work, we present NetKit, a modular toolkit for classification in …
known. After surveying prior work, we present NetKit, a modular toolkit for classification in …
[PDF][PDF] Relational dependency networks.
Recent work on graphical models for relational data has demonstrated significant
improvements in classification and inference when models represent the dependencies …
improvements in classification and inference when models represent the dependencies …