Web page classification: Features and algorithms

X Qi, BD Davison - ACM computing surveys (CSUR), 2009 - dl.acm.org
Classification of Web page content is essential to many tasks in Web information retrieval
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …

Towards website domain name classification using graph based semi-supervised learning

A Faroughi, A Morichetta, L Vassio, F Figueiredo… - Computer Networks, 2021 - Elsevier
In this work, we tackle the problem of classifying websites domain names to a category, eg,
map** bbc. com to the” News and Media” class. Domain name classification is …

Temporal pagerank on social networks

W Hu, H Zou, Z Gong - Web Information Systems Engineering–WISE 2015 …, 2015 - Springer
Social network has been a widely accepted way for people to communicate and interact
online. However, few of existing works studied temporal dimension in assessing the …

Query recommendation in the information domain of children

SD Torres, D Hiemstra, I Weber… - Journal of the …, 2014 - Wiley Online Library
Children represent an increasing group of web users. Some of the key problems that
hamper their search experience is their limited vocabulary, their difficulty in using the right …

[ΒΙΒΛΙΟ][B] Web page classification and hierarchy adaptation

XG Qi - 2012 - search.proquest.com
Classification is a supervised learning problem in which a classifier is trained on a set of
data labeled with predefined categories and then applied to label future examples. It plays a …

Semantic Enrichment of Knowledge Sources Supported by Domain Ontologies

RDD da Costa - 2014 - search.proquest.com
This thesis introduces a novel conceptual framework to support the creation of knowledge
representations based on enriched Semantic Vectors, using the classical vector space …

Information retrieval for children: search behavior and solutions

SD Torres - 2014 - research.utwente.nl
Nowadays, children of very young ages and teenagers use the Internet extensively for
entertainment and educational purposes. The number of active young users in the Internet is …

Technique basée HITS/SVM pour la réduction et la pondération des caractéristiques des pages Web

MN MEADI - 2017 - thesis.univ-biskra.dz
Le nombre de pages Web publiées sur le World Wide Web est estimé des centaines de
millions. La fouille de ces pages demande un effort intellectuel incroyable qui dépasse les …

Research on classification algorithm and its application in cased–based reasoning

K Gao, H Zhang, S Li, W Wang… - International journal of …, 2013 - inderscienceonline.com
Cased–based reasoning has been used in decision application with less domain
knowledge. This paper presents the novel classification algorithm, which is regarded as the …

[PDF][PDF] Exploiting links and text structure on the Web: a quantitative approach to improving search quality

C Kohlschütter - 2011 - repo.uni-hannover.de
As Web search is becoming a routine activity in our daily lives, users scale up their
expectations concerning Search Quality. This comprises factors such as accuracy, coverage …