Harvesting Big Data in social science: A methodological approach for collecting online user-generated content

M Olmedilla, MR Martínez-Torres, SL Toral - Computer Standards & …, 2016 - Elsevier
Online user-generated content is playing a progressively important role as information
source for social scientists seeking for digging out value. Advances procedures and …

Applying ontology learning and multi-objective ant colony optimization method for focused crawling to meteorological disasters domain knowledge

J Liu, Y Dong, Z Liu, D Chen - Expert Systems with Applications, 2022 - Elsevier
The focused crawler based on semantic analysis is a research hotspot in the field of
information retrieval. The domain ontology is generally applied to construct the topic model …

A novel focused crawler combining Web space evolution and domain ontology

J Liu, X Li, Q Zhang, G Zhong - Knowledge-based systems, 2022 - Elsevier
In many fields, how to catch the related-topic Web resources is crucial. As a vertical search
method, focused crawler has received great attention in recent years. Currently, most …

A web page distillation strategy for efficient focused crawling based on optimized Naïve bayes (ONB) classifier

AI Saleh, AE Abulwafa, MF Al Rahmawy - Applied Soft Computing, 2017 - Elsevier
The target of a focused crawler (FC) is to retrieve pages related to a specific domain of
interest (DOI). However, FCs may be hasted if bad links were injected into their crawling …

[PDF][PDF] A survey about algorithms utilized by focused web crawler

YB Yu, SL Huang, N Tashi, H Zhang… - Journal of Electronic …, 2018 - journal.uestc.edu.cn
Focused crawlers (also known as subject-oriented crawler), as the core part of vertical
search engine, collect topic-specific web pages as many as they can to form a subject …

A semantic and intelligent focused crawler based on semantic vector space model and membrane computing optimization algorithm

W Liu, Z Gan, T ** an effective focused crawler to retrieve data of Indian-origin scientists and utilizing text classification for comparative analysis.
S Gautam, R Bhatia, S Jain - International Journal of …, 2024 - search.ebscohost.com
This article presents the implementation of focused web crawling to retrieve data about
scientists of Indian ancestry who are working in foreign nations. This study demonstrates the …