Harvesting Big Data in social science: A methodological approach for collecting online user-generated content
Online user-generated content is playing a progressively important role as information
source for social scientists seeking for digging out value. Advances procedures and …
source for social scientists seeking for digging out value. Advances procedures and …
Applying ontology learning and multi-objective ant colony optimization method for focused crawling to meteorological disasters domain knowledge
J Liu, Y Dong, Z Liu, D Chen - Expert Systems with Applications, 2022 - Elsevier
The focused crawler based on semantic analysis is a research hotspot in the field of
information retrieval. The domain ontology is generally applied to construct the topic model …
information retrieval. The domain ontology is generally applied to construct the topic model …
A novel focused crawler combining Web space evolution and domain ontology
J Liu, X Li, Q Zhang, G Zhong - Knowledge-based systems, 2022 - Elsevier
In many fields, how to catch the related-topic Web resources is crucial. As a vertical search
method, focused crawler has received great attention in recent years. Currently, most …
method, focused crawler has received great attention in recent years. Currently, most …
A web page distillation strategy for efficient focused crawling based on optimized Naïve bayes (ONB) classifier
The target of a focused crawler (FC) is to retrieve pages related to a specific domain of
interest (DOI). However, FCs may be hasted if bad links were injected into their crawling …
interest (DOI). However, FCs may be hasted if bad links were injected into their crawling …
[PDF][PDF] A survey about algorithms utilized by focused web crawler
Focused crawlers (also known as subject-oriented crawler), as the core part of vertical
search engine, collect topic-specific web pages as many as they can to form a subject …
search engine, collect topic-specific web pages as many as they can to form a subject …
A semantic and intelligent focused crawler based on semantic vector space model and membrane computing optimization algorithm
W Liu, Z Gan, T ** an effective focused crawler to retrieve data of Indian-origin scientists and utilizing text classification for comparative analysis.
This article presents the implementation of focused web crawling to retrieve data about
scientists of Indian ancestry who are working in foreign nations. This study demonstrates the …
scientists of Indian ancestry who are working in foreign nations. This study demonstrates the …