Machine knowledge: Creation and curation of comprehensive knowledge bases
Equip** machines with comprehensive knowledge of the world's entities and their
relationships has been a longstanding goal of AI. Over the last decade, large-scale …
relationships has been a longstanding goal of AI. Over the last decade, large-scale …
Web data extraction, applications and techniques: A survey
Abstract Web Data Extraction is an important problem that has been studied by means of
different scientific tools and in a broad range of applications. Many approaches to extracting …
different scientific tools and in a broad range of applications. Many approaches to extracting …
[LIVRE][B] Web data mining: exploring hyperlinks, contents, and usage data
B Liu - 2011 - Springer
Liu has written a comprehensive text on Web mining, which consists of two parts. The first
part covers the data mining and machine learning foundations, where all the essential …
part covers the data mining and machine learning foundations, where all the essential …
Information extraction
S Sarawagi - Foundations and Trends® in Databases, 2008 - nowpublishers.com
The automatic extraction of information from unstructured sources has opened up new
avenues for querying, organizing, and analyzing data by drawing upon the clean semantics …
avenues for querying, organizing, and analyzing data by drawing upon the clean semantics …
A brief survey of web data extraction tools
In the last few years, several works in the literature have addressed the problem of data
extraction from Web pages. The importance of this problem derives from the fact that, once …
extraction from Web pages. The importance of this problem derives from the fact that, once …
Data-Centric Systems and Applications
Data warehouses are databases of a specific kind that periodically collect information about
the activities being performed by an organization. This information is then accumulated over …
the activities being performed by an organization. This information is then accumulated over …
Semantic annotation for knowledge management: Requirements and a survey of the state of the art
While much of a company's knowledge can be found in text repositories, current content
management systems have limited capabilities for structuring and interpreting documents. In …
management systems have limited capabilities for structuring and interpreting documents. In …
Form-based ontology creation and information harvesting
Extracting data from web pages. User input is received defining a tabular form. User input is
received correlating portions of the form with user selected data items contained in one or …
received correlating portions of the form with user selected data items contained in one or …
Data extraction and label assignment for web databases
J Wang, FH Lochovsky - … of the 12th international conference on World …, 2003 - dl.acm.org
Many tools have been developed to help users query, extract and integrate data from web
pages generated dynamically from databases, ie, from the Hidden Web. A key prerequisite …
pages generated dynamically from databases, ie, from the Hidden Web. A key prerequisite …
Data-Centric Systems and Applications
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …
accessible data source in the world. Web mining aims to discover useful information or …