DIADEM: thousands of websites to a single database

T Furche, G Gottlob, G Grasso, X Guo, G Orsi… - Proceedings of the …, 2014 - dl.acm.org
The web is overflowing with implicitly structured data, spread over hundreds of thousands of
sites, hidden deep behind search forms, or siloed in marketplaces, only accessible as …

Robust and noise resistant wrapper induction

T Furche, J Guo, S Maneth, C Schallhart - Proceedings of the 2016 …, 2016 - dl.acm.org
Wrapper induction is the problem of automatically inferring a query from annotated web
pages of the same template. This query should not only select the annotated content …

IBEX: harvesting entities from the web using unique identifiers

A Talaika, J Biega, A Amarilli… - Proceedings of the 18th …, 2015 - dl.acm.org
In this paper we study the prevalence of unique entity identifiers on the Web. These are, eg,
ISBNs (for books), GTINs (for commercial products), DOIs (for documents), email addresses …

Deriving intensional descriptions for web services

M Koutraki, D Vodislav, N Preda - … of the 24th ACM International on …, 2015 - dl.acm.org
Many data providers make their data available through Web service APIs. In order to
unleash the potential of these sources for intelligent applications, the data has to be …

WEIDJ: Development of a new algorithm for semi-structured web data extraction

IAA Sabri, M Man - … Telecommunication Computing Electronics …, 2021 - telkomnika.uad.ac.id
In the era of industrial digitalization, people are increasingly investing in solutions that allow
their process for data collection, data analysis and performance improvement. In this paper …

[PDF][PDF] A deep web data extraction model for web mining: a review

MMIAA Sabri, M Man - Indonesian Journal of Electrical Engineering …, 2021 - academia.edu
The world wide web has become a large pool of information. Extracting structured data from
a published webpages has drawn attention in the last decade. The process of web data …

Set of t-uples expansion by example

NAS Er, T Abdessalem, S Bressan - Proceedings of the 18th International …, 2016 - dl.acm.org
Set expansion is the task of finding elements of a set given example members. We are
interested in the design of algorithms and techniques for a set expansion tool that expands a …

Amber: Automatic supervision for multi-attribute extraction

T Furche, G Gottlob, G Grasso, G Orsi… - arxiv preprint arxiv …, 2012 - arxiv.org
The extraction of multi-attribute objects from the deep web is the bridge between the
unstructured web and structured data. Existing approaches either induce wrappers from a …

Filipo: A sample driven approach for finding linkage points between RDF data and APIs

T Zeimetz, R Schenkel - Advances in Databases and Information Systems …, 2021 - Springer
Data integration is an important task in order to create comprehensive RDF knowledge
bases. Many data sources are used to extend a given dataset or to correct errors. Since …

Query rewriting using views: a theoretical and practical perspective

I Ileana - 2014 - pastel.hal.science
In this work, we address the problem of query rewriting using views, by adopting both a
theoretical and a pragmatic perspective. In the first and main chapter, we approach the topic …