[BOK][B] Wrapper induction for information extraction

N Kushmerick - 1997 - search.proquest.com
The Internet presents numerous sources of useful information--telephone directories,
product catalogs, stock quotes, weather forecasts, etc. Recently, many systems have been …

[PDF][PDF] Data integration: The teenage years

A Halevy, A Rajaraman, J Ordille - … conference on Very large data bases, 2006 - cin.ufpe.br
Data integration is a pervasive challenge faced in applications that need to query across
multiple autonomous and heterogeneous data sources. Data integration is crucial in large …

Querying heterogeneous information sources using source descriptions

A Levy, A Rajaraman, J Ordille - 1996 - ilpubs.stanford.edu
We witness a rapid increase in the number of structured information sources that are
available online, especially on the WWW. These sources include commercial databases on …

Wrapper induction: Efficiency and expressiveness

N Kushmerick - Artificial intelligence, 2000 - Elsevier
The Internet presents numerous sources of useful information—telephone directories,
product catalogs, stock quotes, event listings, etc. Recently, many systems have been built …

Optimizing queries across diverse data sources

L Haas, D Kossmann, E Wimmers, J Yang - 1997 - ilpubs.stanford.edu
Businesses today need to interrelate data stored in diverse systems with differing
capabilities, ideally via a single high-level query interface. W e present the design of a query …

Scaling access to heterogeneous data sources with DISCO

A Tomasic, L Raschid… - IEEE Transactions on …, 1998 - ieeexplore.ieee.org
Accessing many data sources aggravates problems for users of heterogeneous distributed
databases. Database administrators must deal with fragile mediators, that is, mediators with …

A framework for supporting data integration using the materialized and virtual approaches

R Hull, G Zhou - Proceedings of the 1996 ACM SIGMOD international …, 1996 - dl.acm.org
This paper presents a framework for data integration currently under development in the
Squirrel project. The framework is based on a special class of mediators, called Squirrel …

Query answering algorithms for information agents

A Levy, A Rajaraman, J Ordille - 1996 - ilpubs.stanford.edu
The database theory community, centered around the PODS (Principles of Database
Systems) conference has had a long-term interest in logic as a way to represent" data,"" …

Scaling heterogeneous databases and the design of disco

A Tomasic, L Raschid… - Proceedings of 16th …, 1996 - ieeexplore.ieee.org
Access to large numbers of data sources introduces new problems for users of
heterogeneous distributed databases. End users and application programmers must deal …

[PDF][PDF] Planning to gather information

CT Kwok, DS Weld - PROCEEDINGS OF THE NATIONAL …, 1996 - Citeseer
The exponential growth of the Internet has produced a labyrinth of documents, databases
and services. While almost any type of information is available somewhere, even expert …