A brief survey of web data extraction tools

AHF Laender, BA Ribeiro-Neto, AS Da Silva… - ACM Sigmod …, 2002 - dl.acm.org
In the last few years, several works in the literature have addressed the problem of data
extraction from Web pages. The importance of this problem derives from the fact that, once …

Towards structured sharing of raw and derived neuroimaging data across existing resources

DB Keator, K Helmer, J Steffener, JA Turner… - Neuroimage, 2013 - Elsevier
Data sharing efforts increasingly contribute to the acceleration of scientific discovery.
Neuroimaging data is accumulating in distributed domain-specific databases and there is …

[Book][B] XML in a nutshell: a desktop quick reference

ER Harold, WS Means - 2004 - books.google.com
If you're a developer working with XML, you know there's a lot to know about XML, and the
XML space is evolving almost moment by moment. But you don't need to commit every XML …

OBSERVER: An approach for query processing in global information systems based on interoperation across pre-existing ontologies

E Mena, A Illarramendi, V Kashyap… - Distributed and parallel …, 2000 - Springer
There has been an explosion in the types, availability and volume of data accessible in an
information system, thanks to the World Wide Web (the Web) and related inter-networking …

XWRAP: An XML-enabled wrapper construction system for web information sources

L Liu, C Pu, W Han - … of 16th International Conference on Data …, 2000 - ieeexplore.ieee.org
The paper describes the methodology and the software development of XWRAP, an
XML-enabled wrapper construction system for semi-automatic generation of wrapper programs …

Managing semantic heterogeneity in databases: a theoretical prospective

R Hull - Proceedings of the sixteenth ACM SIGACT-SIGMOD …, 1997 - dl.acm.org
Modern database management systems essentially solve the problem of accessing and
managing large volumes of related data on a single platform, or on a cluster of tightly …

Extracting Semistructured Information from the Web.

J Hammer, H Garcia-Molina, J Cho, R Aranha… - 1997 - ilpubs.stanford.edu
We describe a configurable tool for extracting semistructured data from a set of HTML pages
and for converting the extracted information into database objects. The input to the extractor …

Query processing issues in image (multimedia) databases

S Nepal, MV Ramakrishna - Proceedings 15th International …, 1999 - ieeexplore.ieee.org
Multimedia database systems are essential for the effective and efficient use of large
collections of image data. The aim of such systems is to enable retrieval of images based on …

Wrapper generation for semi-structured internet sources

N Ashish, CA Knoblock - ACM Sigmod Record, 1997 - dl.acm.org
With the current explosion of information on the World Wide Web (WWW) a wealth of
information on many different subjects has become available on-line. Numerous sources …

DEByE–data extraction by example

AHF Laender, B Ribeiro-Neto, AS Da Silva - Data & Knowledge …, 2002 - Elsevier
In this paper we present DEByE (Data Extraction By Example), an approach to extracting
data from Web sources, based on a small set of examples specified by the user. The novelty …