Statistical inference links data and theory in network science

L Peel, TP Peixoto, M De Domenico - Nature Communications, 2022 - nature.com
The number of network science applications across many different fields has been rapidly
increasing. Surprisingly, the development of theory and domain-specific applications often …

Ontology population and enrichment: State of the art

G Petasis, V Karkaletsis, G Paliouras, A Krithara… - … extraction and ontology …, 2011 - Springer
Ontology learning is the process of acquiring (constructing or integrating) an ontology (semi-
) automatically. Being a knowledge acquisition task, it is a complex activity, which becomes …

[書籍][B] Principles of data integration

AH Doan, A Halevy, Z Ives - 2012 - books.google.com
Principles of Data Integration is the first comprehensive textbook of data integration,
covering theoretical principles and implementation issues as well as current challenges …

[書籍][B] Probabilistic graphical models: principles and techniques

D Koller, N Friedman - 2009 - books.google.com
A general framework for constructing and using probabilistic models of complex systems that
would enable a computer to use available information for making decisions. Most tasks …

Low-resource deep entity resolution with transfer and active learning

J Kasai, K Qian, S Gurajada, Y Li, L Popa - arxiv preprint arxiv …, 2019 - arxiv.org
Entity resolution (ER) is the task of identifying different representations of the same real-
world entities across databases. It is a key step for knowledge base creation and text mining …

Duplicate record detection: A survey

AK Elmagarmid, PG Ipeirotis… - IEEE Transactions on …, 2006 - ieeexplore.ieee.org
Often, in the real world, entities have two or more representations in databases. Duplicate
records do not share a common key and/or they contain errors that make duplicate matching …

[PDF][PDF] A Comparison of String Distance Metrics for Name-Matching Tasks.

WW Cohen, P Ravikumar, SE Fienberg - IIWeb, 2003 - pubs.dbs.uni-leipzig.de
Using an open-source, Java toolkit of name-matching methods, we experimentally compare
string distance metrics on the task of matching entity names. We investigate a number of …

[書籍][B] Introduction to statistical relational learning

L Getoor, B Taskar - 2007 - books.google.com
Advanced statistical modeling and knowledge representation techniques for a newly
emerging area of machine learning and probabilistic reasoning; includes introductory …

Information extraction

S Sarawagi - Foundations and Trends® in Databases, 2008 - nowpublishers.com
The automatic extraction of information from unstructured sources has opened up new
avenues for querying, organizing, and analyzing data by drawing upon the clean semantics …

Link mining: a survey

L Getoor, CP Diehl - Acm Sigkdd Explorations Newsletter, 2005 - dl.acm.org
Many datasets of interest today are best described as a linked collection of interrelated
objects. These may represent homogeneous networks, in which there is a single-object type …