Large language models and knowledge graphs: Opportunities and challenges

JZ Pan, S Razniewski, JC Kalo, S Singhania… - ar…
Equipping machines with comprehensive knowledge of the world's entities and their
relationships has been a longstanding goal of AI. Over the last decade, large-scale …

Simple embedding for link prediction in knowledge graphs

SM Kazemi, D Poole - Advances in neural information …, 2018 - proceedings.neurips.cc
Knowledge graphs contain knowledge about the world and provide a structured
representation of this knowledge. Current knowledge graphs contain only a small subset of …

Concrete problems in AI safety

D Amodei, C Olah, J Steinhardt, P Christiano… - arXiv preprint arXiv …, 2016 - arxiv.org
Rapid progress in machine learning and artificial intelligence (AI) has brought increasing
attention to the potential impacts of AI technologies on society. In this paper we discuss one …

How large language models will disrupt data management

RC Fernandez, AJ Elmore, MJ Franklin… - Proceedings of the …, 2023 - dl.acm.org
Large language models (LLMs), such as GPT-4, are revolutionizing software's ability to
understand, process, and synthesize language. The authors of this paper believe that this …

[BOOK][B] Data cleaning

IF Ilyas, X Chu - 2019 - books.google.com
This is an overview of the end-to-end data cleaning process. Data quality is one of the most
important problems in data management, since dirty data often leads to inaccurate data …

Data programming: Creating large training sets, quickly

AJ Ratner, CM De Sa, S Wu… - Advances in neural …, 2016 - proceedings.neurips.cc
Large labeled training sets are the critical building blocks of supervised learning methods
and are key enablers of deep learning techniques. For some applications, creating labeled …

HoloClean: Holistic data repairs with probabilistic inference

T Rekatsinas, X Chu, IF Ilyas, C Ré - arXiv preprint arXiv:1702.00820, 2017 - arxiv.org
We introduce HoloClean, a framework for holistic data repairing driven by probabilistic
inference. HoloClean unifies existing qualitative data repairing approaches, which rely on …

Language models enable simple systems for generating structured views of heterogeneous data lakes

S Arora, B Yang, S Eyuboglu, A Narayan… - arXiv preprint arXiv …, 2023 - arxiv.org
A long standing goal of the data management community is to develop general, automated
systems that ingest semi-structured documents and output queryable tables without human …

Semantic search on text and knowledge bases

H Bast, B Buchhold, E Haussmann - Foundations and Trends® …, 2016 - nowpublishers.com
This article provides a comprehensive overview of the broad area of semantic search on text
and knowledge bases. In a nutshell, semantic search is “search with meaning”. This …