Machine knowledge: Creation and curation of comprehensive knowledge bases

G Weikum, XL Dong, S Razniewski… - … and Trends® in …, 2021 - nowpublishers.com
Equip** machines with comprehensive knowledge of the world's entities and their
relationships has been a longstanding goal of AI. Over the last decade, large-scale …

Holistic evaluation of language models

P Liang, R Bommasani, T Lee, D Tsipras… - arxiv preprint arxiv …, 2022 - arxiv.org
Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …

Tabllm: Few-shot classification of tabular data with large language models

S Hegselmann, A Buendia, H Lang… - International …, 2023 - proceedings.mlr.press
We study the application of large language models to zero-shot and few-shot classification
of tabular data. We prompt the large language model with a serialization of the tabular data …

Data cleaning and machine learning: a systematic literature review

PO Côté, A Nikanjam, N Ahmed, D Humeniuk… - Automated Software …, 2024 - Springer
Abstract Machine Learning (ML) is integrated into a growing number of systems for various
applications. Because the performance of an ML model is highly dependent on the quality of …

Can foundation models wrangle your data?

A Narayan, I Chami, L Orr, S Arora, C Ré - arxiv preprint arxiv:2205.09911, 2022 - arxiv.org
Foundation Models (FMs) are models trained on large corpora of data that, at very large
scale, can generalize to new tasks without any task-specific finetuning. As these models …

How large language models will disrupt data management

RC Fernandez, AJ Elmore, MJ Franklin… - Proceedings of the …, 2023 - dl.acm.org
Large language models (LLMs), such as GPT-4, are revolutionizing software's ability to
understand, process, and synthesize language. The authors of this paper believe that this …

CancerGPT for few shot drug pair synergy prediction using large pretrained language models

T Li, S Shetty, A Kamath, A Jaiswal, X Jiang… - NPJ Digital …, 2024 - nature.com
Large language models (LLMs) have been shown to have significant potential in few-shot
learning across various fields, even with minimal training data. However, their ability to …

Turl: Table understanding through representation learning

X Deng, H Sun, A Lees, Y Wu, C Yu - ACM SIGMOD Record, 2022 - dl.acm.org
Relational tables on the Web store a vast amount of knowledge. Owing to the wealth of such
tables, there has been tremendous progress on a variety of tasks in the area of table …

Table-gpt: Table-tuned gpt for diverse table tasks

P Li, Y He, D Yashar, W Cui, S Ge, H Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
Language models, such as GPT-3.5 and ChatGPT, demonstrate remarkable abilities to
follow diverse human instructions and perform a wide range of tasks. However, when …

Oag-bench: a human-curated benchmark for academic graph mining

F Zhang, S Shi, Y Zhu, B Chen, Y Cen, J Yu… - Proceedings of the 30th …, 2024 - dl.acm.org
With the rapid proliferation of scientific literature, versatile academic knowledge services
increasingly rely on comprehensive academic graph mining. Despite the availability of …