On llms-driven synthetic data generation, curation, and evaluation: A survey

L Long, R Wang, R **ao, J Zhao, X Ding… - arxiv preprint arxiv …, 2024 - arxiv.org
Within the evolving landscape of deep learning, the dilemma of data quantity and quality has
been a long-standing problem. The recent advent of Large Language Models (LLMs) offers …

A survey on data synthesis and augmentation for large language models

K Wang, J Zhu, M Ren, Z Liu, S Li, Z Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
The success of Large Language Models (LLMs) is inherently linked to the availability of vast,
diverse, and high-quality data for training and evaluation. However, the growth rate of high …

Predicting text preference via structured comparative reasoning

JN Yan, T Liu, J Chiu, J Shen, Z Qin, Y Yu… - Proceedings of the …, 2024 - aclanthology.org
Comparative reasoning plays a crucial role in predicting text preferences; however, large
language models (LLMs) often demonstrate inconsistencies in their reasoning, leading to …

Medadapter: Efficient test-time adaptation of large language models towards medical reasoning

W Shi, R Xu, Y Zhuang, Y Yu, H Sun, H Wu… - arxiv preprint arxiv …, 2024 - arxiv.org
Despite their improved capabilities in generation and reasoning, adapting large language
models (LLMs) to the biomedical domain remains challenging due to their immense size …

Automated clinical data extraction with knowledge conditioned llms

D Li, A Kadav, A Gao, R Li, R Bourgon - arxiv preprint arxiv:2406.18027, 2024 - arxiv.org
The extraction of lung lesion information from clinical and medical imaging reports is crucial
for research on and clinical care of lung-related diseases. Large language models (LLMs) …

Large knowledge model: Perspectives and challenges

H Chen - arxiv preprint arxiv:2312.02706, 2023 - arxiv.org
Humankind's understanding of the world is fundamentally linked to our perception and
cognition, with\emph {human languages} serving as one of the major carriers of\emph {world …

Hydra: Model factorization framework for black-box llm personalization

Y Zhuang, H Sun, Y Yu, R Qiang, Q Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Personalization has emerged as a critical research area in modern intelligent systems,
focusing on mining users' behavioral history and adapting to their preferences for delivering …

Polyie: A dataset of information extraction from polymer material scientific literature

JJ Cheung, Y Zhuang, Y Li, P Shetty, W Zhao… - arxiv preprint arxiv …, 2023 - arxiv.org
Scientific information extraction (SciIE), which aims to automatically extract information from
scientific literature, is becoming more important than ever. However, there are no existing …

Vlm4bio: A benchmark dataset to evaluate pretrained vision-language models for trait discovery from biological images

M Maruf, A Daw, KS Mehrab, HB Manogaran… - arxiv preprint arxiv …, 2024 - arxiv.org
Images are increasingly becoming the currency for documenting biodiversity on the planet,
providing novel opportunities for accelerating scientific discoveries in the field of organismal …

On what basis? predicting text preference via structured comparative reasoning

JN Yan, T Liu, JT Chiu, J Shen, Z Qin, Y Yu… - arxiv preprint arxiv …, 2023 - arxiv.org
Comparative reasoning plays a crucial role in text preference prediction; however, large
language models (LLMs) often demonstrate inconsistencies in their reasoning. While …