On llms-driven synthetic data generation, curation, and evaluation: A survey
Within the evolving landscape of deep learning, the dilemma of data quantity and quality has
been a long-standing problem. The recent advent of Large Language Models (LLMs) offers …
been a long-standing problem. The recent advent of Large Language Models (LLMs) offers …
A survey on data synthesis and augmentation for large language models
K Wang, J Zhu, M Ren, Z Liu, S Li, Z Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
The success of Large Language Models (LLMs) is inherently linked to the availability of vast,
diverse, and high-quality data for training and evaluation. However, the growth rate of high …
diverse, and high-quality data for training and evaluation. However, the growth rate of high …
Predicting text preference via structured comparative reasoning
Comparative reasoning plays a crucial role in predicting text preferences; however, large
language models (LLMs) often demonstrate inconsistencies in their reasoning, leading to …
language models (LLMs) often demonstrate inconsistencies in their reasoning, leading to …
Medadapter: Efficient test-time adaptation of large language models towards medical reasoning
Despite their improved capabilities in generation and reasoning, adapting large language
models (LLMs) to the biomedical domain remains challenging due to their immense size …
models (LLMs) to the biomedical domain remains challenging due to their immense size …
Automated clinical data extraction with knowledge conditioned llms
The extraction of lung lesion information from clinical and medical imaging reports is crucial
for research on and clinical care of lung-related diseases. Large language models (LLMs) …
for research on and clinical care of lung-related diseases. Large language models (LLMs) …
Large knowledge model: Perspectives and challenges
H Chen - arxiv preprint arxiv:2312.02706, 2023 - arxiv.org
Humankind's understanding of the world is fundamentally linked to our perception and
cognition, with\emph {human languages} serving as one of the major carriers of\emph {world …
cognition, with\emph {human languages} serving as one of the major carriers of\emph {world …
Hydra: Model factorization framework for black-box llm personalization
Personalization has emerged as a critical research area in modern intelligent systems,
focusing on mining users' behavioral history and adapting to their preferences for delivering …
focusing on mining users' behavioral history and adapting to their preferences for delivering …
Polyie: A dataset of information extraction from polymer material scientific literature
Scientific information extraction (SciIE), which aims to automatically extract information from
scientific literature, is becoming more important than ever. However, there are no existing …
scientific literature, is becoming more important than ever. However, there are no existing …
Vlm4bio: A benchmark dataset to evaluate pretrained vision-language models for trait discovery from biological images
Images are increasingly becoming the currency for documenting biodiversity on the planet,
providing novel opportunities for accelerating scientific discoveries in the field of organismal …
providing novel opportunities for accelerating scientific discoveries in the field of organismal …
On what basis? predicting text preference via structured comparative reasoning
Comparative reasoning plays a crucial role in text preference prediction; however, large
language models (LLMs) often demonstrate inconsistencies in their reasoning. While …
language models (LLMs) often demonstrate inconsistencies in their reasoning. While …