Scientific large language models: A survey on biological & chemical domains

Q Zhang, K Ding, T Lv, X Wang, Q Yin, Y Zhang… - ACM Computing …, 2024 - dl.acm.org
Large Language Models (LLMs) have emerged as a transformative power in enhancing
natural language comprehension, representing a significant stride toward artificial general …

A review of large language models and autonomous agents in chemistry

MC Ramos, CJ Collison, AD White - Chemical Science, 2025 - pubs.rsc.org
Large language models (LLMs) have emerged as powerful tools in chemistry, significantly
impacting molecule design, property prediction, and synthesis optimization. This review …

Mol-instructions: A large-scale biomolecular instruction dataset for large language models

Y Fang, X Liang, N Zhang, K Liu, R Huang… - arxiv …

Y Lyu, Z Wu, L Zhang, J Zhang, Y Li, W Ruan… - arxiv preprint arxiv …, 2024 - arxiv.org
Pre-trained large language models (LLMs) have attracted increasing attention in biomedical
domains due to their success in natural language processing. However, the complex traits …

Biomedgpt: An open multimodal large language model for biomedicine

Y Luo, J Zhang, S Fan, K Yang, M Hong… - IEEE Journal of …, 2024 - ieeexplore.ieee.org
Recent advances in large language models (LLMs) like ChatGPT have shed light on the
development of knowledgeable and versatile AI research assistants in various scientific …

Regression with large language models for materials and molecular property prediction

R Jacobs, MP Polak, LE Schultz, H Mahdavi… - arxiv preprint arxiv …, 2024 - arxiv.org
We demonstrate the ability of large language models (LLMs) to perform material and
molecular property regression tasks, a significant deviation from the conventional LLM use …

Instructbiomol: Advancing biomolecule understanding and design following human instructions

X Zhuang, K Ding, T Lyu, Y Jiang, X Li, Z Xiang… - arxiv preprint arxiv …, 2024 - arxiv.org
Understanding and designing biomolecules, such as proteins and small molecules, is
central to advancing drug discovery, synthetic biology, and enzyme engineering. Recent …

Learning multi-view molecular representations with structured and unstructured knowledge

Y Luo, K Yang, M Hong, XY Liu, Z Nie, H Zhou… - Proceedings of the 30th …, 2024 - dl.acm.org
Capturing molecular knowledge with representation learning approaches holds significant
potential in vast scientific fields such as chemistry and life science. An effective and …

A quantitative analysis of knowledge-learning preferences in large language models in molecular science

P Liu, J Tao, Z Ren - Nature Machine Intelligence, 2025 - nature.com
Deep learning has significantly advanced molecular modelling and design, enabling an
efficient understanding and discovery of novel molecules. In particular, large language …