Scientific large language models: A survey on biological & chemical domains

Q Zhang, K Ding, T Lv, X Wang, Q Yin, Y Zhang… - ACM Computing …, 2024 - dl.acm.org
Large Language Models (LLMs) have emerged as a transformative power in enhancing
natural language comprehension, representing a significant stride toward artificial general …

Leveraging biomolecule and natural language through multi-modal learning: A survey

Q Pei, L Wu, K Gao, J Zhu, Y Wang, Z Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
The integration of biomolecular modeling with natural language (BL) has emerged as a
promising interdisciplinary area at the intersection of artificial intelligence, chemistry and …

Large language models for inorganic synthesis predictions

S Kim, Y Jung, J Schrier - Journal of the American Chemical …, 2024 - ACS Publications
We evaluate the effectiveness of pretrained and fine-tuned large language models (LLMs)
for predicting the synthesizability of inorganic compounds and the selection of precursors …

Are large language models superhuman chemists?

A Mirza, N Alampara, S Kunchapu… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) have gained widespread interest due to their ability to
process human language and perform tasks on which they have not been explicitly trained …

LlaSMol: Advancing large language models for chemistry with a large-scale, comprehensive, high-quality instruction tuning dataset

B Yu, FN Baker, Z Chen, X Ning, H Sun - arxiv preprint arxiv:2402.09391, 2024 - arxiv.org
Chemistry plays a crucial role in many domains, such as drug discovery and material
science. While large language models (LLMs) such as GPT-4 exhibit remarkable …

A comprehensive survey of scientific large language models and their applications in scientific discovery

Y Zhang, X Chen, B Jin, S Wang, S Ji, W Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
In many scientific fields, large language models (LLMs) have revolutionized the way text and
other modalities of data (e.g., molecules and proteins) are handled, achieving superior …

L+M-24: Building a dataset for Language + Molecules @ ACL 2024

C Edwards, Q Wang, L Zhao, H Ji - arxiv preprint arxiv:2403.00791, 2024 - arxiv.org
Language-molecule models have emerged as an exciting direction for molecular discovery
and understanding. However, training these models is challenging due to the scarcity of …

A comprehensive survey of small language models in the era of large language models: Techniques, enhancements, applications, collaboration with LLMs, and …

F Wang, Z Zhang, X Zhang, Z Wu, T Mo, Q Lu… - arxiv preprint arxiv …, 2024 - ai.radensa.ru
Large language models (LLM) have demonstrated emergent abilities in text generation,
question answering, and reasoning, facilitating various tasks and domains. Despite their …

Towards building specialized generalist AI with system 1 and system 2 fusion

K Zhang, B Qi, B Zhou - arxiv preprint arxiv:2407.08642, 2024 - arxiv.org
In this perspective paper, we introduce the concept of Specialized Generalist Artificial
Intelligence (SGAI or simply SGI) as a crucial milestone toward Artificial General Intelligence …

SciRIFF: A resource to enhance language model instruction-following over scientific literature

D Wadden, K Shi, J Morrison, A Naik, S Singh… - arxiv preprint arxiv …, 2024 - arxiv.org
We present SciRIFF (Scientific Resource for Instruction-Following and Finetuning), a dataset
of 137K instruction-following demonstrations for 54 tasks covering five essential scientific …