Data-driven materials research enabled by natural language processing and information extraction

EA Olivetti, JM Cole, E Kim, O Kononova… - Applied Physics …, 2020 - pubs.aip.org
Given the emergence of data science and machine learning throughout all aspects of
society, but particularly in the scientific domain, there is increased importance placed on …

[HTML][HTML] Opportunities and challenges of text mining in materials research

O Kononova, T He, H Huo, A Trewartha, EA Olivetti… - Iscience, 2021 - cell.com
Research publications are the major repository of scientific knowledge. However, their
unstructured and highly heterogenous format creates a significant obstacle to large-scale …

Artificial intelligence (AI) futures: India-UK collaborations emerging from the 4th Royal Society Yusuf Hamied workshop

YK Dwivedi, L Hughes, HKDH Bhadeshia… - International Journal of …, 2024 - Elsevier
Abstract “Artificial Intelligence” in all its forms has emerged as a transformative technology
that is in the process of resha** many aspects of industry and wider society at a global …

ChemDataExtractor 2.0: Autopopulated ontologies for materials science

J Mavracic, CJ Court, T Isazawa… - Journal of Chemical …, 2021 - ACS Publications
The ever-growing abundance of data found in heterogeneous sources, such as scientific
publications, has forced the development of automated techniques for data extraction. While …

Deep learning object detection in materials science: Current state and future directions

R Jacobs - Computational Materials Science, 2022 - Elsevier
Deep learning-based object detection models have recently found widespread use in
materials science, with rapid progress made in just the past two years. Scanning and …

Automation and machine learning augmented by large language models in a catalysis study

Y Su, X Wang, Y Ye, Y **e, Y Xu, Y Jiang, C Wang - Chemical Science, 2024 - pubs.rsc.org
Recent advancements in artificial intelligence and automation are transforming catalyst
discovery and design from traditional trial-and-error manual mode into intelligent, high …

Image-based machine learning for materials science

L Zhang, S Shao - Journal of Applied Physics, 2022 - pubs.aip.org
Materials research studies are dealing with a large number of images, which can now be
facilitated via image-based machine learning techniques. In this article, we review recent …

From text to insight: large language models for materials science data extraction

M Schilling-Wilhelmi, M Ríos-García, S Shabih… - arxiv preprint arxiv …, 2024 - arxiv.org
The vast majority of materials science knowledge exists in unstructured natural language,
yet structured data is crucial for innovative and systematic materials design. Traditionally, the …

Audacity of huge: overcoming challenges of data scarcity and data quality for machine learning in computational materials discovery

A Nandy, C Duan, HJ Kulik - Current Opinion in Chemical Engineering, 2022 - Elsevier
Machine learning (ML)-accelerated discovery requires large amounts of high-fidelity data to
reveal predictive structure–property relationships. For many properties of interest in …

Looking through glass: Knowledge discovery from materials science literature using natural language processing

V Venugopal, S Sahoo, M Zaki, M Agarwal… - Patterns, 2021 - cell.com
Most of the knowledge in materials science literature is in the form of unstructured data such
as text and images. Here, we present a framework employing natural language processing …