Opportunities and challenges for ChatGPT and large language models in biomedicine and health

S Tian, Q **, L Yeganova, PT Lai, Q Zhu… - Briefings in …, 2024 - academic.oup.com
ChatGPT has drawn considerable attention from both the general public and domain experts
with its remarkable text generation capabilities. This has subsequently led to the emergence …

Recent named entity recognition and classification techniques: a systematic review

A Goyal, V Gupta, M Kumar - Computer Science Review, 2018 - Elsevier
Textual information is becoming available in abundance on the web, arising the requirement
of techniques and tools to extract the meaningful information. One of such an important …

Transfer learning in biomedical natural language processing: an evaluation of BERT and ELMo on ten benchmarking datasets

Y Peng, S Yan, Z Lu - arxiv preprint arxiv:1906.05474, 2019 - arxiv.org
Inspired by the success of the General Language Understanding Evaluation benchmark, we
introduce the Biomedical Language Understanding Evaluation (BLUE) benchmark to …

ScispaCy: fast and robust models for biomedical natural language processing

M Neumann, D King, I Beltagy, W Ammar - arxiv preprint arxiv …, 2019 - arxiv.org
Despite recent advances in natural language processing, many statistical models for
processing text perform extremely poorly under domain shift. Processing biomedical and …

[HTML][HTML] Snorkel: Rapid training data creation with weak supervision

A Ratner, SH Bach, H Ehrenberg, J Fries… - Proceedings of the …, 2017 - ncbi.nlm.nih.gov
Labeling training data is increasingly the largest bottleneck in deploying machine learning
systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the …

Artificial intelligence for science in quantum, atomistic, and continuum systems

X Zhang, L Wang, J Helwig, Y Luo, C Fu, Y **e… - arxiv preprint arxiv …, 2023 - arxiv.org
Advances in artificial intelligence (AI) are fueling a new paradigm of discoveries in natural
sciences. Today, AI has started to advance natural sciences by improving, accelerating, and …

PubTator central: automated concept annotation for biomedical full text articles

CH Wei, A Allot, R Leaman, Z Lu - Nucleic acids research, 2019 - academic.oup.com
Abstract PubTator Central (https://www. ncbi. nlm. nih. gov/research/pubtator/) is a web
service for viewing and retrieving bioconcept annotations in full text biomedical articles …

Deep learning with word embeddings improves biomedical named entity recognition

M Habibi, L Weber, M Neves, DL Wiegandt… - …, 2017 - academic.oup.com
Motivation Text mining has become an important tool for biomedical research. The most
fundamental text-mining task is the recognition of biomedical named entities (NER), such as …

Snorkel: rapid training data creation with weak supervision

A Ratner, SH Bach, H Ehrenberg, J Fries, S Wu, C Ré - The VLDB Journal, 2020 - Springer
Labeling training data is increasingly the largest bottleneck in deploying machine learning
systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the …

[HTML][HTML] BioCreative V CDR task corpus: a resource for chemical disease relation extraction

J Li, Y Sun, RJ Johnson, D Sciaky, CH Wei… - Database, 2016 - academic.oup.com
Community-run, formal evaluations and manually annotated text corpora are critically
important for advancing biomedical text-mining research. Recently in BioCreative V, a new …