Pre-trained language models in biomedical domain: A systematic survey

B Wang, Q **e, J Pei, Z Chen, P Tiwari, Z Li… - ACM Computing …, 2023 - dl.acm.org
Pre-trained language models (PLMs) have been the de facto paradigm for most natural
language processing tasks. This also benefits the biomedical domain: researchers from …

Domain adaptation: challenges, methods, datasets, and applications

P Singhal, R Walambe, S Ramanna, K Kotecha - IEEE access, 2023 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) trained on one dataset (source domain) do not perform well
on another set of data (target domain), which is different but has similar properties as the …

BioBERT: a pre-trained biomedical language representation model for biomedical text mining

J Lee, W Yoon, S Kim, D Kim, S Kim, CH So… - …, 2020 - academic.oup.com
Motivation Biomedical text mining is becoming increasingly important as the number of
biomedical documents rapidly grows. With the progress in natural language processing …

miRBase: from microRNA sequences to function

A Kozomara, M Birgaoanu… - Nucleic acids …, 2019 - academic.oup.com
Abstract miRBase catalogs, names and distributes microRNA gene sequences. The latest
release of miRBase (v22) contains microRNA sequences from 271 organisms: 38 589 …

[HTML][HTML] A comprehensive evaluation of large language models on benchmark biomedical text processing tasks

I Jahan, MTR Laskar, C Peng, JX Huang - Computers in biology and …, 2024 - Elsevier
Abstract Recently, Large Language Models (LLMs) have demonstrated impressive
capability to solve a wide range of tasks. However, despite their success across various …

ScispaCy: fast and robust models for biomedical natural language processing

M Neumann, D King, I Beltagy, W Ammar - arxiv preprint arxiv …, 2019 - arxiv.org
Despite recent advances in natural language processing, many statistical models for
processing text perform extremely poorly under domain shift. Processing biomedical and …

PubTator central: automated concept annotation for biomedical full text articles

CH Wei, A Allot, R Leaman, Z Lu - Nucleic acids research, 2019 - academic.oup.com
Abstract PubTator Central (https://www. ncbi. nlm. nih. gov/research/pubtator/) is a web
service for viewing and retrieving bioconcept annotations in full text biomedical articles …

BioRED: a rich biomedical relation extraction dataset

L Luo, PT Lai, CH Wei, CN Arighi… - Briefings in …, 2022 - academic.oup.com
Automated relation extraction (RE) from biomedical literature is critical for many downstream
text mining applications in both research and real-world settings. However, most existing …

Deep learning with word embeddings improves biomedical named entity recognition

M Habibi, L Weber, M Neves, DL Wiegandt… - …, 2017 - academic.oup.com
Motivation Text mining has become an important tool for biomedical research. The most
fundamental text-mining task is the recognition of biomedical named entities (NER), such as …

BERN2: an advanced neural biomedical named entity recognition and normalization tool

M Sung, M Jeong, Y Choi, D Kim, J Lee, J Kang - Bioinformatics, 2022 - academic.oup.com
In biomedical natural language processing, named entity recognition (NER) and named
entity normalization (NEN) are key tasks that enable the automatic extraction of biomedical …