A survey on clinical natural language processing in the United Kingdom from 2007 to 2022
Much of the knowledge and information needed for enabling high-quality clinical research is
stored in free-text format. Natural language processing (NLP) has been used to extract …
stored in free-text format. Natural language processing (NLP) has been used to extract …
An overview of biomedical entity linking throughout the years
Abstract Biomedical Entity Linking (BEL) is the task of map** of spans of text within
biomedical documents to normalized, unique identifiers within an ontology. This is an …
biomedical documents to normalized, unique identifiers within an ontology. This is an …
Self-alignment pretraining for biomedical entity representations
Despite the widespread success of self-supervised learning via masked language models
(MLM), accurately capturing fine-grained semantic relationships in the biomedical domain …
(MLM), accurately capturing fine-grained semantic relationships in the biomedical domain …
BioBART: Pretraining and evaluation of a biomedical generative language model
Pretrained language models have served as important backbones for natural language
processing. Recently, in-domain pretraining has been shown to benefit various domain …
processing. Recently, in-domain pretraining has been shown to benefit various domain …
Benchmarking intersectional biases in NLP
There has been a recent wave of work assessing the fairness of machine learning models in
general, and more specifically, on natural language processing (NLP) models built using …
general, and more specifically, on natural language processing (NLP) models built using …
Fast, effective, and self-supervised: Transforming masked language models into universal lexical and sentence encoders
Pretrained Masked Language Models (MLMs) have revolutionised NLP in recent years.
However, previous work has indicated that off-the-shelf MLMs are not effective as universal …
However, previous work has indicated that off-the-shelf MLMs are not effective as universal …
[HTML][HTML] CODER: Knowledge-infused cross-lingual medical term embedding for term normalization
Objective This paper aims to propose knowledge-aware embedding, a critical tool for
medical term normalization. Methods We develop CODER (Cross-lingual knowledge …
medical term normalization. Methods We develop CODER (Cross-lingual knowledge …
COMETA: A corpus for medical entity linking in the social media
Whilst there has been growing progress in Entity Linking (EL) for general language, existing
datasets fail to address the complex nature of health terminology in layman's language …
datasets fail to address the complex nature of health terminology in layman's language …
Building and using personal knowledge graph to improve suicidal ideation detection on social media
A large number of individuals are suffering from suicidal ideation in the world. There are a
number of causes behind why an individual might suffer from suicidal ideation. As the most …
number of causes behind why an individual might suffer from suicidal ideation. As the most …
Data evaluation and enhancement for quality improvement of machine learning
Poor data quality has a direct impact on the performance of the machine learning system
that is built on the data. As a demonstrated effective approach for data quality improvement …
that is built on the data. As a demonstrated effective approach for data quality improvement …