An Eye on Clinical BERT: Investigating Language Model Generalization for Diabetic Eye Disease Phenoty**

K Harrigian, T Tang, A Gonzales, CX Cai… - arxiv preprint arxiv …, 2023 - arxiv.org
Diabetic eye disease is a major cause of blindness worldwide. The ability to monitor relevant
clinical trajectories and detect lapses in care is critical to managing the disease and …

Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs

S Feucht, D Atkinson, B Wallace, D Bau - arxiv preprint arxiv:2406.20086, 2024 - arxiv.org
LLMs process text as sequences of tokens that roughly correspond to words, where less
common words are represented by multiple tokens. However, individual tokens are often …

How Important Is Tokenization in French Medical Masked Language Models?

Y Labrak, A Bazoge, B Daille, M Rouvier… - arxiv preprint arxiv …, 2024 - arxiv.org
Subword tokenization has become the prevailing standard in the field of natural language
processing (NLP) over recent years, primarily due to the widespread utilization of pre-trained …

Towards Robust Natural Language Processing to Promote Health Equity

K Harrigian - 2024 - jscholarship.library.jhu.edu
Natural language processing (NLP) has rapidly become an integral component of
contemporary healthcare infrastructure and is likely to become more deeply entrenched in …

[PDF][PDF] Pathological-Llama: an Explainable Medical Visual Question An

S Nguyen - 2024 - zhaw.ch
Abstract This thesis introduces Pathological-Llama, an explainable medical visual question
answering system that integrates computer vision and natural language processing to …