Linguistically inspired roadmap for building biologically reliable protein language models

MH Vu, R Akbar, PA Robert, B Swiatczak… - Nature Machine …, 2023 - nature.com
Deep neural-network-based language models (LMs) are increasingly applied to large-scale
protein sequence data to predict protein function. However, being largely black-box models …

Linguistics-based formalization of the antibody language as a basis for antibody language models

MH Vu, PA Robert, R Akbar, B Swiatczak… - Nature Computational …, 2024 - nature.com
Apparent parallels between natural language and antibody sequences have led to a surge
in deep language models applied to antibody sequences for predicting cognate antigen …

Toward real-world automated antibody design with combinatorial Bayesian optimization

A Khan, AI Cowen-Rivers, A Grosnit, PA Robert… - Cell Reports …, 2023 - cell.com
Antibodies are multimeric proteins capable of highly specific molecular recognition. The
complementarity determining region 3 of the antibody variable heavy chain (CDRH3) often …

Automated optimisation of solubility and conformational stability of antibodies and proteins

A Rosace, A Bennett, M Oeller, MM Mortensen… - Nature …, 2023 - nature.com
Biologics, such as antibodies and enzymes, are crucial in research, biotechnology,
diagnostics, and therapeutics. Often, biologics with suitable functionality are discovered, but …

Applications of machine learning in biopharmaceutical process development and manufacturing: Current trends, challenges, and opportunities

TT Khuat, R Bassett, E Otte, A Grevis-James… - arxiv preprint arxiv …, 2023 - arxiv.org
While machine learning (ML) has made significant contributions to the biopharmaceutical
field, its applications are still in the early stages in terms of providing direct support for quality …

Improving generalization of machine learning-identified biomarkers with causal modeling: an investigation into immune receptor diagnostics

M Pavlović, GSA Hajj, C Kanduri, J Pensar… - arxiv preprint arxiv …, 2022 - arxiv.org
Machine learning is increasingly used to discover diagnostic and prognostic biomarkers
from high-dimensional molecular data. However, a variety of factors related to experimental …

Guiding diffusion models for antibody sequence and structure co-design with developability properties

A Villegas-Morcillo, JM Weber, MJT Reinders - PRX Life, 2024 - APS
Recent advances in deep generative methods have allowed antibody sequence and
structure co-design. This study addresses the challenge of tailoring the highly variable …

AbNatiV: VQ-VAE-based assessment of antibody and nanobody nativeness for hit selection, humanisation, and engineering

A Ramon, M Ali, M Atkinson, A Saturnino, K Didi… - bioRxiv, 2023 - biorxiv.org
Monoclonal antibodies have emerged as a key class of therapeutics, and nanobodies are
rapidly increasing in popularity following the approval of the first nanobody drug in 2019, yet …

ExpoSeq: simplified analysis of high-throughput sequencing data from antibody discovery campaigns

CV Sørensen, N Hofmann, P Rawat… - Bioinformatics …, 2024 - academic.oup.com
High-throughput sequencing (HTS) offers a modern, fast, and explorative solution to unveil
the full potential of display techniques, like antibody phage display, in molecular biology …

Design of antigen-specific antibody CDRH3 sequences using AI and germline-based templates

TM Marinov, AA Abu-Shmais, AK Janke, IS Georgiev - bioRxiv, 2024 - pmc.ncbi.nlm.nih.gov
Antibody-antigen specificity is engendered and refined through a number of complex B cell
processes, including germline gene recombination and somatic hypermutation. Here, we …