A guide to machine learning for biologists

JG Greener, SM Kandathil, L Moffat… - Nature reviews Molecular …, 2022 - nature.com
The expanding scale and inherent complexity of biological data have encouraged a growing
use of machine learning in biology to build informative and predictive models of the …

Graph neural networks and their current applications in bioinformatics

XM Zhang, L Liang, L Liu, MJ Tang - Frontiers in genetics, 2021 - frontiersin.org
Graph neural networks (GNNs), as a branch of deep learning in non-Euclidean space,
perform particularly well in various tasks that process graph structure data. With the rapid …

Mega-scale experimental analysis of protein folding stability in biology and design

K Tsuboyama, J Dauparas, J Chen, E Laine… - Nature, 2023 - nature.com
Advances in DNA sequencing and machine learning are providing insights into protein
sequences and structures on an enormous scale. However, the energetics driving folding …

Scaffolding protein functional sites using deep learning

J Wang, S Lisanza, D Juergens, D Tischer, JL Watson… - Science, 2022 - science.org
The binding and catalytic functions of proteins are generally mediated by a small number of
functional residues held in place by the overall protein structure. Here, we describe deep …

Learning inverse folding from millions of predicted structures

C Hsu, R Verkuil, J Liu, Z Lin, B Hie… - International …, 2022 - proceedings.mlr.press
We consider the problem of predicting a protein sequence from its backbone atom
coordinates. Machine learning approaches to this problem to date have been limited by the …

Antigen-specific antibody design and optimization with diffusion-based generative models for protein structures

S Luo, Y Su, X Peng, S Wang… - Advances in Neural …, 2022 - proceedings.neurips.cc
Antibodies are immune system proteins that protect the host by binding to specific antigens
such as viruses and bacteria. The binding between antibodies and antigens is mainly …

Protein representation learning by geometric structure pretraining

Z Zhang, M Xu, A Jamasb… - arxiv preprint arxiv …, 2022 - arxiv.org
Learning effective protein representations is critical in a variety of tasks in biology such as
predicting protein function or structure. Existing approaches usually pretrain protein …

Protein structure and sequence generation with equivariant denoising diffusion probabilistic models

N Anand, T Achim - arxiv preprint arxiv:2205.15019, 2022 - arxiv.org
Proteins are macromolecules that mediate a significant fraction of the cellular processes that
underlie life. An important task in bioengineering is designing proteins with specific 3D …

De novo protein design by deep network hallucination

I Anishchenko, SJ Pellock, TM Chidyausiku… - Nature, 2021 - nature.com
There has been considerable recent progress in protein structure prediction using deep
neural networks to predict inter-residue distances from amino acid sequences,–. Here we …

Learning from protein structure with geometric vector perceptrons

B **g, S Eismann, P Suriana, RJL Townshend… - arxiv preprint arxiv …, 2020 - arxiv.org
Learning on 3D structures of large biomolecules is emerging as a distinct area in machine
learning, but there has yet to emerge a unifying network architecture that simultaneously …