Epistasis in protein evolution

TN Starr, JW Thornton - Protein science, 2016 - Wiley Online Library
The structure, function, and evolution of proteins depend on physical and genetic
interactions among amino acids. Recent studies have used new strategies to explore the …

Inverse statistical physics of protein sequences: a key issues review

S Cocco, C Feinauer, M Figliuzzi… - Reports on Progress …, 2018 - iopscience.iop.org
In the course of evolution, proteins undergo important changes in their amino acid
sequences, while their three-dimensional folded structure and their biological function …

Language models enable zero-shot prediction of the effects of mutations on protein function

J Meier, R Rao, R Verkuil, J Liu… - Advances in neural …, 2021 - proceedings.neurips.cc
Modeling the effect of sequence variation on function is a fundamental problem for
understanding and designing proteins. Since evolution encodes information about function …

Learning protein fitness models from evolutionary and assay-labeled data

C Hsu, H Nisonoff, C Fannjiang, J Listgarten - Nature biotechnology, 2022 - nature.com
Abstract Machine learning-based models of protein fitness typically learn from either
unlabeled, evolutionarily related sequences or variant sequences with experimentally …

Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences

A Rives, J Meier, T Sercu, S Goyal, Z Lin, J Liu… - Proceedings of the …, 2021 - pnas.org
In the field of artificial intelligence, a combination of scale in data and model capacity
enabled by unsupervised learning has led to major advances in representation learning and …

An evolution-based model for designing chorismate mutase enzymes

WP Russ, M Figliuzzi, C Stocker, P Barrat-Charlaix… - Science, 2020 - science.org
The rational design of enzymes is an important goal for both fundamental and practical
reasons. Here, we describe a process to learn the constraints for specifying proteins purely …

Protein design and variant prediction using autoregressive generative models

JE Shin, AJ Riesselman, AW Kollasch… - Nature …, 2021 - nature.com
The ability to design functional sequences and predict effects of variation is central to protein
engineering and biotherapeutics. State-of-art computational methods rely on models that …

Mutation effects predicted from sequence co-variation

TA Hopf, JB Ingraham, FJ Poelwijk, CPI Schärfe… - Nature …, 2017 - nature.com
Many high-throughput experimental technologies have been developed to assess the
effects of large numbers of mutations (variation) on phenotypes. However, designing …

Generating functional protein variants with variational autoencoders

A Hawkins-Hooker, F Depardieu, S Baur… - PLoS computational …, 2021 - journals.plos.org
The vast expansion of protein sequence databases provides an opportunity for new protein
design approaches which seek to learn the sequence-function relationship directly from …

Deep generative models of genetic variation capture the effects of mutations

AJ Riesselman, JB Ingraham, DS Marks - Nature methods, 2018 - nature.com
The functions of proteins and RNAs are defined by the collective interactions of many
residues, and yet most statistical models of biological sequences consider sites nearly …