[HTML][HTML] Machine learning in protein structure prediction

M AlQuraishi - Current opinion in chemical biology, 2021 - Elsevier
Prediction of protein structure from sequence has been intensely studied for many decades,
owing to the problem's importance and its uniquely well-defined physical and computational …

Deep learning techniques have significantly impacted protein structure prediction and protein design

R Pearce, Y Zhang - Current opinion in structural biology, 2021 - Elsevier
Highlights⿢The recent use of deep learning has dramatically improved the accuracy of non-
homologous protein structure modeling.⿢Protein structure prediction problem was largely …

Learning the protein language: Evolution, structure, and function

T Bepler, B Berger - Cell systems, 2021 - cell.com
Language models have recently emerged as a powerful machine-learning approach for
distilling information from massive protein sequence databases. From readily available …

The landscape of tolerated genetic variation in humans and primates

H Gao, T Hamp, J Ede, JG Schraiber, J McRae… - Science, 2023 - science.org
Personalized genome sequencing has revealed millions of genetic differences between
individuals, but our understanding of their clinical relevance remains largely incomplete. To …

Bridge RNAs direct programmable recombination of target and donor DNA

MG Durrant, NT Perry, JJ Pai, AR Jangid… - Nature, 2024 - nature.com
Genomic rearrangements, encompassing mutational changes in the genome such as
insertions, deletions or inversions, are essential for genetic diversity. These rearrangements …

MSA transformer

RM Rao, J Liu, R Verkuil, J Meier… - International …, 2021 - proceedings.mlr.press
Unsupervised protein language models trained across millions of diverse sequences learn
structure and function of proteins. Protein language models studied to date have been …

Structural mechanism of bridge RNA-guided recombination

M Hiraizumi, NT Perry, MG Durrant, T Soma… - Nature, 2024 - nature.com
Insertion sequence (IS) elements are the simplest autonomous transposable elements found
in prokaryotic genomes. We recently discovered that IS110 family elements encode a …

Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences

A Rives, J Meier, T Sercu, S Goyal, Z Lin, J Liu… - Proceedings of the …, 2021 - pnas.org
In the field of artificial intelligence, a combination of scale in data and model capacity
enabled by unsupervised learning has led to major advances in representation learning and …

Transformer protein language models are unsupervised structure learners

R Rao, J Meier, T Sercu, S Ovchinnikov, A Rives - Biorxiv, 2020 - biorxiv.org
Unsupervised contact prediction is central to uncovering physical, structural, and functional
constraints for protein structure determination and design. For decades, the predominant …

ECNet is an evolutionary context-integrated deep learning framework for protein engineering

Y Luo, G Jiang, T Yu, Y Liu, L Vo, H Ding, Y Su… - Nature …, 2021 - nature.com
Abstract Machine learning has been increasingly used for protein engineering. However,
because the general sequence contexts they capture are not specific to the protein being …