A guide to machine learning for biologists

JG Greener, SM Kandathil, L Moffat… - Nature reviews Molecular …, 2022 - nature.com
The expanding scale and inherent complexity of biological data have encouraged a growing
use of machine learning in biology to build informative and predictive models of the …

Accurate prediction of protein structures and interactions using a three-track neural network

M Baek, F DiMaio, I Anishchenko, J Dauparas… - Science, 2021 - science.org
DeepMind presented notably accurate predictions at the recent 14th Critical Assessment of
Structure Prediction (CASP14) conference. We explored network architectures that …

Novel machine learning approaches revolutionize protein knowledge

N Bordin, C Dallago, M Heinzinger, S Kim… - Trends in Biochemical …, 2023 - cell.com
Breakthrough methods in machine learning (ML), protein structure prediction, and novel
ultrafast structural aligners are revolutionizing structural biology. Obtaining accurate models …

Identification of mobile genetic elements with geNomad

AP Camargo, S Roux, F Schulz, M Babinski, Y Xu… - Nature …, 2024 - nature.com
Identifying and characterizing mobile genetic elements in sequencing data is essential for
understanding their diversity, ecology, biotechnological applications and impact on public …

RCSB Protein Data Bank (RCSB. org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from …

SK Burley, C Bhikadiya, C Bi, S Bittrich… - Nucleic acids …, 2023 - academic.oup.com
Abstract The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB
PDB), founding member of the Worldwide Protein Data Bank (wwPDB), is the US data center …

ProtGPT2 is a deep unsupervised language model for protein design

N Ferruz, S Schmidt, B Höcker - Nature communications, 2022 - nature.com
Protein design aims to build novel proteins customized for specific purposes, thereby
holding the potential to tackle many environmental and biomedical problems. Recent …

The Pfam protein families database in 2019

S El-Gebali, J Mistry, A Bateman, SR Eddy… - Nucleic acids …, 2019 - academic.oup.com
The last few years have witnessed significant changes in Pfam (https://pfam. xfam. org). The
number of families has grown substantially to a total of 17,929 in release 32.0. New …

Protein sequence analysis using the MPI bioinformatics toolkit

F Gabler, SZ Nam, S Till, M Mirdita… - Current Protocols in …, 2020 - Wiley Online Library
Abstract The MPI Bioinformatics Toolkit (https://toolkit. tuebingen. mpg. de) provides
interactive access to a wide range of the best‐performing bioinformatics tools and …

A completely reimplemented MPI bioinformatics toolkit with a new HHpred server at its core

L Zimmermann, A Stephens, SZ Nam, D Rau… - Journal of molecular …, 2018 - Elsevier
Abstract The MPI Bioinformatics Toolkit (https://toolkit. tuebingen. mpg. de) is a free, one-stop
web service for protein bioinformatic analysis. It currently offers 34 interconnected external …

Clustering predicted structures at the scale of the known protein universe

I Barrio-Hernandez, J Yeo, J Jänes, M Mirdita… - Nature, 2023 - nature.com
Proteins are key to all cellular processes and their structure is important in understanding
their function and evolution. Sequence-based predictions of protein structures have …