A guide to machine learning for biologists

JG Greener, SM Kandathil, L Moffat… - Nature reviews Molecular …, 2022 - nature.com
The expanding scale and inherent complexity of biological data have encouraged a growing
use of machine learning in biology to build informative and predictive models of the …

[HTML][HTML] Deep learning methods in protein structure prediction

M Torrisi, G Pollastri, Q Le - Computational and Structural Biotechnology …, 2020 - Elsevier
Abstract Protein Structure Prediction is a central topic in Structural Bioinformatics. Since
the'60s statistical methods, followed by increasingly complex Machine Learning and recently …

Prottrans: Toward understanding the language of life through self-supervised learning

A Elnaggar, M Heinzinger, C Dallago… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Computational biology and bioinformatics provide vast data gold-mines from protein
sequences, ideal for Language Models (LMs) taken from Natural Language Processing …

Modeling aspects of the language of life through transfer-learning protein sequences

M Heinzinger, A Elnaggar, Y Wang, C Dallago… - BMC …, 2019 - Springer
Background Predicting protein function and structure from sequence is one important
challenge for computational biology. For 26 years, most state-of-the-art approaches …

NetSurfP‐2.0: Improved prediction of protein structural features by integrated deep learning

MS Klausen, MC Jespersen, H Nielsen… - Proteins: Structure …, 2019 - Wiley Online Library
The ability to predict local structural features of a protein from the primary sequence is of
paramount importance for unraveling its function in absence of experimental structural …

iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization

Z Chen, P Zhao, C Li, F Li, D **ang… - Nucleic acids …, 2021 - academic.oup.com
Sequence-based analysis and prediction are fundamental bioinformatic tasks that facilitate
understanding of the sequence (-structure)-function paradigm for DNAs, RNAs and proteins …

Structural basis of ER-associated protein degradation mediated by the Hrd1 ubiquitin ligase complex

X Wu, M Siggel, S Ovchinnikov, W Mi, V Svetlov… - Science, 2020 - science.org
INTRODUCTION Protein homeostasis in the endoplasmic reticulum (ER) is maintained by a
quality control system. When a newly synthesized ER protein misfolds, it is ultimately …

Deep learning in proteomics

B Wen, WF Zeng, Y Liao, Z Shi, SR Savage… - …, 2020 - Wiley Online Library
Proteomics, the study of all the proteins in biological systems, is becoming a data‐rich
science. Protein sequences and structures are comprehensively catalogued in online …

Explainable deep hypergraph learning modeling the peptide secondary structure prediction

Y Jiang, R Wang, J Feng, J **, S Liang, Z Li… - Advanced …, 2023 - Wiley Online Library
Accurately predicting peptide secondary structures remains a challenging task due to the
lack of discriminative information in short peptides. In this study, PHAT is proposed, a deep …

Deep learning extends de novo protein modelling coverage of genomes using iteratively predicted structural constraints

JG Greener, SM Kandathil, DT Jones - Nature communications, 2019 - nature.com
The inapplicability of amino acid covariation methods to small protein families has limited
their use for structural annotation of whole genomes. Recently, deep learning has shown …