I-TASSER-MTD: a deep-learning-based platform for multi-domain protein structure and function prediction

X Zhou, W Zheng, Y Li, R Pearce, C Zhang, EW Bell… - Nature …, 2022 - nature.com
Most proteins in cells are composed of multiple folding units (or domains) to perform
complex functions in a cooperative manner. Relative to the rapid progress in single-domain …

OMA orthology in 2021: website overhaul, conserved isoforms, ancestral gene order and more

AM Altenhoff, CM Train, KJ Gilbert… - Nucleic acids …, 2021 - academic.oup.com
OMA is an established resource to elucidate evolutionary relationships among genes from
currently 2326 genomes covering all domains of life. OMA provides pairwise and groupwise …

InterPro in 2017—beyond protein family and domain annotations

RD Finn, TK Attwood, PC Babbitt… - Nucleic acids …, 2017 - academic.oup.com
Abstract InterPro (http://www. ebi. ac. uk/interpro/) is a freely available database used to
classify protein sequences into families and to predict the presence of important domains …

CATH: an expanded resource to predict protein function through structure and sequence

NL Dawson, TE Lewis, S Das, JG Lees… - Nucleic acids …, 2017 - academic.oup.com
The latest version of the CATH-Gene3D protein structure classification database has
recently been released (version 4.1, http://www. cathdb. info). The resource comprises over …

The OMA orthology database in 2018: retrieving evolutionary relationships among all domains of life through richer web and programmatic interfaces

AM Altenhoff, NM Glover, CM Train, K Kaleb… - Nucleic acids …, 2018 - academic.oup.com
Abstract The Orthologous Matrix (OMA) is a leading resource to relate genes across many
species from all of life. In this update paper, we review the recent algorithmic improvements …

Genomic basis of the giga-chromosomes and giga-genome of tree peony Paeonia ostii

J Yuan, S Jiang, J Jian, M Liu, Z Yue, J Xu, J Li… - Nature …, 2022 - nature.com
Tree peony (Paeonia ostii) is an economically important ornamental plant native to China. It
is also notable for its seed oil, which is abundant in unsaturated fatty acids such as α …

[HTML][HTML] An overview of comparative modelling and resources dedicated to large-scale modelling of genome sequences

SD Lam, S Das, I Sillitoe, C Orengo - Acta Crystallographica Section …, 2017 - scripts.iucr.org
Computational modelling of proteins has been a major catalyst in structural biology.
Bioinformatics groups have exploited the repositories of known structures to predict high …

Protein functional annotation of simultaneously improved stability, accuracy and false discovery rate achieved by a sequence-based deep learning

J Hong, Y Luo, Y Zhang, J Ying, W Xue… - Briefings in …, 2020 - academic.oup.com
Functional annotation of protein sequence with high accuracy has become one of the most
important issues in modern biomedical studies, and computational approaches of …

A comprehensive non-redundant gene catalog reveals extensive within-community intraspecies diversity in the human vagina

B Ma, MT France, J Crabtree, JB Holm… - Nature …, 2020 - nature.com
Abstract Analysis of metagenomic and metatranscriptomic data is complicated and typically
requires extensive computational resources. Leveraging a curated reference database of …

Gene3D: extensive prediction of globular domains in proteins

TE Lewis, I Sillitoe, N Dawson, SD Lam… - Nucleic acids …, 2018 - academic.oup.com
Abstract Gene3D (http://gene3d. biochem. ucl. ac. uk) is a database of globular domain
annotations for millions of available protein sequences. Gene3D has previously featured in …