A survey on addressing high-class imbalance in big data

JL Leevy, TM Khoshgoftaar, RA Bauder, N Seliya - Journal of Big Data, 2018 - Springer
In a majority–minority classification problem, class imbalance in the dataset(s) can
dramatically skew the performance of classifiers, introducing a prediction bias for the …

Phylogenetic tree building in the genomic age

P Kapli, Z Yang, MJ Telford - Nature Reviews Genetics, 2020 - nature.com
Knowing phylogenetic relationships among species is fundamental for many studies in
biology. An accurate phylogenetic tree underpins our understanding of the major transitions …

Protein complex prediction with AlphaFold-Multimer

R Evans, M O'Neill, A Pritzel, N Antropova, A Senior… - bioRxiv, 2021 - biorxiv.org
While the vast majority of well-structured single protein chains can now be predicted to high
accuracy due to the recent AlphaFold model, the prediction of multi-chain protein complexes …

Computed structures of core eukaryotic protein complexes

IR Humphreys, J Pei, M Baek, A Krishnakumar… - Science, 2021 - science.org
INTRODUCTION: Protein-protein interactions play critical roles in biology, but the structures
of many eukaryotic protein complexes are unknown, and there are likely many interactions …

OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy

DM Emms, S Kelly - Genome Biology, 2015 - Springer
Identifying homology relationships between sequences is fundamental to biological
research. Here we provide a novel orthogroup inference algorithm called OrthoFinder that …

Genomic Analysis of the Necrotrophic Fungal Pathogens Sclerotinia sclerotiorum and Botrytis cinerea

J Amselem, CA Cuomo, JAL van Kan, M Viaud… - PLoS …, 2011 - journals.plos.org
Sclerotinia sclerotiorum and Botrytis cinerea are closely related necrotrophic plant
pathogenic fungi notable for their wide host ranges and environmental persistence. These …

Proteinortho: Detection of (Co-)orthologs in large-scale analysis

M Lechner, S Findeiß, L Steiner, M Marz, PF Stadler… - BMC …, 2011 - Springer
Background: Orthology analysis is an important part of data analysis in many areas of
bioinformatics such as comparative genomics and molecular phylogenetics. The ever …

Protein interaction networks revealed by proteome coevolution

Q Cong, I Anishchenko, S Ovchinnikov, D Baker - Science, 2019 - science.org
Residue-residue coevolution has been observed across a number of protein-protein
interfaces, but the extent of residue coevolution between protein families on the whole …

Protein interactions in human pathogens revealed through deep learning

IR Humphreys, J Zhang, M Baek, Y Wang… - Nature …, 2024 - nature.com
Identification of bacterial protein–protein interactions and predicting the structures of these
complexes could aid in the understanding of pathogenicity mechanisms and developing …

Why highly expressed proteins evolve slowly

DA Drummond, JD Bloom, C Adami… - Proceedings of the …, 2005 - National Acad Sciences
Much recent work has explored molecular and population-genetic constraints on the rate of
protein sequence evolution. The best predictor of evolutionary rate is expression level, for …