MeShClust: an intelligent tool for clustering DNA sequences

BT James, BB Luczak, HZ Girgis - Nucleic acids research, 2018 - academic.oup.com
Sequence clustering is a fundamental step in analyzing DNA sequences. Widely-used
software tools for sequence clustering utilize greedy approaches that are not guaranteed to …

Identity: rapid alignment-free prediction of sequence alignment identity scores using self-supervised general linear models

HZ Girgis, BT James, BB Luczak - NAR genomics and …, 2021 - academic.oup.com
Pairwise global alignment is a fundamental step in sequence analysis. Optimal alignment
algorithms are quadratic—slow especially on long sequences. In many applications that …

Look4TRs: a de novo tool for detecting simple tandem repeats using self-supervised hidden Markov models

A Velasco, BT James, VD Wells, HZ Girgis - Bioinformatics, 2020 - academic.oup.com
Motivation Simple tandem repeats, microsatellites in particular, have regulatory functions,
links to several diseases and applications in biotechnology. There is an immediate need for …

MeShClust2: Application of alignment-free identity scores in clustering long DNA sequences

BT James, HZ Girgis - BioRxiv, 2018 - biorxiv.org
Grou** sequences into similar clusters is an important part of sequence analysis. Widely
used clustering tools sacrifice quality for speed. Previously, we developed MeShClust, which …

On-line hierarchy of general linear models for selecting and ranking the best predicted protein structures

HZ Girgis, JJ Corso, D Fischer - 2009 Annual International …, 2009 - ieeexplore.ieee.org
To predict the three dimensional structure of proteins, many computational methods sample
the conformational space, generating a large number of candidate structures. Subsequently …

[KNJIGA][B] Machine-learning-based meta approaches to protein structure prediction

HZ Girgis - 2008 - search.proquest.com
The importance of knowing the three dimensional structure of proteins and the difficulty of
determining it experimentally, have led scientists to develop several computational methods …

FASTCAR: Rapid alignment-free prediction of sequence alignment identity scores using self-supervised general linear models

BT James, BB Luczak, HZ Girgis - BioRxiv, 2018 - biorxiv.org
Pairwise alignment has been the predominant algorithm in the field of bioinformatics since
its beginning. Several applications have been made in order to speed up this algorithm …

HebbPlot: an intelligent tool for learning and visualizing chromatin mark signatures

HZ Girgis, A Velasco, ZE Reyes - BMC bioinformatics, 2018 - Springer
Background Histone modifications play important roles in gene regulation, heredity,
imprinting, and many human diseases. The histone code is complex and consists of more …

[PDF][PDF] Look4TRs: A de-novo tool for detecting simple tandem repeats using self-supervised hidden Markov models

II Alfredo Velasco, BT James, VD Wells, HZ Girgis - researchgate.net
Simple tandem repeats, microsatellites in particular, have regulatory functions, links to
several diseases, and applications in biotechnology. Sequences of thousands of species …