A guide to machine learning for biologists

JG Greener, SM Kandathil, L Moffat… - Nature reviews Molecular …, 2022 - nature.com
The expanding scale and inherent complexity of biological data have encouraged a growing
use of machine learning in biology to build informative and predictive models of the …

[HTML][HTML] The human transcription factors

SA Lambert, A Jolma, LF Campitelli, PK Das, Y Yin… - Cell, 2018 - cell.com
Transcription factors (TFs) recognize specific DNA sequences to control chromatin and
transcription, forming a complex system that guides expression of the genome. Despite keen …

Effective gene expression prediction from sequence by integrating long-range interactions

Ž Avsec, V Agarwal, D Visentin, JR Ledsam… - Nature …, 2021 - nature.com
How noncoding DNA determines gene expression in different cell types is a major unsolved
problem, and critical downstream applications in human genetics depend on improved …

Obtaining genetics insights from deep learning via explainable artificial intelligence

G Novakovsky, N Dexter, MW Libbrecht… - Nature Reviews …, 2023 - nature.com
Artificial intelligence (AI) models based on deep learning now represent the state of the art
for making functional predictions in genomics research. However, the underlying basis on …

Deep learning for healthcare: review, opportunities and challenges

R Miotto, F Wang, S Wang, X Jiang… - Briefings in …, 2018 - academic.oup.com
Gaining knowledge and actionable insights from complex, high-dimensional and
heterogeneous biomedical data remains a key challenge in transforming health care …

Wilds: A benchmark of in-the-wild distribution shifts

PW Koh, S Sagawa, H Marklund… - International …, 2021 - proceedings.mlr.press
Distribution shifts—where the training distribution differs from the test distribution—can
substantially degrade the accuracy of machine learning (ML) systems deployed in the wild …

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome

Y Ji, Z Zhou, H Liu, RV Davuluri - Bioinformatics, 2021 - academic.oup.com
Motivation Deciphering the language of non-coding DNA is one of the fundamental
problems in genome research. Gene regulatory code is highly complex due to the existence …

Opportunities and obstacles for deep learning in biology and medicine

T Ching, DS Himmelstein… - Journal of the …, 2018 - royalsocietypublishing.org
Deep learning describes a class of machine learning algorithms that are capable of
combining raw inputs into layers of intermediate features. These algorithms have recently …

Deep learning: new computational modelling techniques for genomics

G Eraslan, Ž Avsec, J Gagneur, FJ Theis - Nature Reviews Genetics, 2019 - nature.com
As a data-driven science, genomics largely utilizes machine learning to capture
dependencies in data and derive novel biological hypotheses. However, the ability to extract …

Hopfield networks is all you need

H Ramsauer, B Schäfl, J Lehner, P Seidl… - arxiv preprint arxiv …, 2020 - arxiv.org
We introduce a modern Hopfield network with continuous states and a corresponding
update rule. The new Hopfield network can store exponentially (with the dimension of the …