A guide to machine learning for biologists
The expanding scale and inherent complexity of biological data have encouraged a growing
use of machine learning in biology to build informative and predictive models of the …
use of machine learning in biology to build informative and predictive models of the …
[HTML][HTML] The human transcription factors
Transcription factors (TFs) recognize specific DNA sequences to control chromatin and
transcription, forming a complex system that guides expression of the genome. Despite keen …
transcription, forming a complex system that guides expression of the genome. Despite keen …
Effective gene expression prediction from sequence by integrating long-range interactions
How noncoding DNA determines gene expression in different cell types is a major unsolved
problem, and critical downstream applications in human genetics depend on improved …
problem, and critical downstream applications in human genetics depend on improved …
Obtaining genetics insights from deep learning via explainable artificial intelligence
Artificial intelligence (AI) models based on deep learning now represent the state of the art
for making functional predictions in genomics research. However, the underlying basis on …
for making functional predictions in genomics research. However, the underlying basis on …
Deep learning for healthcare: review, opportunities and challenges
Gaining knowledge and actionable insights from complex, high-dimensional and
heterogeneous biomedical data remains a key challenge in transforming health care …
heterogeneous biomedical data remains a key challenge in transforming health care …
Wilds: A benchmark of in-the-wild distribution shifts
Distribution shifts—where the training distribution differs from the test distribution—can
substantially degrade the accuracy of machine learning (ML) systems deployed in the wild …
substantially degrade the accuracy of machine learning (ML) systems deployed in the wild …
DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
Motivation Deciphering the language of non-coding DNA is one of the fundamental
problems in genome research. Gene regulatory code is highly complex due to the existence …
problems in genome research. Gene regulatory code is highly complex due to the existence …
Opportunities and obstacles for deep learning in biology and medicine
T Ching, DS Himmelstein… - Journal of the …, 2018 - royalsocietypublishing.org
Deep learning describes a class of machine learning algorithms that are capable of
combining raw inputs into layers of intermediate features. These algorithms have recently …
combining raw inputs into layers of intermediate features. These algorithms have recently …
Deep learning: new computational modelling techniques for genomics
As a data-driven science, genomics largely utilizes machine learning to capture
dependencies in data and derive novel biological hypotheses. However, the ability to extract …
dependencies in data and derive novel biological hypotheses. However, the ability to extract …
Hopfield networks is all you need
We introduce a modern Hopfield network with continuous states and a corresponding
update rule. The new Hopfield network can store exponentially (with the dimension of the …
update rule. The new Hopfield network can store exponentially (with the dimension of the …