A tutorial on open-source large language models for behavioral science
Large language models (LLMs) have the potential to revolutionize behavioral science by
accelerating and improving the research cycle, from conceptualization to data analysis …
accelerating and improving the research cycle, from conceptualization to data analysis …
Antibiotic discovery in the artificial intelligence era
T Lluka, JM Stokes - Annals of the New York Academy of …, 2023 - Wiley Online Library
As the global burden of antibiotic resistance continues to grow, creative approaches to
antibiotic discovery are needed to accelerate the development of novel medicines. A rapidly …
antibiotic discovery are needed to accelerate the development of novel medicines. A rapidly …
Interpretable machine learning: Fundamental principles and 10 grand challenges
Interpretability in machine learning (ML) is crucial for high stakes decisions and
troubleshooting. In this work, we provide fundamental principles for interpretable ML, and …
troubleshooting. In this work, we provide fundamental principles for interpretable ML, and …
Uncovering new families and folds in the natural protein universe
We are now entering a new era in protein sequence and structure annotation, with hundreds
of millions of predicted protein structures made available through the AlphaFold database …
of millions of predicted protein structures made available through the AlphaFold database …
Exploring the whole rashomon set of sparse decision trees
In any given machine learning problem, there may be many models that could explain the
data almost equally well. However, most learning algorithms return only one of these …
data almost equally well. However, most learning algorithms return only one of these …
Towards a comprehensive evaluation of dimension reduction methods for transcriptomic data visualization
Dimension reduction (DR) algorithms project data from high dimensions to lower
dimensions to enable visualization of interesting high-dimensional structure. DR algorithms …
dimensions to enable visualization of interesting high-dimensional structure. DR algorithms …
Using sequences of life-events to predict human lives
Here we represent human lives in a way that shares structural similarity to language, and we
exploit this similarity to adapt natural language processing techniques to examine the …
exploit this similarity to adapt natural language processing techniques to examine the …
Network analysis reveals rare disease signatures across multiple levels of biological organization
Rare genetic diseases are typically caused by a single gene defect. Despite this clear
causal relationship between genotype and phenotype, identifying the pathobiological …
causal relationship between genotype and phenotype, identifying the pathobiological …
Neoadjuvant durvalumab plus radiation versus durvalumab alone in stages I–III non-small cell lung cancer: survival outcomes and molecular correlates of a …
We previously reported the results of a randomized phase II trial (NCT02904954) in patients
with early-stage non-small cell lung cancer (NSCLC) who were treated with either two …
with early-stage non-small cell lung cancer (NSCLC) who were treated with either two …
Theoretical foundations of t-sne for visualizing high-dimensional clustered data
This paper investigates the theoretical foundations of the t-distributed stochastic neighbor
embedding (t-SNE) algorithm, a popular nonlinear dimension reduction and data …
embedding (t-SNE) algorithm, a popular nonlinear dimension reduction and data …