A tutorial on open-source large language models for behavioral science

Z Hussain, M Binz, R Mata, DU Wulff - Behavior Research Methods, 2024 - Springer
Large language models (LLMs) have the potential to revolutionize behavioral science by
accelerating and improving the research cycle, from conceptualization to data analysis …

Antibiotic discovery in the artificial intelligence era

T Lluka, JM Stokes - Annals of the New York Academy of …, 2023 - Wiley Online Library
As the global burden of antibiotic resistance continues to grow, creative approaches to
antibiotic discovery are needed to accelerate the development of novel medicines. A rapidly …

Interpretable machine learning: Fundamental principles and 10 grand challenges

C Rudin, C Chen, Z Chen, H Huang… - Statistic …, 2022 - projecteuclid.org
Interpretability in machine learning (ML) is crucial for high stakes decisions and
troubleshooting. In this work, we provide fundamental principles for interpretable ML, and …

Uncovering new families and folds in the natural protein universe

J Durairaj, AM Waterhouse, T Mets, T Brodiazhenko… - Nature, 2023 - nature.com
We are now entering a new era in protein sequence and structure annotation, with hundreds
of millions of predicted protein structures made available through the AlphaFold database …

Exploring the whole rashomon set of sparse decision trees

R **n, C Zhong, Z Chen, T Takagi… - Advances in neural …, 2022 - proceedings.neurips.cc
In any given machine learning problem, there may be many models that could explain the
data almost equally well. However, most learning algorithms return only one of these …

Towards a comprehensive evaluation of dimension reduction methods for transcriptomic data visualization

H Huang, Y Wang, C Rudin, EP Browne - Communications biology, 2022 - nature.com
Dimension reduction (DR) algorithms project data from high dimensions to lower
dimensions to enable visualization of interesting high-dimensional structure. DR algorithms …

Using sequences of life-events to predict human lives

G Savcisens, T Eliassi-Rad, LK Hansen… - Nature Computational …, 2024 - nature.com
Here we represent human lives in a way that shares structural similarity to language, and we
exploit this similarity to adapt natural language processing techniques to examine the …

Network analysis reveals rare disease signatures across multiple levels of biological organization

P Buphamalai, T Kokotovic, V Nagy… - Nature communications, 2021 - nature.com
Rare genetic diseases are typically caused by a single gene defect. Despite this clear
causal relationship between genotype and phenotype, identifying the pathobiological …

Neoadjuvant durvalumab plus radiation versus durvalumab alone in stages I–III non-small cell lung cancer: survival outcomes and molecular correlates of a …

NK Altorki, ZH Walsh, JC Melms, JL Port… - Nature …, 2023 - nature.com
We previously reported the results of a randomized phase II trial (NCT02904954) in patients
with early-stage non-small cell lung cancer (NSCLC) who were treated with either two …

Theoretical foundations of t-sne for visualizing high-dimensional clustered data

TT Cai, R Ma - Journal of Machine Learning Research, 2022 - jmlr.org
This paper investigates the theoretical foundations of the t-distributed stochastic neighbor
embedding (t-SNE) algorithm, a popular nonlinear dimension reduction and data …