Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - arxiv preprint arxiv …, 2023‏ - arxiv.org
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

AI-powered therapeutic target discovery

FW Pun, IV Ozerov, A Zhavoronkov - Trends in pharmacological sciences, 2023‏ - cell.com
Disease modeling and target identification are the most crucial initial steps in drug
discovery, and influence the probability of success at every step of drug development …

The UCSC genome browser database: 2023 update

LR Nassar, GP Barber, A Benet-Pagès… - Nucleic acids …, 2023‏ - academic.oup.com
Abstract The UCSC Genome Browser (https://genome. ucsc. edu) is an omics data
consolidator, graphical viewer, and general bioinformatics resource that continues to serve …

A complete telomere-to-telomere assembly of the maize genome

J Chen, Z Wang, K Tan, W Huang, J Shi, T Li, J Hu… - Nature …, 2023‏ - nature.com
A complete telomere-to-telomere (T2T) finished genome has been the long pursuit of
genomic research. Through generating deep coverage ultralong Oxford Nanopore …

The complete sequence of a human Y chromosome

A Rhie, S Nurk, M Cechova, SJ Hoyt, DJ Taylor… - Nature, 2023‏ - nature.com
The human Y chromosome has been notoriously difficult to sequence and assemble
because of its complex repeat structure that includes long palindromes, tandem repeats and …

CADD v1. 7: using protein language models, regulatory CNNs and other nucleotide-level scores to improve genome-wide variant predictions

M Schubach, T Maass, L Nazaretyan… - Nucleic acids …, 2024‏ - academic.oup.com
Abstract Machine Learning-based scoring and classification of genetic variants aids the
assessment of clinical findings and is employed to prioritize variants in diverse genetic …

Method of the year: long-read sequencing

V Marx - Nature Methods, 2023‏ - nature.com
Method of the year: long-read sequencing | Nature Methods Skip to main content Thank you for
visiting nature.com. You are using a browser version with limited support for CSS. To obtain the …

Ensembl 2023

FJ Martin, MR Amode, A Aneja… - Nucleic acids …, 2023‏ - academic.oup.com
Abstract Ensembl (https://www. ensembl. org) has produced high-quality genomic resources
for vertebrates and model organisms for more than twenty years. During that time, our …

Database resources of the National Center for Biotechnology Information in 2023

EW Sayers, EE Bolton, JR Brister… - Nucleic acids …, 2022‏ - pmc.ncbi.nlm.nih.gov
Abstract The National Center for Biotechnology Information (NCBI) provides online
information resources for biology, including the GenBank® nucleic acid sequence database …

YaHS: yet another Hi-C scaffolding tool

C Zhou, SA McCarthy, R Durbin - Bioinformatics, 2023‏ - academic.oup.com
We present YaHS, a user-friendly command-line tool for the construction of chromosome-
scale scaffolds from Hi-C data. It can be run with a single-line command, requires minimal …