RedPajama: an open dataset for training large language models
Large language models are increasingly becoming a cornerstone technology in artificial
intelligence, the sciences, and society as a whole, yet the optimal strategies for dataset …
Foundation models for music: A survey
In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …
OpenCoder: The open cookbook for top-tier code large language models
Large language models (LLMs) for code have become indispensable in various domains,
including code generation, reasoning tasks and agent systems. While open-access code …
VisRAG: Vision-based retrieval-augmented generation on multi-modality documents
Retrieval-augmented generation (RAG) is an effective technique that enables large
language models (LLMs) to utilize external knowledge sources for generation. However …
HelloBench: Evaluating long text generation capabilities of large language models
In recent years, Large Language Models (LLMs) have demonstrated remarkable capabilities
in various tasks (e.g., long-context understanding), and many benchmarks have been …
Baichuan-Omni technical report
Y Li, H Sun, M Lin, T Li, G Dong, T Zhang… - arxiv preprint arxiv …, 2024 - researchgate.net
The salient multimodal capabilities and interactive experience of GPT-4o highlight its critical
role in practical applications, yet it lacks a high-performing open-source counterpart. In this …
DDK: Distilling domain knowledge for efficient large language models
Despite the advanced intelligence abilities of large language models (LLMs) in various
applications, they still face significant computational and storage demands. Knowledge …
TableBench: A comprehensive and complex benchmark for table question answering
Recent advancements in Large Language Models (LLMs) have markedly enhanced the
interpretation and processing of tabular data, introducing previously unimaginable …
2 OLMo 2 Furious
We present OLMo 2, the next generation of our fully open language models. OLMo 2
includes dense autoregressive models with improved architecture and training recipe …
RoleAgent: Building, Interacting, and Benchmarking High-quality Role-Playing Agents from Scripts
J Liu, Z Ni, H Que, N Wang, J Yang… - Advances in …, 2025 - proceedings.neurips.cc
Believable agents can empower interactive applications ranging from immersive
environments to rehearsal spaces for interpersonal communication. Recently, generative …