Redpajama: an open dataset for training large language models
M Weber, D Fu, Q Anthony, Y Oren… - Advances in …, 2025 - proceedings.neurips.cc
Large language models are increasingly becoming a cornerstone technology in artificial
intelligence, the sciences, and society as a whole, yet the optimal strategies for dataset …
intelligence, the sciences, and society as a whole, yet the optimal strategies for dataset …
Searching for best practices in retrieval-augmented generation
Retrieval-augmented generation (RAG) techniques have proven to be effective in integrating
up-to-date information, mitigating hallucinations, and enhancing response quality …
up-to-date information, mitigating hallucinations, and enhancing response quality …
Rcagent: Cloud root cause analysis by autonomous agents with tool-augmented large language models
Large language model (LLM) applications in cloud root cause analysis (RCA) have been
actively explored recently. However, current methods are still reliant on manual workflow …
actively explored recently. However, current methods are still reliant on manual workflow …
Algorithmic management in the gig economy: A systematic review and research integration
I Kadolkar, S Kepes… - Journal of Organizational …, 2024 - Wiley Online Library
Rapid growth in the gig economy has been facilitated by the increased use of algorithmic
management (AM) in online platforms (OPs) coordinating gig work. There has been a …
management (AM) in online platforms (OPs) coordinating gig work. There has been a …
Bright: A realistic and challenging benchmark for reasoning-intensive retrieval
Existing retrieval benchmarks primarily consist of information-seeking queries (eg,
aggregated questions from search engines) where keyword or semantic-based retrieval is …
aggregated questions from search engines) where keyword or semantic-based retrieval is …
Gecko: Versatile text embeddings distilled from large language models
We present Gecko, a compact and versatile text embedding model. Gecko achieves strong
retrieval performance by leveraging a key idea: distilling knowledge from large language …
retrieval performance by leveraging a key idea: distilling knowledge from large language …
Making text embedders few-shot learners
Large language models (LLMs) with decoder-only architectures demonstrate remarkable in-
context learning (ICL) capabilities. This feature enables them to effectively handle both …
context learning (ICL) capabilities. This feature enables them to effectively handle both …
Llms4ol 2024 overview: The 1st large language models for ontology learning challenge
This paper outlines the LLMs4OL 2024, the first edition of the Large Language Models for
Ontology Learning Challenge. LLMs4OL is a community development initiative collocated …
Ontology Learning Challenge. LLMs4OL is a community development initiative collocated …
Simple is effective: The roles of graphs and large language models in knowledge-graph-based retrieval-augmented generation
Large Language Models (LLMs) demonstrate strong reasoning abilities but face limitations
such as hallucinations and outdated knowledge. Knowledge Graph (KG)-based Retrieval …
such as hallucinations and outdated knowledge. Knowledge Graph (KG)-based Retrieval …
Weblinx: Real-world website navigation with multi-turn dialogue
We propose the problem of conversational web navigation, where a digital agent controls a
web browser and follows user instructions to solve real-world tasks in a multi-turn dialogue …
web browser and follows user instructions to solve real-world tasks in a multi-turn dialogue …