Judging llm-as-a-judge with mt-bench and chatbot arena

L Zheng, WL Chiang, Y Sheng… - Advances in …, 2023 - proceedings.neurips.cc
Evaluating large language model (LLM) based chat assistants is challenging due to their
broad capabilities and the inadequacy of existing benchmarks in measuring human …

Single-cell DNA methylome and 3D multi-omic atlas of the adult mouse brain

H Liu, Q Zeng, J Zhou, A Bartlett, BA Wang, P Berube… - Nature, 2023 - nature.com
Cytosine DNA methylation is essential in brain development and is implicated in various
neurological disorders. Understanding DNA methylation diversity across the entire brain in a …

The emergence of reproducibility and consistency in diffusion models

H Zhang, J Zhou, Y Lu, M Guo, P Wang… - Forty-first International …, 2024 - openreview.net
In this work, we investigate an intriguing and prevalent phenomenon of diffusion models
which we term as" consistent model reproducibility'': given the same starting noise input and …

Spotserve: Serving generative large language models on preemptible instances

X Miao, C Shi, J Duan, X **, D Lin, B Cui… - Proceedings of the 29th …, 2024 - dl.acm.org
The high computational and memory requirements of generative large language models
(LLMs) make it challenging to serve them cheaply. This paper aims to reduce the monetary …

On the limitations of carbon-aware temporal and spatial workload shifting in the cloud

T Sukprasert, A Souza, N Bashir, D Irwin… - Proceedings of the …, 2024 - dl.acm.org
Cloud platforms have been focusing on reducing their carbon emissions by shifting
workloads across time and locations to when and where low-carbon energy is available …

Caribou: Fine-grained geospatial shifting of serverless applications for sustainability

VU Gsteiger, PH Long, Y Sun, P Javanrood… - Proceedings of the …, 2024 - dl.acm.org
Sustainability in computing is critical as environmental concerns rise. The cloud industry's
carbon footprint is significant and rapidly growing. We show that dynamic geospatial shifting …

[HTML][HTML] The computing continuum: From IoT to the cloud

A Al-Dulaimy, M Jansen, B Johansson, A Trivedi… - Internet of Things, 2024 - Elsevier
In the era of the IoT revolution, applications are becoming ever more sophisticated and
accompanied by diverse functional and non-functional requirements, including those related …

A declarative system for optimizing ai workloads

C Liu, M Russo, M Cafarella, L Cao, PB Chen… - arxiv preprint arxiv …, 2024 - arxiv.org
A long-standing goal of data management systems has been to build systems which can
compute quantitative insights over large corpora of unstructured data in a cost-effective …

Speed: Speculative pipelined execution for efficient decoding

C Hooper, S Kim, H Mohammadzadeh, H Genc… - arxiv preprint arxiv …, 2023 - arxiv.org
Generative Large Language Models (LLMs) based on the Transformer architecture have
recently emerged as a dominant foundation model for a wide range of Natural Language …

Pecan:{Cost-Efficient}{ML} Data Preprocessing with Automatic Transformation Ordering and Hybrid Placement

D Graur, O Mraz, M Li, S Pourghannad… - 2024 USENIX Annual …, 2024 - usenix.org
Input data preprocessing is a common bottleneck in machine learning (ML) jobs, that can
significantly increase training time and cost as expensive GPUs or TPUs idle waiting for …