Chatbot arena: An open platform for evaluating llms by human preference

WL Chiang, L Zheng, Y Sheng… - … on Machine Learning, 2024 - openreview.net
Large Language Models (LLMs) have unlocked new capabilities and applications; however,
evaluating the alignment with human preferences still poses significant challenges. To …

A data quality-driven view of mlops

C Renggli, L Rimanic, NM Gürel, B Karlaš… - ar** machine learning models can be seen as a process similar to the one
established for traditional software development. A key difference between the two lies in the …

Which llm to play? convergence-aware online model selection with time-increasing bandits

Y **a, F Kong, T Yu, L Guo, RA Rossi, S Kim… - Proceedings of the ACM …, 2024 - dl.acm.org
Web-based applications such as chatbots, search engines and news recommendations
continue to grow in scale and complexity with the recent surge in the adoption of large …

Equi-vocal: Synthesizing queries for compositional video events from limited user interactions

E Zhang, M Daum, D He, B Haynes, R Krishna… - Proceedings of the …, 2023 - dl.acm.org
We introduce EQUI-VOCAL: a new system that automatically synthesizes queries over
videos from limited user interactions. The user only provides a handful of positive and …

Anytime model selection in linear bandits

P Kassraie, N Emmenegger… - Advances in Neural …, 2023 - proceedings.neurips.cc
Abstract Model selection in the context of bandit optimization is a challenging problem, as it
requires balancing exploration and exploitation not only for action selection, but also for …

On the Necessity of Collaboration for Online Model Selection with Decentralized Data

J Li, Z Wu, Z Xu, I King - arxiv preprint arxiv:2404.09494, 2024 - arxiv.org
We consider online model selection with decentralized data over $ M $ clients, and study the
necessity of collaboration among clients. Previous work proposed various federated …

Personalized federated learning with mixture of models for adaptive prediction and model fine-tuning

PM Ghari, Y Shen - arxiv preprint arxiv:2410.21547, 2024 - arxiv.org
Federated learning is renowned for its efficacy in distributed model training, ensuring that
users, called clients, retain data privacy by not disclosing their data to the central server that …

Convergence-aware online model selection with time-increasing bandits

Y **a, F Kong, T Yu, L Guo, RA Rossi… - The Web Conference …, 2024 - openreview.net
Web-based applications such as chatbots, search engines and news recommendations
continue to grow in scale and complexity with the recent surge in the adoption of large …

Bayesian Online Learning for Consensus Prediction

S Showalter, AJ Boyd, P Smyth… - International …, 2024 - proceedings.mlr.press
Given a pre-trained classifier and multiple human experts, we investigate the task of online
classification where model predictions are provided for free but querying humans incurs a …

Online foundation model selection in robotics

P Li, OS Toprak, A Narayanan, U Topcu… - arxiv preprint arxiv …, 2024 - arxiv.org
Foundation models have recently expanded into robotics after excelling in computer vision
and natural language processing. The models are accessible in two ways: open-source or …