Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Quantum machine learning: a classical perspective

C Ciliberto, M Herbster, AD Ialongo… - … of the Royal …, 2018 - royalsocietypublishing.org
Recently, increased computational power and data availability, as well as algorithmic
advances, have led machine learning (ML) techniques to impressive results in regression …

Cross-entropy loss functions: Theoretical analysis and applications

A Mao, M Mohri, Y Zhong - International conference on …, 2023 - proceedings.mlr.press
Cross-entropy is a widely used loss function in applications. It coincides with the logistic loss
applied to the outputs of a neural network, when the softmax is used. But, what guarantees …

A survey of preference-based reinforcement learning methods

C Wirth, R Akrour, G Neumann, J Fürnkranz - Journal of Machine Learning …, 2017 - jmlr.org
Reinforcement learning (RL) techniques optimize the accumulated long-term reward of a
suitably chosen reward function. However, designing such a reward function often requires …

A theoretical analysis of NDCG type ranking measures

Y Wang, L Wang, Y Li, D He… - Conference on learning …, 2013 - proceedings.mlr.press
Ranking has been extensively studied in information retrieval, machine learning and
statistics. A central problem in ranking is to design a ranking measure for evaluation of …

Iterative ranking from pair-wise comparisons

S Negahban, S Oh, D Shah - Advances in neural …, 2012 - proceedings.neurips.cc
The question of aggregating pairwise comparisons to obtain a global ranking over a
collection of objects has been of interest for a very long time: be it ranking of online gamers …

-Consistency Bounds for Pairwise Misranking Loss Surrogates

A Mao, M Mohri, Y Zhong - International conference on …, 2023 - proceedings.mlr.press
We present a detailed study of $ H $-consistency bounds for score-based ranking. These
are upper bounds on the target loss estimation error of a predictor in a hypothesis set $ H …

Learning with fenchel-young losses

M Blondel, AFT Martins, V Niculae - Journal of Machine Learning Research, 2020 - jmlr.org
Over the past decades, numerous loss functions have been been proposed for a variety of
supervised learning tasks, including regression, classification, ranking, and more generally …

Ranking with abstention

A Mao, M Mohri, Y Zhong - arxiv preprint arxiv:2307.02035, 2023 - arxiv.org
We introduce a novel framework of ranking with abstention, where the learner can abstain
from making prediction at some limited cost $ c $. We present a extensive theoretical …

A statistical convergence perspective of algorithms for rank aggregation from pairwise data

A Rajkumar, S Agarwal - International conference on …, 2014 - proceedings.mlr.press
There has been much interest recently in the problem of rank aggregation from pairwise
data. A natural question that arises is: under what sorts of statistical assumptions do various …