Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
The statistical complexity of interactive decision making
A fundamental challenge in interactive learning and decision making, ranging from bandit
problems to reinforcement learning, is to provide sample-efficient, adaptive learning …
problems to reinforcement learning, is to provide sample-efficient, adaptive learning …
Instance-dependent complexity of contextual bandits and reinforcement learning: A disagreement-based perspective
In the classical multi-armed bandit problem, instance-dependent algorithms attain improved
performance on" easy" problems with a gap between the best and second-best arm. Are …
performance on" easy" problems with a gap between the best and second-best arm. Are …
Streaming active learning with deep neural networks
Active learning is perhaps most naturally posed as an online learning problem. However,
prior active learning approaches with deep neural networks assume offline access to the …
prior active learning approaches with deep neural networks assume offline access to the …
Contextual bandits and imitation learning with preference-based active queries
We consider the problem of contextual bandits and imitation learning, where the learner
lacks direct knowledge of the executed action's reward. Instead, the learner can actively …
lacks direct knowledge of the executed action's reward. Instead, the learner can actively …
Making rl with preference-based feedback efficient via randomization
Reinforcement Learning algorithms that learn from human feedback (RLHF) need to be
efficient in terms of statistical complexity, computational complexity, and query complexity. In …
efficient in terms of statistical complexity, computational complexity, and query complexity. In …
Reinforcement learning from human feedback with active queries
Aligning large language models (LLM) with human preference plays a key role in building
modern generative models and can be achieved by reinforcement learning from human …
modern generative models and can be achieved by reinforcement learning from human …
Recent advances in scaling‐down sampling methods in machine learning
Data sampling methods have been investigated for decades in the context of machine
learning and statistical algorithms, with significant progress made in the past few years …
learning and statistical algorithms, with significant progress made in the past few years …
Active learning for cost-sensitive classification
We design an active learning algorithm for cost-sensitive multiclass classification: problems
where different errors have different costs. Our algorithm, COAL, makes predictions by …
where different errors have different costs. Our algorithm, COAL, makes predictions by …
Towards a unified information-theoretic framework for generalization
In this work, we investigate the expressiveness of the" conditional mutual information"(CMI)
framework of Steinke and Zakynthinou (2020) and the prospect of using it to provide a …
framework of Steinke and Zakynthinou (2020) and the prospect of using it to provide a …
Adversarially robust learning: A generic minimax optimal learner and characterization
We present a minimax optimal learner for the problem of learning predictors robust to
adversarial examples at test-time. Interestingly, we find that this requires new algorithmic …
adversarial examples at test-time. Interestingly, we find that this requires new algorithmic …