[PDF][PDF] Open-environment machine learning

ZH Zhou - National Science Review, 2022 - academic.oup.com
Conventional machine learning studies generally assume close-environment scenarios
where important factors of the learning process hold invariant. With the great success of …

Dynamic regret of convex and smooth functions

P Zhao, YJ Zhang, L Zhang… - Advances in Neural …, 2020 - proceedings.neurips.cc
We investigate online convex optimization in non-stationary environments and choose the
dynamic regret as the performance measure, defined as the difference between cumulative …

A simple approach for non-stationary linear bandits

P Zhao, L Zhang, Y Jiang… - … Conference on Artificial …, 2020 - proceedings.mlr.press
This paper investigates the problem of non-stationary linear bandits, where the unknown
regression parameter is evolving over time. Previous studies have adopted sophisticated …

Adaptivity and non-stationarity: Problem-dependent dynamic regret for online convex optimization

P Zhao, YJ Zhang, L Zhang, ZH Zhou - Journal of Machine Learning …, 2024 - jmlr.org
We investigate online convex optimization in non-stationary environments and choose
dynamic regret as the performance measure, defined as the difference between cumulative …

Learning with feature evolvable streams

BJ Hou, L Zhang, ZH Zhou - Advances in Neural …, 2017 - proceedings.neurips.cc
Learning with streaming data has attracted much attention during the past few years.
Though most studies consider data stream with fixed features, in real practice the features …

Non-stationary online learning with memory and non-stochastic control

P Zhao, YH Yan, YX Wang, ZH Zhou - Journal of Machine Learning …, 2023 - jmlr.org
We study the problem of Online Convex Optimization (OCO) with memory, which allows loss
functions to depend on past decisions and thus captures temporal effects of learning …

Handling concept drift via model reuse

P Zhao, LW Cai, ZH Zhou - Machine learning, 2020 - Springer
In many real-world applications, data are often collected in the form of a stream, and thus the
distribution usually changes in nature, which is referred to as concept drift in the literature …

Learning with feature and distribution evolvable streams

ZY Zhang, P Zhao, Y Jiang… - … Conference on Machine …, 2020 - proceedings.mlr.press
In many real-world applications, data are collected in the form of a stream, whose feature
space can evolve over time. For instance, in the environmental monitoring task, features can …

Bandit convex optimization in non-stationary environments

P Zhao, G Wang, L Zhang, ZH Zhou - Journal of Machine Learning …, 2021 - jmlr.org
Bandit Convex Optimization (BCO) is a fundamental framework for modeling sequential
decision-making with partial information, where the only feedback available to the player is …

A deep deterministic policy gradient approach for vehicle speed tracking control with a robotic driver

G Hao, Z Fu, X Feng, Z Gong, P Chen… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
In performance tests, replacing humans with robotic drivers has many advantages, such as
high efficiency and high security. To realize the vehicle speed tracking control with a robotic …