Toward performing image classification and object detection with convolutional neural networks in autonomous driving systems: A survey

T Turay, T Vladimirova - IEEE Access, 2022 - ieeexplore.ieee.org
Nowadays Convolutional Neural Networks (CNNs) are being employed in a wide range of
industrial technologies for a variety of sectors, such as medical, automotive, aviation …

A survey of optimization methods from a machine learning perspective

S Sun, Z Cao, H Zhu, J Zhao - IEEE transactions on cybernetics, 2019 - ieeexplore.ieee.org
Machine learning develops rapidly, which has made many theoretical breakthroughs and is
widely applied in various fields. Optimization, as an important part of machine learning, has …

Re-parameterizing your optimizers rather than architectures

X Ding, H Chen, X Zhang, K Huang, J Han… - arxiv preprint arxiv …, 2022 - arxiv.org
The well-designed structures in neural networks reflect the prior knowledge incorporated
into the models. However, though different models have various priors, we are used to …

Wasserstein robust reinforcement learning

MA Abdullah, H Ren, HB Ammar, V Milenkovic… - arxiv preprint arxiv …, 2019 - arxiv.org
Reinforcement learning algorithms, though successful, tend to over-fit to training
environments hampering their application to the real-world. This paper proposes $\text …

Importance sampling techniques for policy optimization

AM Metelli, M Papini, N Montali, M Restelli - Journal of Machine Learning …, 2020 - jmlr.org
How can we effectively exploit the collected samples when solving a continuous control task
with Reinforcement Learning? Recent results have empirically demonstrated that multiple …

An off-policy trust region policy optimization method with monotonic improvement guarantee for deep reinforcement learning

W Meng, Q Zheng, Y Shi, G Pan - IEEE Transactions on Neural …, 2021 - ieeexplore.ieee.org
In deep reinforcement learning, off-policy data help reduce on-policy interaction with the
environment, and the trust region policy optimization (TRPO) method is efficient to stabilize …

Parameter optimization for point clouds denoising based on no-reference quality assessment

C Qu, Y Zhang, F Ma, K Huang - Measurement, 2023 - Elsevier
Almost all point clouds denoising methods contain various parameters, which need to be set
carefully to acquire desired results. In this paper, we introduce an evolutionary optimization …

Smoothing policies and safe policy gradients

M Papini, M Pirotta, M Restelli - Machine Learning, 2022 - Springer
Policy gradient (PG) algorithms are among the best candidates for the much-anticipated
applications of reinforcement learning to real-world control tasks, such as robotics. However …

Robust Black-Box Optimization for Stochastic Search and Episodic Reinforcement Learning

M Hüttenrauch, G Neumann - Journal of Machine Learning Research, 2024 - jmlr.org
Black-box optimization is a versatile approach to solve complex problems where the
objective function is not explicitly known and no higher order information is available. Due to …

Guided soft actor critic: A guided deep reinforcement learning approach for partially observable Markov decision processes

M Haklidir, H Temeltaş - IEEE Access, 2021 - ieeexplore.ieee.org
Most real-world problems are essentially partially observable, and the environmental model
is unknown. Therefore, there is a significant need for reinforcement learning approaches to …