Normalization techniques in training dnns: Methodology, analysis and application

L Huang, J Qin, Y Zhou, F Zhu, L Liu… - IEEE transactions on …, 2023 - ieeexplore.ieee.org
Normalization techniques are essential for accelerating the training and improving the
generalization of deep neural networks (DNNs), and have successfully been used in various …

[PDF][PDF] Linear dimensionality reduction: Survey, insights, and generalizations

JP Cunningham, Z Ghahramani - The Journal of Machine Learning …, 2015 - jmlr.org
Linear dimensionality reduction methods are a cornerstone of analyzing high dimensional
data, due to their simple geometric interpretations and typically attractive computational …

Orthogonal weight normalization: Solution to optimization over multiple dependent stiefel manifolds in deep neural networks

L Huang, X Liu, B Lang, A Yu, Y Wang… - Proceedings of the AAAI …, 2018 - ojs.aaai.org
Orthogonal matrix has shown advantages in training Recurrent Neural Networks (RNNs),
but such matrix is limited to be square for the hidden-to-hidden transformation in RNNs. In …

A generalized power iteration method for solving quadratic problem on the stiefel manifold

F Nie, R Zhang, X Li - Science China Information Sciences, 2017 - Springer
In this paper, we first propose a novel generalized power iteration (GPI) method to solve the
quadratic problem on the Stiefel manifold (QPSM) as min_ W^ TW= I min WTW= I Tr (WT …

Quadratic optimization with orthogonality constraint: explicit Łojasiewicz exponent and linear convergence of retraction-based line-search and stochastic variance …

H Liu, AMC So, W Wu - Mathematical Programming, 2019 - Springer
The problem of optimizing a quadratic form over an orthogonality constraint (QP-OC for
short) is one of the most fundamental matrix optimization problems and arises in many …

Statistics on the Stiefel manifold: Theory and applications

R Chakraborty, BC Vemuri - 2019 - projecteuclid.org
A Stiefel manifold of the compact type is often encountered in many fields of engineering
including, signal and image processing, machine learning, numerical optimization and …

High-dimensional Kuramoto models on Stiefel manifolds synchronize complex networks almost globally

J Markdahl, J Thunberg, J Goncalves - Automatica, 2020 - Elsevier
The Kuramoto model of coupled phase oscillators is often used to describe synchronization
phenomena in nature. Some applications, eg, quantum synchronization and rigid-body …

Manifold calculus in system theory and control—Fundamentals and first-order systems

S Fiori - Symmetry, 2021 - mdpi.com
The aim of the present tutorial paper is to recall notions from manifold calculus and to
illustrate how these tools prove useful in describing system-theoretic properties. Special …

Riemannian conjugate gradient methods with inverse retraction

X Zhu, H Sato - Computational Optimization and Applications, 2020 - Springer
We propose a new class of Riemannian conjugate gradient (CG) methods, in which inverse
retraction is used instead of vector transport for search direction construction. In existing …

Computing fundamental matrix decompositions accurately via the matrix sign function in two iterations: The power of Zolotarev's functions

Y Nakatsukasa, RW Freund - siam REVIEW, 2016 - SIAM
The symmetric eigenvalue decomposition and the singular value decomposition (SVD) are
fundamental matrix decompositions with many applications. Conventional algorithms for …