Algorithmic regularization in tensor optimization: Towards a lifted approach in matrix sensing

Z Ma, J Lavaei, S Sojoudi - Advances in Neural Information …, 2023 - proceedings.neurips.cc
Gradient descent (GD) is crucial for generalization in machine learning models, as it induces
implicit regularization, promoting compact representations. In this work, we examine the role …

Projected gradient descent algorithm for low-rank matrix estimation

T Zhang, X Fan - arxiv preprint arxiv:2403.02704, 2024 - arxiv.org
Most existing methodologies of estimating low-rank matrices rely on Burer-Monteiro
factorization, but these approaches can suffer from slow convergence, especially when …

Absence of spurious solutions far from ground truth: A low-rank analysis with high-order losses

Z Ma, Y Chen, J Lavaei… - … Conference on Artificial …, 2024 - proceedings.mlr.press
Matrix sensing problems exhibit pervasive non-convexity, plaguing optimization with a
proliferation of suboptimal spurious solutions. Avoiding convergence to these critical points …

[KNIHA][B] Solving Matrix Sensing to Optimality under Realistic Settings

Z Ma - 2024 - search.proquest.com
Matrix sensing represents a critical, non-convex challenge within the domain of
mathematical optimization, distinguished by its wide-ranging practical applications—such as …

[PDF][PDF] Structured Noise to Help Non-Convexity: Solving Matrix Completion as Noisy Matrix Sensing

Z Ma - gavenma.github.io
The training of all modern machine learning models including deep neural networks and
large languages models can be considered as solving nonconvex optimization problems …

[PDF][PDF] Solving Matrix Completion as Noisy Matrix Sensing

Z Ma, S Sojoudi - gavenma.github.io
Matrix completion, a crucial sub-problem of non-convex matrix sensing, is integral to
numerous machine learning applications such as recommender systems. Traditionally …

[PDF][PDF] DETERMINISTIC ESCAPE FROM LOCAL MINIMA: ORACLE FROM SIMULATED OVER-PARAMETRIZATION

T Shen, K Gao, Z Ma - gavenma.github.io
Modern machine learning problems are predominantly non-convex, often containing a
potentially infinite number of local minima that can hinder gradient-based algorithms from …