High dimensional analysis reveals conservative sharpening and a stochastic edge of stability

A Agarwala, J Pennington - arXiv preprint arXiv:2404.19261, 2024 - arxiv.org
Recent empirical and theoretical work has shown that the dynamics of the large eigenvalues
of the training loss Hessian have some remarkably robust features across models and …
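Below is a minimal, illustrative JAX sketch (not the paper's code) of the quantity this line of work tracks: the largest eigenvalue of the training-loss Hessian ("sharpness") monitored along full-batch gradient descent and compared against the stability threshold 2/eta. The toy two-layer model, data, and learning rate are assumptions chosen for illustration only.

import jax
import jax.numpy as jnp

key = jax.random.PRNGKey(0)
X = jax.random.normal(key, (32, 4))           # toy inputs (assumed, not from the paper)
y = jnp.sin(X @ jnp.ones(4))                  # toy regression targets

d_in, d_hidden = 4, 16
k1, k2 = jax.random.split(key)
params = jnp.concatenate([
    jax.random.normal(k1, (d_in * d_hidden,)) / jnp.sqrt(d_in),
    jax.random.normal(k2, (d_hidden,)) / jnp.sqrt(d_hidden),
])

def loss(p):
    # two-layer tanh network with mean-squared-error loss
    W = p[: d_in * d_hidden].reshape(d_in, d_hidden)
    v = p[d_in * d_hidden:]
    return jnp.mean((jnp.tanh(X @ W) @ v - y) ** 2)

eta = 0.05                                    # full-batch GD step size (illustrative)
grad_fn = jax.jit(jax.grad(loss))
hess_fn = jax.jit(jax.hessian(loss))

for step in range(200):
    params = params - eta * grad_fn(params)
    if step % 20 == 0:
        # largest Hessian eigenvalue ("sharpness") vs. the stability threshold 2/eta
        sharpness = float(jnp.linalg.eigvalsh(hess_fn(params))[-1])
        print(f"step {step:3d}  loss {float(loss(params)):.4f}  "
              f"sharpness {sharpness:.2f}  2/eta = {2 / eta:.2f}")

In edge-of-stability experiments the printed sharpness typically rises toward, and then hovers near, 2/eta; the sketch only exposes the measurement, not the paper's high-dimensional analysis.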

Guiding Two-Layer Neural Network Lipschitzness via Gradient Descent Learning Rate Constraints

K Sung, A Kratsios, N Forman - arXiv preprint arXiv:2502.03792, 2025 - arxiv.org
We demonstrate that applying an eventual decay to the learning rate (LR) in empirical risk
minimization (ERM), where the mean-squared-error loss is minimized using standard …
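The following is a minimal, illustrative JAX sketch, not the paper's construction, of ERM with an eventual learning-rate decay: a two-layer network is trained on a mean-squared-error loss with a constant step size that is decayed after a fixed number of steps. The schedule, architecture, and constants are assumptions for illustration.

import jax
import jax.numpy as jnp

key = jax.random.PRNGKey(1)
X = jax.random.normal(key, (64, 8))           # toy inputs (assumed)
y = jnp.cos(X @ jnp.ones(8))                  # toy regression targets

d_in, d_hidden = 8, 32
k1, k2 = jax.random.split(key)
params = (
    jax.random.normal(k1, (d_in, d_hidden)) / jnp.sqrt(d_in),
    jax.random.normal(k2, (d_hidden,)) / jnp.sqrt(d_hidden),
)

def mse(params):
    # two-layer tanh network, mean-squared-error empirical risk
    W, v = params
    return jnp.mean((jnp.tanh(X @ W) @ v - y) ** 2)

grad_fn = jax.jit(jax.grad(mse))

def lr_schedule(step, eta0=0.1, decay_start=500):
    # constant step size, then an eventual 1/sqrt(t) decay (illustrative choice)
    if step < decay_start:
        return eta0
    return eta0 / (1.0 + step - decay_start) ** 0.5

for step in range(1000):
    grads = grad_fn(params)
    eta = lr_schedule(step)
    params = jax.tree_util.tree_map(lambda p, g: p - eta * g, params, grads)
    if step % 200 == 0:
        print(f"step {step:4d}  lr {eta:.4f}  mse {float(mse(params)):.4f}")

The specific decay rule above is only one common choice; the listed paper studies how constraints on the learning rate relate to the trained network's Lipschitz constant, which the sketch does not attempt to reproduce.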