On the information bottleneck theory of deep learning

AM Saxe, Y Bansal, J Dapello, M Advani… - Journal of Statistical …, 2019 - iopscience.iop.org
The practical successes of deep neural networks have not been matched by theoretical
progress that satisfyingly explains their behavior. In this work, we study the information …

Optimization methods for large-scale machine learning

L Bottou, FE Curtis, J Nocedal - SIAM review, 2018 - SIAM
This paper provides a review and commentary on the past, present, and future of numerical
optimization algorithms in the context of machine learning applications. Through case …

[BOOK][B] Targeted learning in data science

MJ Van der Laan, S Rose - 2018 - Springer
This book builds on and is a sequel to our book Targeted Learning: Causal Inference for
Observational and Experimental Studies (2011). Since the publication of this first book on …

New insights and perspectives on the natural gradient method

J Martens - Journal of Machine Learning Research, 2020 - jmlr.org
Natural gradient descent is an optimization method traditionally motivated from the
perspective of information geometry, and works well for many applications as an alternative …
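For reference, the update the title refers to, in its standard form (my paraphrase, not quoted from the paper): natural gradient descent preconditions the ordinary gradient with the inverse Fisher information matrix,

\[
\theta_{t+1} = \theta_t - \eta\, F(\theta_t)^{-1} \nabla_\theta \mathcal{L}(\theta_t),
\qquad
F(\theta) = \mathbb{E}_{x \sim p_\theta}\!\left[\nabla_\theta \log p_\theta(x)\, \nabla_\theta \log p_\theta(x)^{\top}\right],
\]

so steps are measured in the geometry of the model's output distribution rather than in raw parameter space.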

Stochastic gradient descent tricks

L Bottou - Neural networks: tricks of the trade: second edition, 2012 - Springer
Chapter 1 strongly advocates the stochastic back-propagation method to train neural
networks. This is in fact an instance of a more general technique called stochastic gradient …
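A minimal sketch of the stochastic gradient descent loop this chapter advocates, shown here on a least-squares loss with a fixed step size (the loss, step size, and function name are illustrative assumptions, not taken from the chapter):

import numpy as np

def sgd_least_squares(A, b, lr=0.01, epochs=20, seed=0):
    """Plain SGD on the loss 0.5 * ||A x - b||^2, visiting one example per step."""
    rng = np.random.default_rng(seed)
    n, d = A.shape
    x = np.zeros(d)
    for _ in range(epochs):
        for i in rng.permutation(n):           # shuffle examples each pass
            residual = A[i] @ x - b[i]         # prediction error on example i
            x -= lr * residual * A[i]          # gradient of 0.5*(A[i] @ x - b[i])**2
    return x

The chapter's "tricks" concern practical choices around exactly this loop, such as step-size schedules and example ordering.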

Label consistent K-SVD: Learning a discriminative dictionary for recognition

Z Jiang, Z Lin, LS Davis - IEEE transactions on pattern analysis …, 2013 - ieeexplore.ieee.org
A label consistent K-SVD (LC-KSVD) algorithm to learn a discriminative dictionary for sparse
coding is presented. In addition to using class labels of training data, we also associate label …

Stochastic gradient descent, weighted sampling, and the randomized Kaczmarz algorithm

D Needell, R Ward, N Srebro - Advances in neural …, 2014 - proceedings.neurips.cc
We improve a recent guarantee of Bach and Moulines on the linear convergence of SGD for
smooth and strongly convex objectives, reducing a quadratic dependence on the strong …
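For context, a minimal sketch of the randomized Kaczmarz iteration named in the title, with rows sampled in proportion to their squared norms (a standard presentation under my own naming and defaults, not code from the paper):

import numpy as np

def randomized_kaczmarz(A, b, iters=5000, seed=0):
    """Randomized Kaczmarz for a consistent linear system A x = b.

    Each step samples row i with probability ||A[i]||^2 / ||A||_F^2 and projects
    the current iterate onto the hyperplane {x : A[i] @ x = b[i]}.
    """
    rng = np.random.default_rng(seed)
    n, d = A.shape
    row_norms_sq = np.einsum("ij,ij->i", A, A)   # squared norm of each row
    probs = row_norms_sq / row_norms_sq.sum()
    x = np.zeros(d)
    for _ in range(iters):
        i = rng.choice(n, p=probs)
        x += ((b[i] - A[i] @ x) / row_norms_sq[i]) * A[i]   # exact projection onto row i
    return x

Read as SGD on the least-squares objective, this amounts to importance-weighted sampling with a per-example step size of 1 / ||A[i]||^2, which is essentially the link between SGD, weighted sampling, and Kaczmarz announced in the title.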

Stochastic dual coordinate ascent methods for regularized loss

S Shalev-Shwartz, T Zhang - The Journal of Machine Learning …, 2013 - dl.acm.org
Stochastic Gradient Descent (SGD) has become popular for solving large-scale supervised
machine learning optimization problems such as SVM, due to its strong theoretical …

Large-scale machine learning with stochastic gradient descent

L Bottou - Proceedings of COMPSTAT'2010: 19th International …, 2010 - Springer
During the last decade, data sizes have grown faster than the speed of processors. In
this context, the capabilities of statistical machine learning methods are limited by the …

A stochastic quasi-Newton method for large-scale optimization

RH Byrd, SL Hansen, J Nocedal, Y Singer - SIAM Journal on Optimization, 2016 - SIAM
The question of how to incorporate curvature information into stochastic approximation
methods is challenging. The direct application of classical quasi-Newton updating …