The benefits of mixup for feature learning

D Zou, Y Cao, Y Li, Q Gu - International Conference on …, 2023 - proceedings.mlr.press
Mixup, a simple data augmentation method that randomly mixes two data points via linear
interpolation, has been extensively applied in various deep learning applications to gain …
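For orientation, the linear interpolation described in the snippet above can be sketched in a few lines; this is a minimal NumPy illustration rather than the authors' implementation, and the function name mixup_batch, the Beta(alpha, alpha) mixing coefficient, and the pairing by random permutation are standard mixup conventions assumed here, not details taken from this paper.

```python
import numpy as np

def mixup_batch(x, y, alpha=1.0, rng=None):
    """Mix a batch of inputs x and one-hot labels y by convex combination."""
    rng = np.random.default_rng() if rng is None else rng
    lam = rng.beta(alpha, alpha)             # mixing coefficient in [0, 1]
    perm = rng.permutation(len(x))           # random pairing of examples in the batch
    x_mix = lam * x + (1.0 - lam) * x[perm]  # linear interpolation of inputs
    y_mix = lam * y + (1.0 - lam) * y[perm]  # matching interpolation of labels
    return x_mix, y_mix
```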

Initialization-dependent sample complexity of linear predictors and neural networks

R Magen, O Shamir - Advances in Neural Information …, 2023 - proceedings.neurips.cc
We provide several new results on the sample complexity of vector-valued linear predictors
(parameterized by a matrix), and more generally neural networks. Focusing on size …

Lower generalization bounds for GD and SGD in smooth stochastic convex optimization

P Zhang, J Teng, J Zhang - arXiv preprint arXiv:2303.10758, 2023 - arxiv.org
This work studies the generalization error of gradient methods. More specifically, we focus
on how training steps $T$ and step-size $\eta$ might affect generalization in smooth …

Implicit regularization of AdaDelta

M Englert, R Lazic, A Semler - Transactions on Machine …, 2024 - wrap.warwick.ac.uk
We consider the AdaDelta adaptive optimization algorithm on locally Lipschitz, positively
homogeneous, and o-minimally definable neural networks, with either the exponential or the …
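For context, the AdaDelta update analyzed here (Zeiler, 2012) keeps running averages of squared gradients and squared parameter updates and requires no explicit learning rate; below is a minimal NumPy sketch, where the function name adadelta_step and the default rho and eps values are illustrative choices, not details from this paper.

```python
import numpy as np

def adadelta_step(param, grad, state, rho=0.95, eps=1e-6):
    """One AdaDelta update: step size is the RMS of past updates over the RMS of gradients."""
    Eg2, Edx2 = state
    Eg2 = rho * Eg2 + (1 - rho) * grad**2                  # running average of squared gradients
    dx = -np.sqrt(Edx2 + eps) / np.sqrt(Eg2 + eps) * grad  # rescaled descent step
    Edx2 = rho * Edx2 + (1 - rho) * dx**2                  # running average of squared updates
    return param + dx, (Eg2, Edx2)
```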

[Book][B] Feature Learning in Neural Networks and Other Stochastic Explorations

M Glasgow - 2024 - search.proquest.com
Recent years have offered ample empirical demonstration of the unprecedented success of deep learning.
Yet our theoretical understanding of why gradient descent succeeds in training neural …