Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Gradient norm aware minimization seeks first-order flatness and improves generalization
Recently, flat minima are proven to be effective for improving generalization and sharpness-
aware minimization (SAM) achieves state-of-the-art performance. Yet the current definition of …
aware minimization (SAM) achieves state-of-the-art performance. Yet the current definition of …
A modern look at the relationship between sharpness and generalization
Sharpness of minima is a promising quantity that can correlate with generalization in deep
networks and, when optimized during training, can improve generalization. However …
networks and, when optimized during training, can improve generalization. However …
Visual mamba: A survey and new outlooks
Mamba, a recent selective structured state space model, excels in long sequence modeling,
which is vital in the large model era. Long sequence modeling poses significant challenges …
which is vital in the large model era. Long sequence modeling poses significant challenges …
Friendly sharpness-aware minimization
Abstract Sharpness-Aware Minimization (SAM) has been instrumental in improving deep
neural network training by minimizing both training loss and loss sharpness. Despite the …
neural network training by minimizing both training loss and loss sharpness. Despite the …
Flatmatch: Bridging labeled data and unlabeled data with cross-sharpness for semi-supervised learning
Abstract Semi-Supervised Learning (SSL) has been an effective way to leverage abundant
unlabeled data with extremely scarce labeled data. However, most SSL methods are …
unlabeled data with extremely scarce labeled data. However, most SSL methods are …
Flatness-aware minimization for domain generalization
Abstract Domain generalization (DG) seeks to learn robust models that generalize well
under unknown distribution shifts. As a critical aspect of DG, optimizer selection has not …
under unknown distribution shifts. As a critical aspect of DG, optimizer selection has not …
Compute-efficient deep learning: Algorithmic trends and opportunities
Although deep learning has made great progress in recent years, the exploding economic
and environmental costs of training neural networks are becoming unsustainable. To …
and environmental costs of training neural networks are becoming unsustainable. To …