Pre-trained language models for text generation: A survey

J Li, T Tang, WX Zhao, JY Nie, JR Wen - ACM Computing Surveys, 2024 - dl.acm.org
Text Generation aims to produce plausible and readable text in human language from input
data. The resurgence of deep learning has greatly advanced this field, in particular, with the …

Neural collapse: A review on modelling principles and generalization

V Kothapalli - arxiv preprint arxiv:2206.04041, 2022 - arxiv.org
Deep classifier neural networks enter the terminal phase of training (TPT) when training
error reaches zero and tend to exhibit intriguing Neural Collapse (NC) properties. Neural …

Learning with fantasy: Semantic-aware virtual contrastive constraint for few-shot class-incremental learning

Z Song, Y Zhao, Y Shi, P Peng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Few-shot class-incremental learning (FSCIL) aims at learning to classify new classes
continually from limited samples without forgetting the old classes. The mainstream …

Fsce: Few-shot object detection via contrastive proposal encoding

B Sun, B Li, S Cai, Y Yuan… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Emerging interests have been brought to recognize previously unseen objects given very
few training examples, known as few-shot object detection (FSOD). Recent researches …

Supervised contrastive learning

P Khosla, P Teterwak, C Wang… - Advances in neural …, 2020 - proceedings.neurips.cc
Contrastive learning applied to self-supervised representation learning has seen a
resurgence in recent years, leading to state of the art performance in the unsupervised …

Supervised contrastive learning for pre-trained language model fine-tuning

B Gunel, J Du, A Conneau, V Stoyanov - arxiv preprint arxiv:2011.01403, 2020 - arxiv.org
State-of-the-art natural language understanding classification models follow two-stages: pre-
training a large language model on an auxiliary task, and then fine-tuning the model on a …

Fantastic generalization measures and where to find them

Y Jiang, B Neyshabur, H Mobahi, D Krishnan… - arxiv preprint arxiv …, 2019 - arxiv.org
Generalization of deep networks has been of great interest in recent years, resulting in a
number of theoretically and empirically motivated complexity measures. However, most …

[HTML][HTML] Applying supervised contrastive learning for the detection of diabetic retinopathy and its severity levels from fundus images

MR Islam, LF Abdulrazak, M Nahiduzzaman… - Computers in biology …, 2022 - Elsevier
Diabetic Retinopathy (DR) is a major complication in human eyes among the diabetic
patients. Early detection of the DR can save many patients from permanent blindness …

Uncertainty reduction for model adaptation in semantic segmentation

F Fleuret - Proceedings of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Abstract Traditional methods for Unsupervised Domain Adaptation (UDA) targeting semantic
segmentation exploit information common to the source and target domains, using both …

Identifying mislabeled data using the area under the margin ranking

G Pleiss, T Zhang, E Elenberg… - Advances in Neural …, 2020 - proceedings.neurips.cc
Not all data in a typical training set help with generalization; some samples can be overly
ambiguous or outrightly mislabeled. This paper introduces a new method to identify such …