Weak-to-strong generalization: Eliciting strong capabilities with weak supervision

C Burns, P Izmailov, JH Kirchner, B Baker… - arXiv preprint arXiv …, 2023 - arxiv.org
Widely used alignment techniques, such as reinforcement learning from human feedback
(RLHF), rely on the ability of humans to supervise model behavior, for example to evaluate …
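
The supervision structure here is simple to mock up. Below is a minimal sketch, assuming scikit-learn and a synthetic task rather than the paper's pretrained language models: a weak supervisor is fit on a little ground truth, a stronger student is trained only on the weak model's labels, and both are scored against held-out ground truth.

```python
# A minimal sketch of the weak-to-strong setup, assuming scikit-learn and a
# synthetic task (the paper's experiments use large pretrained LMs, but the
# supervision structure is the same).
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.neural_network import MLPClassifier

X, y = make_classification(n_samples=5000, n_features=20, random_state=0)

# Weak supervisor: a small model fit on a little ground truth.
weak = LogisticRegression().fit(X[:200], y[:200])

# Strong student: never sees ground truth, only the weak model's labels.
weak_labels = weak.predict(X[200:4000])
strong = MLPClassifier(hidden_layer_sizes=(64, 64), max_iter=300,
                       random_state=0).fit(X[200:4000], weak_labels)

# Weak-to-strong generalization: the student outperforming its supervisor
# on held-out ground truth.
print("weak supervisor acc:", weak.score(X[4000:], y[4000:]))
print("strong student acc:", strong.score(X[4000:], y[4000:]))
```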

Freematch: Self-adaptive thresholding for semi-supervised learning

Y Wang, H Chen, Q Heng, W Hou, Y Fan, Z Wu… - arXiv preprint arXiv …, 2022 - arxiv.org
Pseudo labeling and consistency regularization approaches with confidence-based
thresholding have made great progress in semi-supervised learning (SSL). In this paper, we …
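
A minimal sketch of the core mechanism, assuming PyTorch. Only the self-adaptive global threshold (an EMA of batch confidence) is kept; FreeMatch's per-class modulation, weak/strong augmentation pair, and fairness regularizer are omitted.

```python
# A minimal sketch of self-adaptive thresholding, assuming PyTorch; only the
# global EMA threshold is shown.
import torch
import torch.nn.functional as F

def pseudo_label_loss(model, x_unlabeled, tau, ema=0.999):
    """One unlabeled-batch loss; returns the loss and the updated threshold."""
    logits = model(x_unlabeled)
    with torch.no_grad():
        conf, pseudo = F.softmax(logits, dim=-1).max(dim=-1)
        # Self-adaptive threshold: EMA of the batch's mean max-confidence.
        tau = ema * tau + (1 - ema) * conf.mean().item()
        mask = (conf >= tau).float()  # keep only confident pseudo-labels
    loss = (F.cross_entropy(logits, pseudo, reduction="none") * mask).mean()
    return loss, tau
```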

Causal inference in natural language processing: Estimation, prediction, interpretation and beyond

A Feder, KA Keith, E Manzoor, R Pryzant… - Transactions of the …, 2022 - direct.mit.edu
A fundamental goal of scientific research is to learn about causal relationships. However,
despite its critical role in the life and social sciences, causality has not had the same …

Self-training: A survey

MR Amini, V Feofanov, L Pauletto, L Hadjadj… - Neurocomputing, 2025 - Elsevier
Self-training methods have gained significant attention in recent years due to their
effectiveness in leveraging small labeled datasets and large unlabeled observations for …
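
The generic recipe the survey covers fits in a few lines. A minimal sketch, assuming scikit-learn, with a fixed confidence cutoff standing in for the many selection rules surveyed:

```python
# A minimal self-training loop: train, pseudo-label the confident unlabeled
# points, fold them into the labeled pool, and retrain.
import numpy as np
from sklearn.linear_model import LogisticRegression

def self_train(X_lab, y_lab, X_unlab, rounds=5, cutoff=0.9):
    model = LogisticRegression().fit(X_lab, y_lab)
    for _ in range(rounds):
        if len(X_unlab) == 0:
            break
        probs = model.predict_proba(X_unlab)
        keep = probs.max(axis=1) >= cutoff
        if not keep.any():
            break
        # Promote confidently pseudo-labeled points into the labeled pool.
        X_lab = np.vstack([X_lab, X_unlab[keep]])
        y_lab = np.concatenate([y_lab, model.classes_[probs[keep].argmax(axis=1)]])
        X_unlab = X_unlab[~keep]
        model = LogisticRegression().fit(X_lab, y_lab)
    return model
```

scikit-learn ships a comparable wrapper as `sklearn.semi_supervised.SelfTrainingClassifier`.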

Cycle self-training for domain adaptation

H Liu, J Wang, M Long - Advances in Neural Information …, 2021 - proceedings.neurips.cc
Mainstream approaches for unsupervised domain adaptation (UDA) learn domain-invariant
representations to narrow the domain shift, which are empirically effective but theoretically …
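
A rough sketch of the cycle idea, assuming a shared featurizer with separate source and target heads; the paper's actual bi-level formulation and its Tsallis-entropy regularizer are omitted here.

```python
# A rough sketch of cycle self-training: the source head pseudo-labels the
# target, a target head fits those pseudo-labels, and the cycle term asks
# that this target-trained head still fit the labeled source.
import torch.nn.functional as F

def cst_loss(feat, head_src, head_tgt, x_src, y_src, x_tgt):
    z_src, z_tgt = feat(x_src), feat(x_tgt)
    # Forward: the source head pseudo-labels the target domain.
    loss_src = F.cross_entropy(head_src(z_src), y_src)
    pseudo = head_src(z_tgt).argmax(dim=-1)
    loss_tgt = F.cross_entropy(head_tgt(z_tgt), pseudo)
    # Cycle: penalize pseudo-labels that only work under the domain shift.
    loss_cycle = F.cross_entropy(head_tgt(z_src), y_src)
    return loss_src + loss_tgt + loss_cycle
```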

Test time adaptation via conjugate pseudo-labels

S Goyal, M Sun, A Raghunathan… - Advances in Neural …, 2022 - proceedings.neurips.cc
Test-time adaptation (TTA) refers to adapting neural networks to distribution shifts,
specifically with access only to unlabeled test samples from the new domain at test time …
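
For a model trained with cross-entropy, the paper observes that the conjugate pseudo-label is the model's own (temperature-scaled) softmax output, so an adaptation step reduces to soft self-training on the test batch. A minimal sketch, assuming PyTorch and leaving the paper's general conjugate construction aside:

```python
# A minimal TTA step with conjugate pseudo-labels for a cross-entropy-trained
# model: the detached softmax serves as a soft pseudo-label for the update.
import torch
import torch.nn.functional as F

def tta_step(model, optimizer, x_test, temperature=1.0):
    logits = model(x_test)
    soft_pl = F.softmax(logits.detach() / temperature, dim=-1)
    loss = -(soft_pl * F.log_softmax(logits, dim=-1)).sum(dim=-1).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```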

Theoretical analysis of self-training with deep networks on unlabeled data

C Wei, K Shen, Y Chen, T Ma - arXiv preprint arXiv:2010.03622, 2020 - arxiv.org
Self-training algorithms, which train a model to fit pseudolabels predicted by another
previously-learned model, have been very successful for learning with unlabeled data using …
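
For reference, the objective this line of analysis studies can be stated compactly (notation is assumed here, not taken from the paper):

```latex
% Fit f_theta to the pseudolabels of a previously-learned model F_prev
% over the unlabeled distribution U.
\min_{\theta}\; \mathbb{E}_{x \sim U}\!\left[ \ell\big( f_\theta(x),\, \hat{y}(x) \big) \right],
\qquad \hat{y}(x) = \arg\max_{k}\, F_{\mathrm{prev}}(x)_k .
```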

Theoretical analysis of weak-to-strong generalization

H Lang, D Sontag… - Advances in Neural …, 2025 - proceedings.neurips.cc
Strong student models can learn from weaker teachers: when trained on the predictions of a
weaker model, a strong pretrained student can learn to correct the weak model's errors and …
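
The quantity at stake, stated informally with assumed notation: a student f_s trained only on pairs (x, f_w(x)) exhibits weak-to-strong generalization when its ground-truth error beats its weak supervisor's.

```latex
% Weak-to-strong generalization as an error inequality, with y the ground
% truth, f_w the weak supervisor, and f_s the student fit to f_w's labels.
\Pr_{x}\!\big[ f_s(x) \neq y(x) \big] \;<\; \Pr_{x}\!\big[ f_w(x) \neq y(x) \big].
```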

Robust learning with progressive data expansion against spurious correlation

Y Deng, Y Yang, B Mirzasoleiman… - Advances in neural …, 2023 - proceedings.neurips.cc
While deep learning models have shown remarkable performance in various tasks, they are
susceptible to learning non-generalizable _spurious features_ rather than the core features …
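
A rough sketch of the expansion schedule, assuming PyTorch; the warm-up fraction, stage count, and random expansion order are placeholders, whereas the paper expands the data according to its analysis of when spurious features get picked up.

```python
# Progressive data expansion, sketched: early training sees only a small
# subset, and later stages add chunks of the remaining data.
import torch
from torch.utils.data import DataLoader, Subset

def progressive_loaders(dataset, warmup_frac=0.1, stages=5, batch_size=64):
    order = torch.randperm(len(dataset)).tolist()
    cut = int(warmup_frac * len(dataset))
    step = (len(dataset) - cut) // stages
    for s in range(stages + 1):
        # Stage s trains on the warm-up subset plus s expansion chunks.
        yield DataLoader(Subset(dataset, order[: cut + s * step]),
                         batch_size=batch_size, shuffle=True)
```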

Masktune: Mitigating spurious correlations by forcing to explore

S Asgari, A Khani, F Khani, A Gholami… - Advances in …, 2022 - proceedings.neurips.cc
A fundamental challenge of over-parameterized deep learning models is learning
meaningful data representations that yield good performance on a downstream task without …
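
A minimal sketch of the masking step in the spirit of MaskTune, assuming PyTorch and input-times-gradient saliency; the paper then fine-tunes for a single epoch on the masked inputs so the model must explore features beyond the ones it already relies on.

```python
# Mask the input positions the trained model leans on most, as estimated by
# input-x-gradient saliency, forcing later fine-tuning to explore elsewhere.
import torch
import torch.nn.functional as F

def mask_salient(model, x, y, mask_frac=0.1):
    x = x.clone().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)
    grad, = torch.autograd.grad(loss, x)
    saliency = (grad * x).abs().flatten(1)   # per-position attribution
    k = max(1, int(mask_frac * saliency.shape[1]))
    top = saliency.topk(k, dim=1).indices    # most salient positions
    masked = x.detach().flatten(1).clone()
    masked.scatter_(1, top, 0.0)             # zero out what the model used
    return masked.view_as(x)
```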