Towards lifelong learning of large language models: A survey

J Zheng, S Qiu, C Shi, Q Ma - ACM Computing Surveys, 2024 - dl.acm.org
As the applications of large language models (LLMs) expand across diverse fields, their
ability to adapt to ongoing changes in data, tasks, and user preferences becomes crucial …

Authorship attribution in the era of LLMs: Problems, methodologies, and challenges

B Huang, C Chen, K Shu - ACM SIGKDD Explorations Newsletter, 2025 - dl.acm.org
Accurate attribution of authorship is crucial for maintaining the integrity of digital content,
improving forensic investigations, and mitigating the risks of misinformation and plagiarism …

ChatQA: Surpassing GPT-4 on conversational QA and RAG

Z Liu, W Ping, R Roy, P Xu, C Lee… - Advances in …, 2025 - proceedings.neurips.cc
In this work, we introduce ChatQA, a suite of models that outperform GPT-4 on retrieval-
augmented generation (RAG) and conversational question answering (QA). To enhance …

DataComp-LM: In search of the next generation of training sets for language models

J Li, A Fang, G Smyrnis, M Ivgi, M Jordan… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce DataComp for Language Models (DCLM), a testbed for controlled dataset
experiments with the goal of improving language models. As part of DCLM, we provide a …

A survey of multimodal large language model from a data-centric perspective

T Bai, H Liang, B Wan, Y Xu, X Li, S Li, L Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
Multimodal large language models (MLLMs) enhance the capabilities of standard large
language models by integrating and processing data from multiple modalities, including text …

Eagle and Finch: RWKV with matrix-valued states and dynamic recurrence

B Peng, D Goldstein, Q Anthony, A Albalak… - arXiv preprint arXiv …, 2024 - openreview.net
We present Eagle (RWKV-5) and Finch (RWKV-6), sequence models improving
upon the RWKV (RWKV-4) (Peng et al., 2023) architecture. Our architectural design …

Language models scale reliably with over-training and on downstream tasks

SY Gadre, G Smyrnis, V Shankar, S Gururangan… - arXiv preprint arXiv …, 2024 - arxiv.org
Scaling laws are useful guides for derisking expensive training runs, as they predict
performance of large models using cheaper, small-scale experiments. However, there …

Instruction pre-training: Language models are supervised multitask learners

D Cheng, Y Gu, S Huang, J Bi, M Huang… - arXiv preprint arXiv …, 2024 - arxiv.org
Unsupervised multitask pre-training has been the critical method behind the recent success
of language models (LMs). However, supervised multitask learning still holds significant …

From generation to judgment: Opportunities and challenges of LLM-as-a-judge

D Li, B Jiang, L Huang, A Beigi, C Zhao, Z Tan… - arXiv preprint arXiv …, 2024 - arxiv.org
Assessment and evaluation have long been critical challenges in artificial intelligence (AI)
and natural language processing (NLP). However, traditional methods, whether matching …

Scaling laws for precision

T Kumar, Z Ankner, BF Spector, B Bordelon… - arXiv preprint arXiv …, 2024 - arxiv.org
Low precision training and inference affect both the quality and cost of language models, but
current scaling laws do not account for this. In this work, we devise "precision-aware" scaling …