ReST-MCTS*: LLM self-training via process reward guided tree search

D Zhang, S Zhoubian, Z Hu, Y Yue… - Advances in Neural …, 2025 - proceedings.neurips.cc
Recent methodologies in LLM self-training mostly rely on the LLM generating responses and
filtering those with correct output answers as training data. This approach often yields a low …

Benchmarking Human–AI collaboration for common evidence appraisal tools

T Woelfle, J Hirt, P Janiaud, L Kappos… - Journal of Clinical …, 2024 - Elsevier
Background: It is unknown whether large language models (LLMs) may facilitate time- and
resource-intensive text-related processes in evidence appraisal. Objectives: To quantify the …

Is a picture worth a thousand words? Delving into spatial reasoning for vision language models

J Wang, Y Ming, Z Shi, V Vineet… - Advances in Neural …, 2025 - proceedings.neurips.cc
Large language models (LLMs) and vision-language models (VLMs) have demonstrated
remarkable performance across a wide range of tasks and domains. Despite this promise …

Spider2-V: How far are multimodal agents from automating data science and engineering workflows?

R Cao, F Lei, H Wu, J Chen, Y Fu… - Advances in …, 2025 - proceedings.neurips.cc
Data science and engineering workflows often span multiple stages, from warehousing to
orchestration, using tools like BigQuery, dbt, and Airbyte. As vision language models (VLMs) …

Tensor attention training: Provably efficient learning of higher-order transformers

Y Liang, Z Shi, Z Song, Y Zhou - arXiv preprint arXiv:2405.16411, 2024 - arxiv.org
Tensor Attention, a multi-view attention that is able to capture high-order correlations among
multiple modalities, can overcome the representational limitations of classical matrix …

Large language model inference acceleration: A comprehensive hardware perspective

J Li, J Xu, S Huang, Y Chen, W Li, J Liu, Y Lian… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) have demonstrated remarkable capabilities across various
fields, from natural language understanding to text generation. Compared to non-generative …

RouteLLM: Learning to Route LLMs from Preference Data

I Ong, A Almahairi, V Wu, WL Chiang, T Wu… - The Thirteenth …, 2024 - openreview.net
Large language models (LLMs) excel at a wide range of tasks, but choosing the right model
often involves balancing performance and cost. Powerful models offer better results but are …

Conv-Basis: A new paradigm for efficient attention inference and gradient computation in transformers

Y Liang, H Liu, Z Shi, Z Song, Z Xu, J Yin - arXiv preprint arXiv:2405.05219, 2024 - arxiv.org
The self-attention mechanism is the key to the success of transformers in recent Large
Language Models (LLMs). However, the quadratic computational cost $O(n^2)$ in the …

Navigating the safety landscape: Measuring risks in finetuning large language models

SY Peng, PY Chen, M Hull, DH Chau - arXiv preprint arXiv:2405.17374, 2024 - arxiv.org
Safety alignment is crucial to ensure that large language models (LLMs) behave in ways that
align with human preferences and prevent harmful actions during inference. However …

Harmful fine-tuning attacks and defenses for large language models: A survey

T Huang, S Hu, F Ilhan, SF Tekin, L Liu - arXiv preprint arXiv:2409.18169, 2024 - arxiv.org
Recent research demonstrates that the nascent fine-tuning-as-a-service business model
exposes serious safety concerns: fine-tuning on a few harmful examples uploaded by users …