Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching

X Ma, G Fang, M Bi Mi, X Wang - Advances in Neural …, 2025 - proceedings.neurips.cc
Diffusion Transformers have recently demonstrated unprecedented generative capabilities
for various tasks. The encouraging results, however, come with the cost of slow inference …

Demystifying Singular Defects in Large Language Models

H Wang, T Zhang, M Salzmann - arXiv preprint arXiv:2502.07004, 2025 - arxiv.org
Large transformer models are known to produce high-norm tokens. In vision transformers
(ViTs), such tokens have been mathematically modeled through the singular vectors of the …

Unsupervised Model Tree Heritage Recovery

E Horwitz, A Shul, Y Hoshen - The Thirteenth International Conference on … - openreview.net
The number of models shared online has recently skyrocketed, with over one million public
models available on Hugging Face. Sharing models allows other users to build on existing …