Google Tudós

Cikkek

Tudós

2 találat (0,02 másodperc)

Saját profil Saját könyvtár

Pegasus-v1 Technical Report

Keresés az idéző cikkek között

[Free GPT-4]

[PDF] arxiv.org

Video-mme: The first-ever comprehensive evaluation benchmark of multi-modal llms in video analysis

C Fu, Y Dai, Y Luo, L Li, S Ren, R Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org

In the quest for artificial general intelligence, Multi-modal Large Language Models (MLLMs)
have emerged as a focal point in recent advancements. However, the predominant focus …

Mentés Hivatkozás Idézetek száma: 139 Kapcsolódó cikkek Mind a(z) 2 változat HTML-változat

[Free GPT-4]

[PDF] arxiv.org

Video understanding with large language models: A survey

Y Tang, J Bi, S Xu, L Song, S Liang, T Wang… - arxiv preprint arxiv …, 2023 - arxiv.org

With the burgeoning growth of online video platforms and the escalating volume of video
content, the demand for proficient video understanding tools has intensified markedly. Given …

Mentés Hivatkozás Idézetek száma: 60 Kapcsolódó cikkek Mind a(z) 2 változat HTML-változat

Értesítés létrehozása

Hivatkozás

Speciális keresés

Mentve a Saját könyvtárba

Pegasus-v1 Technical Report

Video-mme: The first-ever comprehensive evaluation benchmark of multi-modal llms in video analysis

Video understanding with large language models: A survey