Google Наука

S Zhang, H Tong, J Xu, R Maciejewski - Computational Social Networks, 2019 - Springer

Graphs naturally appear in numerous application domains, ranging from social analysis,
bioinformatics to computer vision. The unique capability of graphs enables capturing the …

Запазване Позоваване С позовавания в 1562 Сродни статии Всички 16 версии

A survey of graph neural networks in various learning paradigms: methods, applications, and challenges

L Waikhom, R Patgiri - Artificial Intelligence Review, 2023 - Springer

In the last decade, deep learning has reinvigorated the machine learning field. It has solved
many problems in computer vision, speech recognition, natural language processing, and …

Запазване Позоваване С позовавания в 87 Сродни статии Всички 5 версии

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Vision gnn: An image is worth graph of nodes

K Han, Y Wang, J Guo, Y Tang… - Advances in neural …, 2022 - proceedings.neurips.cc

Network architecture plays a key role in the deep learning-based computer vision system.
The widely-used convolutional neural network and transformer treat the image as a grid or …

Запазване Позоваване С позовавания в 447 Сродни статии Всички 8 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Compositional chain-of-thought prompting for large multimodal models

C Mitra, B Huang, T Darrell… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

The combination of strong visual backbones and Large Language Model (LLM) reasoning
has led to Large Multimodal Models (LMMs) becoming the current standard for a wide range …

Запазване Позоваване С позовавания в 74 Сродни статии Всички 5 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Grounded language-image pre-training

LH Li, P Zhang, H Zhang, J Yang, C Li… - Proceedings of the …, 2022 - openaccess.thecvf.com

This paper presents a grounded language-image pre-training (GLIP) model for learning
object-level, language-aware, and semantic-rich visual representations. GLIP unifies object …

Запазване Позоваване С позовавания в 1180 Сродни статии Всички 8 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Llm-grounded diffusion: Enhancing prompt understanding of text-to-image diffusion models with large language models

L Lian, B Li, A Yala, T Darrell - ar** big data artificial intelligence (AI) techniques with possible …

Запазване Позоваване С позовавания в 289 Сродни статии Всички 7 версии

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Cpt: Colorful prompt tuning for pre-trained vision-language models

Y Yao, A Zhang, Z Zhang, Z Liu, TS Chua, M Sun - AI Open, 2024 - Elsevier

Abstract Vision-Language Pre-training (VLP) models have shown promising capabilities in
grounding natural language in image data, facilitating a broad range of cross-modal tasks …

Запазване Позоваване С позовавания в 274 Сродни статии Всички 4 версии

Създаване на сигнал

Позоваване

Разширено търсене

Запазено в „Моята библиотека“

Scene graph generation by iterative message passing

Graph convolutional networks: a comprehensive review

A survey of graph neural networks in various learning paradigms: methods, applications, and challenges

Vision gnn: An image is worth graph of nodes

Compositional chain-of-thought prompting for large multimodal models

Grounded language-image pre-training

Llm-grounded diffusion: Enhancing prompt understanding of text-to-image diffusion models with large language models

[HTML][HTML] Cpt: Colorful prompt tuning for pre-trained vision-language models