Google znalac

M Stefanini, M Cornia, L Baraldi… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Connecting Vision and Language plays an essential role in Generative Intelligence. For this
reason, large research efforts have been devoted to image captioning, ie describing images …

Spremi Citiraj Spominje se 393 puta Srodni članci Svih 12 inačica

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Vision Transformers in medical computer vision—A contemplative retrospection

A Parvaiz, MA Khalid, R Zafar, H Ameer, M Ali… - … Applications of Artificial …, 2023 - Elsevier

Abstract Vision Transformers (ViTs), with the magnificent potential to unravel the information
contained within images, have evolved as one of the most contemporary and dominant …

Spremi Citiraj Spominje se 182 puta Srodni članci Svih 6 inačica

[Free GPT-4]
[DeepSeek]

[PDF] stableaiprompts.com

[PDF][PDF] The dawn of lmms: Preliminary explorations with gpt-4v (ision)

Z Yang, L Li, K Lin, J Wang, CC Lin… - arxiv preprint arxiv …, 2023 - stableaiprompts.com

Large multimodal models (LMMs) extend large language models (LLMs) with multi-sensory
skills, such as visual understanding, to achieve stronger generic intelligence. In this paper …

Spremi Citiraj Spominje se 591 puta Srodni članci Svih 4 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Dense text-to-image generation with attention modulation

Y Kim, J Lee, JH Kim, JW Ha… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Existing text-to-image diffusion models struggle to synthesize realistic images given dense
captions, where each text prompt provides a detailed description for a specific image region …

Spremi Citiraj Spominje se 100 puta Srodni članci Svih 7 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Interactive and explainable region-guided radiology report generation

T Tanida, P Müller, G Kaissis… - Proceedings of the …, 2023 - openaccess.thecvf.com

The automatic generation of radiology reports has the potential to assist radiologists in the
time-consuming task of report writing. Existing methods generate the full report from image …

Spremi Citiraj Spominje se 138 puta Srodni članci Svih 7 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] High-precision multiclass classification of lung disease through customized MobileNetV2 from chest X-ray images

FMJM Shamrat, S Azam, A Karim, K Ahmed… - Computers in Biology …, 2023 - Elsevier

In this study, multiple lung diseases are diagnosed with the help of the Neural Network
algorithm. Specifically, Emphysema, Infiltration, Mass, Pleural Thickening, Pneumonia …

Spremi Citiraj Spominje se 117 puta Srodni članci Svih 5 inačica

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Gloria: A multimodal global-local representation learning framework for label-efficient medical image recognition

SC Huang, L Shen, MP Lungren… - Proceedings of the …, 2021 - openaccess.thecvf.com

In recent years, the growing number of medical imaging studies is placing an ever-
increasing burden on radiologists. Deep learning provides a promising solution for …

Spremi Citiraj Spominje se 349 puta Srodni članci Svih 6 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Expressive text-to-image generation with rich text

S Ge, T Park, JY Zhu, JB Huang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Plain text has become a prevalent interface for text-to-image synthesis. However, its limited
customization options hinder users from accurately describing desired outputs. For example …

Spremi Citiraj Spominje se 73 puta Srodni članci Svih 6 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Pre-trained models: Past, present and future

X Han, Z Zhang, N Ding, Y Gu, X Liu, Y Huo, J Qiu… - AI Open, 2021 - Elsevier

Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved
great success and become a milestone in the field of artificial intelligence (AI). Owing to …

Spremi Citiraj Spominje se 942 puta Srodni članci Svih 11 inačica

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Grit: A generative region-to-text transformer for object understanding

J Wu, J Wang, Z Yang, Z Gan, Z Liu, J Yuan… - European Conference on …, 2024 - Springer

This paper presents a Generative RegIon-to-Text transformer, GRiT, for object
understanding. The spirit of GRiT is to formulate object understanding as< region, text> …

Spremi Citiraj Spominje se 107 puta Srodni članci Svih 8 inačica

Stvori obavijest

Citiraj

Napredno pretraživanje

Spremljeno u Moju knjižnicu

Densecap: Fully convolutional localization networks for dense captioning

From show to tell: A survey on deep learning-based image captioning

Vision Transformers in medical computer vision—A contemplative retrospection

[PDF][PDF] The dawn of lmms: Preliminary explorations with gpt-4v (ision)

Dense text-to-image generation with attention modulation

Interactive and explainable region-guided radiology report generation

[HTML][HTML] High-precision multiclass classification of lung disease through customized MobileNetV2 from chest X-ray images

Gloria: A multimodal global-local representation learning framework for label-efficient medical image recognition

Expressive text-to-image generation with rich text

[HTML][HTML] Pre-trained models: Past, present and future

Grit: A generative region-to-text transformer for object understanding