- Academic Search

MDZ Hossain, F Sohel, MF Shiratuddin… - ACM Computing Surveys …, 2019 - dl.acm.org

Generating a description of an image is called image captioning. Image captioning requires
recognizing the important objects, their attributes, and their relationships in an image. It also …

Save Cite Cited by 1004 Related articles All 8 versions Free GPT-4

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] Adversarial text-to-image synthesis: A review

S Frolov, T Hinz, F Raue, J Hees, A Dengel - Neural Networks, 2021 - Elsevier

With the advent of generative adversarial networks, synthesizing images from text
descriptions has recently become an active research area. It is a flexible and intuitive way for …

Save Cite Cited by 220 Related articles All 9 versions Free GPT-4

[Free GPT-4]

[PDF] thecvf.com

Diffusiondet: Diffusion model for object detection

S Chen, P Sun, Y Song, P Luo - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

We propose DiffusionDet, a new framework that formulates object detection as a denoising
diffusion process from noisy boxes to object boxes. During the training stage, object boxes …

Save Cite Cited by 484 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Beyond transmitting bits: Context, semantics, and task-oriented communications

D Gündüz, Z Qin, IE Aguerri, HS Dhillon… - IEEE Journal on …, 2022 - ieeexplore.ieee.org

Communication systems to date primarily aim at reliably communicating bit sequences.
Such an approach provides efficient engineering designs that are agnostic to the meanings …

Save Cite Cited by 429 Related articles All 6 versions Free GPT-4

[Free GPT-4]

[PDF] springer.com

Visual genome: Connecting language and vision using crowdsourced dense image annotations

R Krishna, Y Zhu, O Groth, J Johnson, K Hata… - International journal of …, 2017 - Springer

Despite progress in perceptual tasks such as image classification, computers still perform
poorly on cognitive tasks such as image description and question answering. Cognition is …

Save Cite Cited by 6281 Related articles All 14 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Past, present, and future of simultaneous localization and map**: Toward the robust-perception age

C Cadena, L Carlone, H Carrillo, Y Latif… - IEEE Transactions …, 2016 - ieeexplore.ieee.org

Simultaneous localization and map** (SLAM) consists in the concurrent construction of a
model of the environment (the map), and the estimation of the state of the robot moving …

Save Cite Cited by 4462 Related articles All 22 versions Free GPT-4

[Free GPT-4]

[PDF] thecvf.com

Gqa: A new dataset for real-world visual reasoning and compositional question answering

DA Hudson, CD Manning - … of the IEEE/CVF conference on …, 2019 - openaccess.thecvf.com

We introduce GQA, a new dataset for real-world visual reasoning and compositional
question answering, seeking to address key shortcomings of previous VQA datasets. We …

Save Cite Cited by 1975 Related articles All 8 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] thecvf.com

Clevr: A diagnostic dataset for compositional language and elementary visual reasoning

J Johnson, B Hariharan… - Proceedings of the …, 2017 - openaccess.thecvf.com

When building artificial intelligence systems that can reason and answer questions about
visual data, we need diagnostic tests to analyze our progress and discover short-comings …

Save Cite Cited by 2645 Related articles All 18 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Semantic communications: Principles and challenges

Z Qin, X Tao, J Lu, W Tong, GY Li - arxiv preprint arxiv:2201.01389, 2021 - arxiv.org

Semantic communication, regarded as the breakthrough beyond the Shannon paradigm,
aims at the successful transmission of semantic information conveyed by the source rather …

Save Cite Cited by 384 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Spice: Semantic propositional image caption evaluation

P Anderson, B Fernando, M Johnson… - Computer Vision–ECCV …, 2016 - Springer

There is considerable interest in the task of automatically generating image captions.
However, evaluation is challenging. Existing automatic evaluation metrics are primarily …

Save Cite Cited by 2331 Related articles All 13 versions Free GPT-4

Create alert

Cite

Advanced search

Saved to My library

Image retrieval using scene graphs

A comprehensive survey of deep learning for image captioning

[HTML][HTML] Adversarial text-to-image synthesis: A review

Diffusiondet: Diffusion model for object detection

Beyond transmitting bits: Context, semantics, and task-oriented communications

Visual genome: Connecting language and vision using crowdsourced dense image annotations

Past, present, and future of simultaneous localization and map**: Toward the robust-perception age

Gqa: A new dataset for real-world visual reasoning and compositional question answering

Clevr: A diagnostic dataset for compositional language and elementary visual reasoning

Semantic communications: Principles and challenges

Spice: Semantic propositional image caption evaluation