Explainable artificial intelligence: a comprehensive review

D Minh, HX Wang, YF Li, TN Nguyen - Artificial Intelligence Review, 2022 - Springer
Thanks to the exponential growth in computing power and vast amounts of data, artificial
intelligence (AI) has witnessed remarkable developments in recent years, enabling it to be …

Deep learning and knowledge graph for image/video captioning: A review of datasets, evaluation metrics, and methods

MS Wajid, H Terashima‐Marin, P Najafirad… - Engineering …, 2024 - Wiley Online Library
Generating an image/video caption has always been a fundamental problem of Artificial
Intelligence, which is usually performed using the potential of Deep Learning Methods …

Boostmis: Boosting medical image semi-supervised learning with adaptive pseudo labeling and informative active annotation

W Zhang, L Zhu, J Hallinan, S Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, we propose a novel semi-supervised learning (SSL) framework named
BoostMIS that combines adaptive pseudo labeling and informative active annotation to …

Fine-grained image classification for crop disease based on attention mechanism

G Yang, Y He, Y Yang, B Xu - Frontiers in Plant Science, 2020 - frontiersin.org
Fine-grained image classification is a challenging task because of the difficulty in identifying
discriminant features, it is not easy to find the subtle features that fully represent the object. In …

N24news: A new dataset for multimodal news classification

Z Wang, X Shan, X Zhang, J Yang - arxiv preprint arxiv:2108.13327, 2021 - arxiv.org
Current news datasets merely focus on text features on the news and rarely leverage the
feature of images, excluding numerous essential features for news classification. In this …

Magic: Multimodal relational graph adversarial inference for diverse and unpaired text-based image captioning

W Zhang, H Shi, J Guo, S Zhang, Q Cai, J Li… - Proceedings of the …, 2022 - ojs.aaai.org
Text-based image captioning (TextCap) requires simultaneous comprehension of visual
content and reading the text of images to generate a natural language description. Although …

Consensus graph representation learning for better grounded image captioning

W Zhang, H Shi, S Tang, J **ao, Q Yu… - Proceedings of the AAAI …, 2021 - ojs.aaai.org
The contemporary visual captioning models frequently hallucinate objects that are not
actually in a scene, due to the visual misclassification or over-reliance on priors that …

Solving one-dimensional cutting stock problems with the deep reinforcement learning

J Fang, Y Rao, Q Luo, J Xu - Mathematics, 2023 - mdpi.com
It is well known that the one-dimensional cutting stock problem (1DCSP) is a combinatorial
optimization problem with nondeterministic polynomial (NP-hard) characteristics. Heuristic …

Automatic image caption generation using deep learning

A Verma, AK Yadav, M Kumar, D Yadav - Multimedia Tools and …, 2024 - Springer
Image captioning is an interesting and challenging task with applications in diverse domains
such as image retrieval, organizing and locating images of users' interest, etc. It has huge …

Relational graph learning for grounded video description generation

W Zhang, XE Wang, S Tang, H Shi, H Shi… - Proceedings of the 28th …, 2020 - dl.acm.org
Grounded video description (GVD) encourages captioning models to attend to appropriate
video regions (eg, objects) dynamically and generate a description. Such a setting can help …