Deep image captioning: A review of methods, trends and future challenges

L Xu, Q Tang, J Lv, B Zheng, X Zeng, W Li - Neurocomputing, 2023‏ - Elsevier
Image captioning, also called report generation in medical field, aims to describe visual
content of images in human language, which requires to model semantic relationship …

Make llm a testing expert: Bringing human-like interaction to mobile gui testing via functionality-aware decisions

Z Liu, C Chen, J Wang, M Chen, B Wu, X Che… - Proceedings of the …, 2024‏ - dl.acm.org
Automated Graphical User Interface (GUI) testing plays a crucial role in ensuring app quality,
especially as mobile applications have become an integral part of our daily lives. Despite …

“They only care to show us the wheelchair”: Disability representation in text-to-image AI models

KA Mack, R Qadri, R Denton, SK Kane… - Proceedings of the 2024 …, 2024‏ - dl.acm.org
This paper reports on disability representation in images output from text-to-image (T2I)
generative AI systems. Through eight focus groups with 25 people with disabilities, we found …

Chatting with gpt-3 for zero-shot human-like mobile automated gui testing

Z Liu, C Chen, J Wang, M Chen, B Wu, X Che… - arxiv preprint arxiv …, 2023‏ - arxiv.org
Mobile apps are indispensable for people's daily life, and automated GUI (Graphical User
Interface) testing is widely used for app quality assurance. There is a growing interest in …

From provenance to aberrations: Image creator and screen reader user perspectives on alt text for AI-generated images

M Das, AJ Fiannaca, MR Morris, SK Kane… - Proceedings of the …, 2024‏ - dl.acm.org
AI-generated images are proliferating as a new visual medium. However, state-of-the-art
image generation models do not output alternative (alt) text with their images, rendering …

Understanding visual arts experiences of blind people

FM Li, L Zhang, M Bandukda, A Stangl… - Proceedings of the …, 2023‏ - dl.acm.org
Visual arts play an important role in cultural life and provide access to social heritage and
self-enrichment, but most visual arts are inaccessible to blind people. Researchers have …

Measuring representational harms in image captioning

A Wang, S Barocas, K Laird, H Wallach - Proceedings of the 2022 ACM …, 2022‏ - dl.acm.org
Previous work has largely considered the fairness of image captioning systems through the
underspecified lens of “bias.” In contrast, we present a set of techniques for measuring five …

Investigating use cases of ai-powered scene description applications for blind and low vision people

RE Gonzalez Penuela, J Collins, C Bennett… - Proceedings of the …, 2024‏ - dl.acm.org
" Scene description" applications that describe visual content in a photo are useful daily
tools for blind and low vision (BLV) people. Researchers have studied their use, but they …

“It's Kind of Context Dependent”: Understanding Blind and Low Vision People's Video Accessibility Preferences Across Viewing Scenarios

L Jiang, C Jung, M Phutane, A Stangl… - Proceedings of the 2024 …, 2024‏ - dl.acm.org
While audio description (AD) is the standard approach for making videos accessible to blind
and low vision (BLV) people, existing AD guidelines do not consider BLV users' varied …

Supporting accessible data visualization through audio data narratives

A Siu, G SH Kim, S O'Modhrain, S Follmer - Proceedings of the 2022 CHI …, 2022‏ - dl.acm.org
Online data visualizations play an important role in informing public opinion but are often
inaccessible to screen reader users. To address the need for accessible data …