From Google Gemini to OpenAI Q* (Q-Star): A survey of reshaping the generative artificial intelligence (AI) research landscape

TR McIntosh, T Susnjak, T Liu, P Watters… - arxiv preprint arxiv …, 2023 - arxiv.org
This comprehensive survey explored the evolving landscape of generative Artificial
Intelligence (AI), with a specific focus on the transformative impacts of Mixture of Experts …

NeuralLift-360: Lifting an in-the-wild 2D photo to a 3D object with 360° views

D Xu, Y Jiang, P Wang, Z Fan… - Proceedings of the …, 2023 - openaccess.thecvf.com
Virtual reality and augmented reality (XR) bring increasing demand for 3D content
generation. However, creating high-quality 3D content requires tedious work from a human …

A survey on multimodal large language models for autonomous driving

C Cui, Y Ma, X Cao, W Ye, Y Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com
With the emergence of Large Language Models (LLMs) and Vision Foundation Models
(VFMs), multimodal AI systems benefiting from large models have the potential to equally …

Promptify: Text-to-image generation through interactive prompt exploration with large language models

S Brade, B Wang, M Sousa, S Oore… - Proceedings of the 36th …, 2023 - dl.acm.org
Text-to-image generative models have demonstrated remarkable capabilities in generating
high-quality images based on textual prompts. However, crafting prompts that accurately …

“An Adapt-or-Die Type of Situation”: Perception, Adoption, and Use of Text-to-Image-Generation AI by Game Industry Professionals

V Vimpari, A Kultima, P Hämäläinen… - Proceedings of the …, 2023 - dl.acm.org
Text-to-image generation (TTIG) models, a recent addition to creative AI, can generate
images based on a text description. These models have begun to rival the work of …

Navigating text-to-image customization: From LyCORIS fine-tuning to model evaluation

SY Yeh, YG Hsieh, Z Gao, BBW Yang… - The Twelfth …, 2023 - openreview.net
Text-to-image generative models have garnered immense attention for their ability to
produce high-fidelity images from text prompts. Among these, Stable Diffusion distinguishes …

GenAssist: Making image generation accessible

M Huh, YH Peng, A Pavel - Proceedings of the 36th Annual ACM …, 2023 - dl.acm.org
Blind and low vision (BLV) creators use images to communicate with sighted audiences.
However, creating or retrieving images is challenging for BLV creators as it is difficult to use …

Synthetic meets authentic: Leveraging LLM generated datasets for YOLO11 and YOLOv10-based apple detection through machine vision sensors

R Sapkota, Z Meng, M Karkee - Smart Agricultural Technology, 2024 - Elsevier
Training machine learning (ML) models for artificial intelligence (AI) and computer vision-
based object detection process typically requires large, labeled datasets, a process often …

DesignPrompt: Using multimodal interaction for design exploration with generative AI

X Peng, J Koch, WE Mackay - Proceedings of the 2024 ACM Designing …, 2024 - dl.acm.org
Visually oriented designers often struggle to create effective generative AI (GenAI) prompts.
A preliminary study identified specific issues in composing and fine-tuning prompts, as well …

CAM: A large language model-based creative analogy mining framework

B Bhavya, J Xiong, C Zhai - Proceedings of the ACM Web Conference …, 2023 - dl.acm.org
Analogies inspire creative solutions to problems, and facilitate the creative expression of
ideas and the explanation of complex concepts. They have widespread applications in …