From google gemini to openai q*(q-star): A survey of resha** the generative artificial intelligence (ai) research landscape
This comprehensive survey explored the evolving landscape of generative Artificial
Intelligence (AI), with a specific focus on the transformative impacts of Mixture of Experts …
Intelligence (AI), with a specific focus on the transformative impacts of Mixture of Experts …
Neurallift-360: Lifting an in-the-wild 2d photo to a 3d object with 360deg views
Virtual reality and augmented reality (XR) bring increasing demand for 3D content
generation. However, creating high-quality 3D content requires tedious work from a human …
generation. However, creating high-quality 3D content requires tedious work from a human …
A survey on multimodal large language models for autonomous driving
With the emergence of Large Language Models (LLMs) and Vision Foundation Models
(VFMs), multimodal AI systems benefiting from large models have the potential to equally …
(VFMs), multimodal AI systems benefiting from large models have the potential to equally …
Promptify: Text-to-image generation through interactive prompt exploration with large language models
Text-to-image generative models have demonstrated remarkable capabilities in generating
high-quality images based on textual prompts. However, crafting prompts that accurately …
high-quality images based on textual prompts. However, crafting prompts that accurately …
“An Adapt-or-Die Type of Situation”: Perception, Adoption, and Use of Text-to-Image-Generation AI by Game Industry Professionals
Text-to-image generation (TTIG) models, a recent addition to creative AI, can generate
images based on a text description. These models have begun to rival the work of …
images based on a text description. These models have begun to rival the work of …
Navigating text-to-image customization: From lycoris fine-tuning to model evaluation
Text-to-image generative models have garnered immense attention for their ability to
produce high-fidelity images from text prompts. Among these, Stable Diffusion distinguishes …
produce high-fidelity images from text prompts. Among these, Stable Diffusion distinguishes …
GenAssist: Making image generation accessible
Blind and low vision (BLV) creators use images to communicate with sighted audiences.
However, creating or retrieving images is challenging for BLV creators as it is difficult to use …
However, creating or retrieving images is challenging for BLV creators as it is difficult to use …
[HTML][HTML] Synthetic meets authentic: Leveraging llm generated datasets for yolo11 and yolov10-based apple detection through machine vision sensors
Training machine learning (ML) models for artificial intelligence (AI) and computer vision-
based object detection process typically requires large, labeled datasets, a process often …
based object detection process typically requires large, labeled datasets, a process often …
Designprompt: Using multimodal interaction for design exploration with generative ai
Visually oriented designers often struggle to create effective generative AI (GenAI) prompts.
A preliminary study identified specific issues in composing and fine-tuning prompts, as well …
A preliminary study identified specific issues in composing and fine-tuning prompts, as well …
Cam: A large language model-based creative analogy mining framework
Analogies inspire creative solutions to problems, and facilitate the creative expression of
ideas and the explanation of complex concepts. They have widespread applications in …
ideas and the explanation of complex concepts. They have widespread applications in …