Enhancing Baidu Multimodal Advertisement with Chinese Text-to-Image Generation via Bilingual Alignment and Caption Synthesis

K Zhao, X Zhao, Z Jin, Y Yang, W Tao, C Han… - Proceedings of the 47th …, 2024 - dl.acm.org
Recent advances in generative artificial intelligence have revolutionized information
retrieval and content generation, opening up new opportunities for the e-commerce industry …

DiffPoint: Single and multi-view point cloud reconstruction with ViT based diffusion model

Y Feng, X Shi, M Cheng, Y Xiong - arXiv preprint arXiv:2402.11241, 2024 - arxiv.org
As the task of 2D-to-3D reconstruction has gained significant attention in various real-world
scenarios, it becomes crucial to be able to generate high-quality point clouds. Despite the …

FFA Sora, video generation as fundus fluorescein angiography simulator

X Wu, L Wang, R Chen, B Liu, W Zhang, X Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
Fundus fluorescein angiography (FFA) is critical for diagnosing retinal vascular diseases,
but beginners often struggle with image interpretation. This study develops FFA Sora, a text …

Accelerating Vision Diffusion Transformers with Skip Branches

G Chen, X Zhao, Y Zhou, T Chen, C Yu - arXiv preprint arXiv:2411.17616, 2024 - arxiv.org
Diffusion Transformers (DiT), an emerging image and video generation model architecture,
has demonstrated great potential because of its high generation quality and scalability …

Lateralization MLP: A Simple Brain-inspired Architecture for Diffusion

Z Hu, M Rostami - arXiv preprint arXiv:2405.16098, 2024 - arxiv.org
The Transformer architecture has dominated machine learning in a wide range of tasks. The
specific characteristic of this architecture is an expensive scaled dot-product attention …

Enhancing Image Generation with Diffusion Transformer Architecture

R Wu - scitepress.org
In image generation tasks, this study aims to explore the advantages and potential of a
fusion model that integrates transformer and diffusion models. Specifically, research …