SurGen: Text-guided diffusion model for surgical video generation

J Cho, S Schmidgall, C Zakka, M Mathur… - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion-based video generation models have made significant strides, producing outputs
with improved visual fidelity, temporal coherence, and user control. These advancements …

ChildDiffusion: Unlocking the potential of generative AI and controllable augmentations for child facial data using stable diffusion and large language models

MA Farooq, W Yao, P Corcoran - arXiv preprint arXiv:2406.11592, 2024 - arxiv.org
In this research work we have proposed a high-level ChildDiffusion framework capable of
generating photorealistic child facial samples and further embedding several intelligent …

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

J Lei, R Zhang, X Hu, W Lin, Z Li, W Sun, R Du… - arXiv preprint arXiv …, 2025 - arxiv.org
With the rapid development of diffusion models, text-to-image (T2I) models have made
significant progress, showcasing impressive abilities in prompt following and image …

A Comparative Study on Diffusion Sampling Methods Across Diverse Medical Imaging Modalities

MA Farooq, A Abaid, I Ullah… - Proceedings of the …, 2024 - openaccess.thecvf.com
The evaluation of diffusion-based image sampling methods is pivotal in improving the
quality and reliability of synthetic data generation, particularly in medical imaging …

To train, or not to train: exploring foundation models for TBAD segmentation

A Abaid, I Ullah - IET Conference Proceedings CP887, 2024 - IET
Recent advancements in biomedical image analysis have been significantly influenced by
vision foundation models (FMs) originally designed for general computer vision tasks. These …