- Academic Search

A survey on video diffusion models

Z Xing, Q Feng, H Chen, Q Dai, H Hu, H Xu… - ACM Computing …, 2024 - dl.acm.org

The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …

Cited by 92

[PDF] arxiv.org

Lavie: High-quality video generation with cascaded latent diffusion models

Y Wang, X Chen, X Ma, S Zhou, Z Huang… - International Journal of …, 2024 - Springer

This work aims to learn a high-quality text-to-video (T2V) generative model by leveraging a
pre-trained text-to-image (T2I) model as a basis. It is a highly desirable yet challenging task …

Cited by 222

[PDF] thecvf.com

Vbench: Comprehensive benchmark suite for video generative models

Z Huang, Y He, J Yu, F Zhang, C Si… - Proceedings of the …, 2024 - openaccess.thecvf.com

Video generation has witnessed significant advancements yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …

Cited by 219

[PDF] thecvf.com

Gauhuman: Articulated gaussian splatting from monocular human videos

S Hu, T Hu, Z Liu - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

We present GauHuman, a 3D human model with Gaussian Splatting for both fast training (1-2
minutes) and real-time rendering (up to 189 FPS) compared with existing NeRF-based …

Cited by 76

[PDF] arxiv.org

Freeinit: Bridging initialization gap in video diffusion models

T Wu, C Si, Y Jiang, Z Huang, Z Liu - European Conference on Computer …, 2024 - Springer

Though diffusion-based video generation has witnessed rapid progress, the inference
results of existing models still exhibit unsatisfactory temporal consistency and unnatural …

Cited by 40

[PDF] thecvf.com

Videobooth: Diffusion-based video generation with image prompts

Y Jiang, T Wu, S Yang, C Si, D Lin… - Proceedings of the …, 2024 - openaccess.thecvf.com

Text-driven video generation witnesses rapid progress. However, merely using text prompts
is not enough to depict the desired subject appearance that accurately aligns with users' …

Cited by 47

[PDF] arxiv.org

Id-animator: Zero-shot identity-preserving human video generation

X He, Q Liu, S Qian, X Wang, T Hu, K Cao… - arXiv preprint arXiv …, 2024 - arxiv.org

Generating high-fidelity human video with specified identities has attracted significant
attention in the content generation community. However, existing techniques struggle to …

Cited by 29

[PDF] thecvf.com

Disco: Disentangled control for realistic human dance generation

T Wang, L Li, K Lin, Y Zhai, CC Lin… - Proceedings of the …, 2024 - openaccess.thecvf.com

Generative AI has made significant strides in computer vision, particularly in text-driven
image/video synthesis (T2I/T2V). Despite the notable advancements, it remains challenging …

Cited by 58

[PDF] acm.org

Appearance and Pose-guided Human Generation: A Survey

F Liao, X Zou, W Wong - ACM Computing Surveys, 2024 - dl.acm.org

Appearance and pose-guided human generation is a burgeoning field that has captured
significant attention. This subject's primary objective is to transfer pose information from a …

Cited by 5

[PDF] neurips.cc

Compositional abilities emerge multiplicatively: Exploring diffusion models on a synthetic task

M Okawa, ES Lubana, R Dick… - Advances in Neural …, 2024 - proceedings.neurips.cc

Modern generative models exhibit unprecedented capabilities to generate extremely
realistic data. However, given the inherent compositionality of real world, reliable use of …

Cited by 41

Text2performer: Text-driven human video generation