Comprehensive Subjective and Objective Evaluation Method for Text-generated Video

Z Qi, P Shi, S Wang, Z Zhang, Z Ying, D Pan - arxiv preprint arxiv …, 2025 - arxiv.org
Recent text-to-video (T2V) technology advancements, as demonstrated by models such as
Gen3, Pika, and Sora, have significantly broadened its applicability and popularity. This …

The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives

S Arif, T Arif, MS Haroon, AJ Khan, AA Raza… - arxiv preprint arxiv …, 2024 - arxiv.org
This paper introduces the concept of an education tool that utilizes Generative Artificial
Intelligence (GenAI) to enhance storytelling for children. The system combines GenAI-driven …

SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing

V Biyyala, BC Kathuria, J Li, Y Zhang - arxiv preprint arxiv:2501.07554, 2025 - arxiv.org
Video editing models have advanced significantly, but evaluating their performance remains
challenging. Traditional metrics, such as CLIP text and image scores, often fall short: text …