Towards unified multimodal editing with enhanced knowledge collaboration
The swift advancement in Multimodal LLMs (MLLMs) also presents significant challenges for
effective knowledge editing. Current methods, including intrinsic knowledge editing and …
effective knowledge editing. Current methods, including intrinsic knowledge editing and …
SPDiffusion: Semantic Protection Diffusion for Multi-concept Text-to-image Generation
Y Zhang, R Zhang, X Nie, H Li, J Chen, Y Hao… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent text-to-image models have achieved remarkable success in generating high-quality
images. However, when tasked with multi-concept generation which creates images …
images. However, when tasked with multi-concept generation which creates images …
RelationBooth: Towards Relation-Aware Customized Object Generation
Customized image generation is crucial for delivering personalized content based on user-
provided image prompts, aligning large-scale text-to-image diffusion models with individual …
provided image prompts, aligning large-scale text-to-image diffusion models with individual …
: Exploring Embodied Emotion Through A Large-Scale Egocentric Video Dataset
Understanding human emotions is fundamental to enhancing human-computer interaction,
especially for embodied agents that mimic human behavior. Traditional emotion analysis …
especially for embodied agents that mimic human behavior. Traditional emotion analysis …