MIGC: Multi-instance generation controller for text-to-image synthesis

D Zhou, Y Li, F Ma, X Zhang… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We present a Multi-Instance Generation (MIG) task, simultaneously generating
multiple instances with diverse controls in one image. Given a set of predefined coordinates …

MIGC++: Advanced multi-instance generation controller for image synthesis

D Zhou, Y Li, F Ma, Z Yang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
We introduce the Multi-Instance Generation (MIG) task, which focuses on generating
multiple instances within a single image, each accurately placed at predefined positions with …

CapHuman: Capture your moments in parallel universes

C Liang, F Ma, L Zhu, Y Deng… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We concentrate on a novel human-centric image synthesis task, that is, given only one
reference facial photograph, it is expected to generate specific individual images with diverse …

Brain and Cognitive Science Inspired Deep Learning: A Comprehensive Survey

Z Zhang, X Ding, X Liang, Y Zhou… - IEEE Transactions on …, 2025 - ieeexplore.ieee.org
Deep learning (DL) is increasingly viewed as a foundational methodology for advancing
Artificial Intelligence (AI). However, its interpretability remains limited, and it often …

StepNet: Spatial-temporal Part-aware Network for Isolated Sign Language Recognition

X Shen, Z Zheng, Y Yang - ACM Transactions on Multimedia Computing …, 2024 - dl.acm.org
The goal of sign language recognition (SLR) is to help those who are hard of hearing or deaf
overcome the communication barrier. Most existing approaches can be typically divided into …

Brainformer: Mimic human visual brain functions to machine vision models via fMRI

XB Nguyen, X Li, P Sinha, SU Khan, K Luu - Neurocomputing, 2025 - Elsevier
Human perception plays a vital role in forming beliefs and understanding reality. A deeper
understanding of brain functionality will lead to the development of novel deep neural …

Quantum-Brain: Quantum-Inspired Neural Network Approach to Vision-Brain Understanding

HQ Nguyen, XB Nguyen, H Churchill… - arXiv preprint arXiv …, 2024 - arxiv.org
Vision-brain understanding aims to extract semantic information about brain signals from
human perceptions. Existing deep learning methods for vision-brain understanding are …

MindGrapher: Dynamic-Aware fMRI-to-Video Reconstruction

R Quan, W Song, L Li, W Wang, Y Yang - openreview.net
Existing methods for fMRI-to-video reconstruction typically focus on accurately
reconstructing visual content (i.e., appearance), neglecting dynamic event information …