Zhixi Cai
Research Fellow, Monash University
Verified email at monash.edu
Title / Cited by / Year
MARLIN: Masked Autoencoder for Facial Video Representation LearnINg
Z Cai, S Ghosh, K Stefanov, A Dhall, J Cai, H Rezatofighi, R Haffari, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 81; Year: 2023
Do you really mean that? Content driven audio-visual deepfake dataset and multimodal method for temporal forgery localization
Z Cai, K Stefanov, A Dhall, M Hayat
2022 International Conference on Digital Image Computing: Techniques and …, 2022
Cited by: 60; Year: 2022
AV-Deepfake1M: A large-scale LLM-driven audio-visual deepfake dataset
Z Cai, S Ghosh, AP Adatia, M Hayat, A Dhall, T Gedeon, K Stefanov
Proceedings of the 32nd ACM International Conference on Multimedia, 7414-7423, 2024
Cited by: 27; Year: 2024
Glitch in the matrix: A large scale benchmark for content driven audio–visual forgery detection and localization
Z Cai, S Ghosh, A Dhall, T Gedeon, K Stefanov, M Hayat
Computer Vision and Image Understanding 236, 103818, 2023
Cited by: 27; Year: 2023
1M-Deepfakes Detection Challenge
Z Cai, A Dhall, S Ghosh, M Hayat, D Kollias, K Stefanov, U Tariq
Proceedings of the 32nd ACM International Conference on Multimedia, 11355-11359, 2024
Cited by: 4; Year: 2024
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
F Ke*, Z Cai*, S Jahangard*, W Wang, PD Haghighi, H Rezatofighi
European Conference on Computer Vision, 2024
Cited by: 3; Year: 2024
JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups
S Jahangard, Z Cai, S Wen, H Rezatofighi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 3; Year: 2024
Emolysis: A multimodal open-source group emotion analysis and visualization toolkit
S Ghosh*, Z Cai*, P Gupta, G Sharma, A Dhall, M Hayat, T Gedeon
International Conference on Affective Computing and Intelligent Interaction …, 2023
Cited by: 3; Year: 2023
Hi-slam: Scaling-up semantics in slam with a hierarchically categorical gaussian splatting
B Li, Z Cai, YF Li, I Reid, H Rezatofighi
arXiv preprint arXiv:2409.12518, 2024
Cited by: 1; Year: 2024
NEUSIS: A Compositional Neuro-Symbolic Framework for Autonomous Perception, Reasoning, and Planning in Complex UAV Search Missions
Z Cai, CR Cardenas, K Leo, C Zhang, K Backman, H Li, B Li, ...
arXiv preprint arXiv:2409.10196, 2024
Cited by: 1; Year: 2024
NAVER: A Neuro-Symbolic Compositional Automaton for Visual Grounding with Explicit Logic Reasoning
Z Cai, F Ke, S Jahangard, MG de la Banda, R Haffari, PJ Stuckey, ...
arXiv preprint arXiv:2502.00372, 2025
Year: 2025
MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing
S Ghosh, Z Cai, A Dhall, D Kollias, R Goecke, T Gedeon
Proceedings of the 2nd International Workshop on Multimodal and Responsible …, 2024
Year: 2024
Content-Driven Multimodal Deepfake Generation and Temporal Localization
Z Cai
Monash University, 2024
Year: 2024
Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase
S Ghosh*, R Hasan*, P Agrawal, Z Cai, S Soon, A Dhall, T Gedeon
arXiv preprint arXiv:2305.06110, 2023
Year: 2023