Yiyuan Zhang

Cited by

	All	Since 2020
Citations	664	664
h-index	9	9
i10-index	9	9

Citations per year: 2021: 2, 2022: 33, 2023: 89, 2024: 451, 2025: 89

Public access (based on funding mandates): 5 articles available, 1 article not available


Yiyuan Zhang

Other names: 张懿元

MMLab, The Chinese University of Hong Kong

Verified email at ie.cuhk.edu.hk

Multimodal Representation, Computer Vision


Title	Cited by	Year
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition. X Ding, Y Zhang, Y Ge, S Zhao, L Song, X Yue, Y Shan. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	146	2024
Meta-transformer: A Unified Framework for Multimodal Learning. Y Zhang, K Gong, K Zhang, H Li, Y Qiao, W Ouyang, X Yue. arXiv preprint arXiv:2307.10802, 2023	137	2023
OneLLM: One Framework to Align All Modalities with Language. J Han, K Gong, Y Zhang, J Wang, K Zhang, D Lin, Y Qiao, P Gao, X Yue. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024	104	2024
Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision Tasks. W Han, X Dong, Y Zhang, D Crandall, CZ Xu, J Shen. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024	102*	2024
Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-identification. Y Zhang, S Zhao, Y Kang, J Shen. European Conference on Computer Vision (ECCV), 462-479, 2022	54	2022
Dual-Semantic Consistency Learning for Visible-Infrared Person Re-identification. Y Zhang, Y Kang, S Zhao, J Shen. IEEE Transactions on Information Forensics and Security 18, 1554-1565, 2022	40	2022
Unireplknet: A universal perception large-kernel convnet for audio, video, point cloud, time-series and image recognition. arXiv 2023. X Ding, Y Zhang, Y Ge, S Zhao, L Song, X Yue, Y Shan. arXiv preprint arXiv:2311.15599, 2024	25	2024
Online Vectorized HD Map Construction using Geometry. Z Zhang, Y Zhang, X Ding, F Jin, X Yue. European Conference on Computer Vision, 73-90, 2025	19	2025
Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors. L Ding, S Dong, Z Huang, Z Wang, Y Zhang, K Gong, D Xu, T Xue. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024	13	2024
Multimodal pathway: Improve transformers with irrelevant data from other modalities. Y Zhang, X Ding, K Gong, Y Ge, Y Shan, X Yue. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	7	2024
Towards Unified and Effective Domain Generalization. Y Zhang, K Gong, X Ding, K Zhang, F Lv, K Keutzer, X Yue. arXiv preprint arXiv:2310.10008, 2023	6	2023
Interactivevideo: User-centric controllable video generation with synergistic multimodal instructions. Y Zhang, Y Kang, Z Zhang, X Ding, S Zhao, X Yue. arXiv preprint arXiv:2402.03040, 2024	4	2024
Meta-transformer: A unified framework for multimodal learning. arXiv 2023. Y Zhang, K Gong, K Zhang, H Li, Y Qiao, W Ouyang, X Yue. arXiv preprint arXiv:2307.10802, 2023	4	2023
Explore the limits of omni-modal pretraining at scale. Y Zhang, H Li, J Liu, X Yue. arXiv preprint arXiv:2406.09412, 2024	2	2024
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations. Y Zhang, X Ding, X Yue. arXiv preprint arXiv:2410.08049, 2024	1	2024
