Yang Zhao

Παρατίθεται από

	Όλα	Από το 2020
Παραθέσεις	592	587
h-index	10	10
i10-index	11	11

300

150

225

20192020202120222023202420253 21 46 73 130 285 29

Δημόσια πρόσβαση

Προβολή όλων

10 άρθρα

0 άρθρα

διαθέσιμα

μη διαθέσιμα

Σύμφωνα με εντολές χρηματοδότησης

Παρακολούθηση

Yang Zhao

Zhejiang University

Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα zju.edu.cn

Computer Vision Multi-Modal Learning Video Understanding


Τίτλος Ταξινόμηση με βάση τις αναφορές Ταξινόμηση κατά έτος Ταξινόμηση κατά τίτλο	Παρατίθεται από Παρατίθεται από	Έτος
Where does it exist: Spatio-temporal video grounding for multi-form sentences Z Zhang, Z Zhao, Y Zhao, Q Wang, H Liu, L Gao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020	121	2020
Bubogpt: Enabling visual grounding in multi-modal llms Y Zhao, Z Lin, D Zhou, Z Huang, J Feng, B Kang arXiv preprint arXiv:2307.08581, 2023	96	2023
Discriminative and Correlative Partial Multi-Label Learning. H Wang, W Liu, Y Zhao, C Zhang, T Hu, G Chen IJCAI, 3691-3697, 2019	95	2019
Cascaded prediction network via segment tree for temporal video grounding Y Zhao, Z Zhao, Z Zhang, Z Lin Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	85	2021
Chat-3d: Data-efficiently tuning large language model for universal dialogue of 3d scenes Z Wang, H Huang, Y Zhao, Z Zhang, Z Zhao arXiv preprint arXiv:2308.08769, 2023	52	2023
Connecting multi-modal contrastive representations Z Wang, Y Zhao, H Huang, J Liu, A Yin, L Tang, L Li, Y Wang, Z Zhang, ... Advances in Neural Information Processing Systems 36, 22099-22114, 2023	32	2023
Chat-3d v2: Bridging 3d scene and large language models with object identifiers H Huang, Z Wang, R Huang, L Liu, X Cheng, Y Zhao, T Jin, Z Zhao arXiv preprint arXiv:2312.08168, 2023	25	2023
3drp-net: 3d relative position-aware network for 3d visual grounding Z Wang, H Huang, Y Zhao, L Li, X Cheng, Y Zhu, A Yin, Z Zhao arXiv preprint arXiv:2307.13363, 2023	16	2023
Distilling coarse-to-fine semantic matching knowledge for weakly supervised 3d visual grounding Z Wang, H Huang, Y Zhao, L Li, X Cheng, Y Zhu, A Yin, Z Zhao Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	16	2023
Learning From Multi-Dimensional Partial Labels. H Wang, W Liu, Y Zhao, T Hu, K Chen, G Chen IJCAI, 2943-2949, 2020	13	2020
Extending multi-modal contrastive representations Z Wang, Z Zhang, L Liu, Y Zhao, H Huang, T Jin, Z Zhao arXiv preprint arXiv:2310.08884, 2023	10	2023
Video-guided curriculum learning for spoken video grounding Y Xia, Z Zhao, S Ye, Y Zhao, H Li, Y Ren Proceedings of the 30th ACM International Conference on Multimedia, 5191-5200, 2022	8	2022
Towards effective multi-modal interchanges in zero-resource sounding object localization Y Zhao, C Zhang, H Huang, H Li, Z Zhao Advances in Neural Information Processing Systems 35, 38089-38102, 2022	7	2022
Scene-robust natural language video localization via learning domain-invariant representations Z Wang, Y Zhao, H Huang, Y Xia, Z Zhao Findings of the Association for Computational Linguistics: ACL 2023, 144-160, 2023	6	2023
Date: Domain adaptive product seeker for e-commerce H Li, H Jiang, T Jin, M Li, Y Chen, Z Lin, Y Zhao, Z Zhao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	6	2023
Multi-Modal Domain Adaptation Across Video Scenes for Temporal Video Grounding H Huang, Y Zhao, Z Wang, Y Xia, Z Zhao arXiv preprint arXiv:2312.13633, 2023	2	2023
Antpivot: Livestream highlight detection via hierarchical attention mechanism Y Zhao, X Lin, W Xu, M Zheng, Z Liu, Z Zhao arXiv preprint arXiv:2206.04888, 2022	2	2022

Δεν είναι δυνατή η εκτέλεση της ενέργειας από το σύστημα αυτή τη στιγμή. Προσπαθήστε ξανά αργότερα.

Άρθρα 1–17

Παραθέσεις ανά έτος

Διπλότυπες αναφορές

Συγχωνευμένες αναφορές

Προσθήκη από κοινού συγγραφέωνΣυν-συγγραφείς

Παρακολούθηση

Παρατίθεται από