Zheng Ge
Senior Researcher, StepFun
Verified email at fuji.waseda.jp
Title
Cited by
Year
YOLOX: Exceeding YOLO series in 2021
Z Ge, S Liu, F Wang, Z Li, J Sun
arXiv preprint arXiv:2107.08430, 2021
Cited by: 5767; Year: 2021
BEVDepth: Acquisition of reliable depth for multi-view 3D object detection
Y Li, Z Ge, G Yu, J Yang, Z Wang, Y Shi, J Sun, Z Li
Proceedings of the AAAI conference on artificial intelligence, 2022
Cited by: 615; Year: 2022
OTA: Optimal transport assignment for object detection
Z Ge, S Liu, Z Li, O Yoshie, J Sun
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
Cited by: 551; Year: 2021
BEVStereo: Enhancing depth estimation in multi-view 3D object detection with dynamic temporal stereo
Y Li, H Bao, Z Ge, J Yang, J Sun, Z Li
Proceedings of the AAAI conference on artificial intelligence, 2022
Cited by: 218*; Year: 2022
NMS by representative region: Towards crowded pedestrian detection by proposal pairing
X Huang, Z Ge, Z Jie, O Yoshie
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
Cited by: 199; Year: 2020
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
Z Qi, R Dong, G Fan, Z Ge, X Zhang, K Ma, L Yi
International Conference on Machine Learning (ICML), 2023
Cited by: 123; Year: 2023
Implicit identity leakage: The stumbling block to improving deepfake detection generalization
S Dong, J Wang, R Ji, J Liang, H Fan, Z Ge
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 122; Year: 2023
DreamLLM: Synergistic multimodal comprehension and creation
R Dong, C Han, Y Peng, Z Qi, Z Ge, J Yang, L Zhao, J Sun, H Zhou, H Wei, ...
ICLR 2024 (Spotlight), 2024
Cited by: 112; Year: 2024
Dense teacher: Dense pseudo-labels for semi-supervised object detection
H Zhou, Z Ge, S Liu, W Mao, Z Li, H Yu, J Sun
Proceedings of the European conference on computer vision (ECCV), 2022
Cited by: 108; Year: 2022
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
R Dong, Z Qi, L Zhang, J Zhang, J Sun, Z Ge, L Yi, K Ma
International Conference on Learning Representations (ICLR), 2023, 2022
Cited by: 95; Year: 2022
Vary: Scaling up the vision vocabulary for large vision-language models
H Wei, L Kong, J Chen, L Zhao, Z Ge, J Yang, J Sun, C Han, X Zhang
ECCV 2024, 2024
Cited by: 72; Year: 2024
Exploring recurrent long-term temporal fusion for multi-view 3D perception
C Han, J Yang, J Sun, Z Ge, R Dong, H Zhou, W Mao, Y Peng, X Zhang
RA-L & IROS (Oral), 2024
Cited by: 58; Year: 2024
STS: Surround-view temporal stereo for multi-view 3D detection
Z Wang, C Min, Z Ge, Y Li, Z Li, H Yang, D Huang
arXiv preprint arXiv:2208.10145, 2022
Cited by: 57; Year: 2022
LLA: Loss-aware label assignment for dense pedestrian detection
Z Ge, J Wang, X Huang, S Liu, O Yoshie
Neurocomputing 462, 272-281, 2021
Cited by: 52; Year: 2021
ChatSpot: Bootstrapping multimodal LLMs via precise referring instruction tuning
L Zhao, E Yu, Z Ge, J Yang, H Wei, H Zhou, J Sun, Y Peng, R Dong, ...
IJCAI 2024 (Long Oral), 2023
Cited by: 49; Year: 2023
PS-RCNN: Detecting secondary human instances in a crowd via primary object suppression
Z Ge, Z Jie, X Huang, R Xu, O Yoshie
2020 IEEE international conference on multimedia and expo (ICME), 1-6, 2020
Cited by: 42; Year: 2020
ShapeLLM: Universal 3D object understanding for embodied interaction
Z Qi, R Dong, S Zhang, H Geng, C Han, Z Ge, L Yi, K Ma
ECCV 2024, 2024
Cited by: 40; Year: 2024
MatrixVT: Efficient multi-camera to BEV transformation for 3D perception
H Zhou, Z Ge, Z Li, X Zhang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 40; Year: 2023
Small language model meets with reinforced vision vocabulary
H Wei, L Kong, J Chen, L Zhao, Z Ge, E Yu, J Sun, C Han, X Zhang
arXiv preprint arXiv:2401.12503, 2024
Cited by: 30; Year: 2024
Align-DETR: Improving DETR with simple IoU-aware BCE loss
Z Cai, S Liu, G Wang, Z Ge, X Zhang, D Huang
arXiv preprint arXiv:2304.07527, 2023
Cited by: 30; Year: 2023