Internlm2 technical report Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ... arXiv preprint arXiv:2403.17297, 2024 | 423* | 2024 |
Depthformer: Exploiting long-range correlation and local information for accurate monocular depth estimation Z Li, Z Chen, X Liu, J Jiang Machine Intelligence Research 20 (6), 837-854, 2023 | 203 | 2023 |
Are we on the right way for evaluating large vision-language models? L Chen, J Li, X Dong, P Zhang, Y Zang, Z Chen, H Duan, J Wang, Y Qiao, ... arXiv preprint arXiv:2403.20330, 2024 | 167 | 2024 |
Disentangle your dense object detector Z Chen, C Yang, Q Li, F Zhao, ZJ Zha, F Wu Proceedings of the 29th ACM international conference on multimedia, 4939-4948, 2021 | 147 | 2021 |
AutoAlignV2: Deformable Feature Aggregation for Dynamic Multi-Modal 3D Object Detection Z Chen, Z Li, S Zhang, L Fang, Q Jiang, F Zhao ECCV2022, 2022 | 132* | 2022 |
AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object Detection Z Chen, Z Li, S Zhang, L Fang, Q Jiang, F Zhao, B Zhou, H Zhao IJCAI 2022, 2022 | 132 | 2022 |
Sharegpt4video: Improving video understanding and generation with better captions L Chen, X Wei, J Li, X Dong, P Zhang, Y Zang, Z Chen, H Duan, B Lin, ... arXiv preprint arXiv:2406.04325, 2024 | 100 | 2024 |
Stacked U-Nets with Multi-output for Road Extraction T Sun, Z Chen, W Yang, Y Wang Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 83 | 2018 |
BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection Z Chen, Z Li, S Zhang, L Fang, Q Jiang, F Zhao ICLR2023, 2022 | 78 | 2022 |
Plainmamba: Improving non-hierarchical mamba in visual recognition C Yang, Z Chen, M Espinosa, L Ericsson, Z Wang, J Liu, EJ Crowley arXiv preprint arXiv:2403.17695, 2024 | 74 | 2024 |
SimIPU: Simple 2d image and 3d point cloud unsupervised pre-training for spatial-aware visual representations Z Li, Z Chen, A Li, L Fang, Q Jiang, X Liu, J Jiang, B Zhou, H Zhao AAAI2022 36 (2), 1500-1508, 2022 | 69 | 2022 |
Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection Z Chen, Z Li, S Zhang, L Fang, Q Jiang, F Zhao ACM MM 2022, 2022 | 58 | 2022 |
Lidar-llm: Exploring the potential of large language models for 3d lidar understanding S Yang, J Liu, R Zhang, M Pan, Z Guo, X Li, Z Chen, P Gao, Y Guo, ... arXiv preprint arXiv:2312.14074, 2023 | 55 | 2023 |
Unsupervised Domain Adaptation for Monocular 3D Object Detection via Self-Training Z Li, Z Chen, A Li, L Fang, Q Jiang, X Liu, J Jiang ECCV2022, 2022 | 47 | 2022 |
Exploring sparse visual prompt for domain adaptive dense prediction S Yang, J Wu, J Liu, X Li, Q Zhang, M Pan, Y Gan, Z Chen, S Zhang Proceedings of the AAAI Conference on Artificial Intelligence 38 (15), 16334 …, 2024 | 38* | 2024 |
Agent-flan: Designing data and methods of effective agent tuning for large language models Z Chen, K Liu, Q Wang, W Zhang, J Liu, D Lin, K Chen, F Zhao arXiv preprint arXiv:2403.12881, 2024 | 37 | 2024 |
DETRDistill: A Universal Knowledge Distillation Framework for DETR-families J Chang, S Wang, G Xu, Z Chen, C Yang, F Zhao ICCV2023, 2022 | 32 | 2022 |
Towards Domain Generalization for Multi-view 3D Object Detection in Bird-Eye-View S Wang, X Zhao, HM Xu, Z Chen, D Yu, J Chang, Z Yang, F Zhao CVPR2023, 2023 | 24 | 2023 |
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step Z Chen, W Du, W Zhang, K Liu, J Liu, M Zheng, J Zhuo, S Zhang, D Lin, ... arXiv preprint arXiv:2312.14033, 2023 | 21* | 2023 |
DDOD: Dive deeper into the disentanglement of object detector Z Chen, C Yang, J Chang, F Zhao, ZJ Zha, F Wu IEEE Transactions on Multimedia 26, 284-298, 2023 | 21 | 2023 |