Point-M2AE: Multi-scale masked autoencoders for hierarchical point cloud pre-training R Zhang, Z Guo, P Gao, R Fang, B Zhao, D Wang, Y Qiao, H Li Advances in neural information processing systems 35, 27061-27074, 2022 | 264 | 2022 |
HSA-RNN: Hierarchical structure-adaptive RNN for video summarization B Zhao, X Li, X Lu Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 248 | 2018 |
Hierarchical recurrent neural network for video summarization B Zhao, X Li, X Lu Proceedings of the 25th ACM international conference on Multimedia, 863-871, 2017 | 208 | 2017 |
GS-SLAM: Dense visual SLAM with 3D Gaussian splatting C Yan, D Qu, D Xu, B Zhao, Z Wang, D Wang, X Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 158 | 2024 |
A CNN–RNN architecture for multi-label weather recognition B Zhao, X Li, X Lu, Z Wang Neurocomputing 322, 47-57, 2018 | 142 | 2018 |
CAM-RNN: Co-attention model based RNN for video captioning B Zhao, X Li, X Lu IEEE Transactions on Image Processing 28 (11), 5552-5565, 2019 | 140 | 2019 |
Reconstructive Sequence-Graph Network for Video Summarization B Zhao, H Li, X Lu, X Li IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021 | 132 | 2021 |
A general framework for edited video and raw video summarization X Li, B Zhao, X Lu IEEE Transactions on Image Processing 26 (8), 3652-3664, 2017 | 132 | 2017 |
C^3 framework: An open-source PyTorch code for crowd counting J Gao, W Lin, B Zhao, D Wang, C Gao, J Wen arXiv preprint arXiv:1907.02724, 2019 | 113 | 2019 |
MAM-RNN: Multi-level attention model based RNN for video captioning X Li, B Zhao, X Lu IJCAI 2017, 2208-2214, 2017 | 113 | 2017 |
Property-constrained dual learning for video summarization B Zhao, X Li, X Lu IEEE transactions on neural networks and learning systems 31 (10), 3989-4000, 2019 | 81 | 2019 |
TTH-RNN: Tensor-train hierarchical recurrent neural network for video summarization B Zhao, X Li, X Lu IEEE Transactions on Industrial Electronics 68 (4), 3629-3637, 2020 | 80 | 2020 |
Diffusion model is an effective planner and data synthesizer for multi-task reinforcement learning H He, C Bai, K Xu, Z Yang, W Zhang, D Wang, B Zhao, X Li Advances in neural information processing systems 36, 64896-64917, 2023 | 73 | 2023 |
Not all features matter: Enhancing few-shot CLIP with adaptive prior refinement X Zhu, R Zhang, B He, A Zhou, D Wang, B Zhao, P Gao Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 72 | 2023 |
Hierarchical multimodal transformer to summarize videos B Zhao, M Gong, X Li Neurocomputing 468, 360-369, 2022 | 68 | 2022 |
Key frame extraction in the summary space X Li, B Zhao, X Lu IEEE transactions on cybernetics 48 (6), 1923-1934, 2017 | 62 | 2017 |
Semantics-Consistent Representation Learning for Remote Sensing Image–Voice Retrieval H Ning, B Zhao, Y Yuan IEEE transactions on geoscience and remote sensing, 2021 | 48 | 2021 |
One-shot high-fidelity talking-head synthesis with deformable neural radiance field W Li, L Zhang, D Wang, B Zhao, Z Wang, M Chen, B Zhang, Z Wang, L Bo, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 47 | 2023 |
AudioVisual Video Summarization B Zhao, M Gong, X Li IEEE Transactions on Neural Networks and Learning Systems, 2021 | 44 | 2021 |
Weather GAN: Multi-domain weather translation using generative adversarial networks X Li, K Kou, B Zhao arXiv preprint arXiv:2103.05422, 2021 | 44 | 2021 |