Seuraa
Yuxin Chen
Yuxin Chen
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences.
Vahvistettu sähköpostiosoite verkkotunnuksessa ia.ac.cn
Nimike
Viittaukset
Viittaukset
Vuosi
Channel-wise topology refinement graph convolution for skeleton-based action recognition
Y Chen, Z Zhang, C Yuan, B Li, Y Deng, W Hu
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
7902021
Open-vocabulary one-stage detection with hierarchical visual-language knowledge distillation
Z Ma, G Luo, J Gao, L Li, Y Chen, S Wang, C Zhang, W Hu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
952022
Graph convolutional network with structure pooling and joint-wise channel attention for action recognition
Y Chen, G Ma, C Yuan, B Li, H Zhang, F Wang, W Hu
Pattern Recognition 103, 107321, 2020
652020
Pixel level data augmentation for semantic image segmentation using generative adversarial networks
S Liu, J Zhang, Y Chen, Y Liu, Z Qin, T Wan
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
592019
AED-Net: An abnormal event detection network
T Wang, Z Miao, Y Chen, Y Zhou, G Shan, H Snoussi
Engineering 5 (5), 930-939, 2019
542019
TranSkeleton: Hierarchical spatial–temporal transformer for skeleton-based action recognition
H Liu, Y Liu, Y Chen, C Yuan, B Li, W Hu
IEEE Transactions on Circuits and Systems for Video Technology 33 (8), 4137-4148, 2023
422023
Vilem: Visual-language error modeling for image-text retrieval
Y Chen, Z Ma, Z Zhang, Z Qi, C Yuan, Y Shan, B Li, W Hu, X Qie, J Wu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
122023
Create: A benchmark for chinese short video retrieval and title generation
Z Zhang, Y Chen, Z Ma, Z Qi, C Yuan, B Li, Y Shan, W Hu
arXiv preprint arXiv:2203.16763, 2022
72022
Taming rectified flow for inversion and editing
J Wang, J Pu, Z Qi, J Guo, Y Ma, N Huang, Y Chen, X Li, Y Shan
arXiv preprint arXiv:2411.04746, 2024
42024
Order-prompted tag sequence generation for video tagging
Z Ma, Z Zhang, Y Chen, Z Qi, Y Luo, Z Li, C Yuan, B Li, X Qie, Y Shan, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
42023
Chinese title generation for short videos: Dataset, metric and algorithm
Z Zhang, Z Ma, C Yuan, Y Chen, P Wang, Z Qi, C Hao, B Li, Y Shan, W Hu, ...
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024
32024
Doge: Towards versatile visual document grounding and referring
Y Zhou, Y Chen, H Lin, S Yang, L Zhu, Z Qi, C Ma, Y Shan
arXiv preprint arXiv:2411.17125, 2024
12024
mRAG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
T Zhang, Z Zhang, Z Ma, Y Chen, Z Qi, C Yuan, B Li, J Pu, Y Zhao, Z Xie, ...
arXiv preprint arXiv:2411.15041, 2024
12024
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
Y Chen, Z Ma, Z Zhang, Z Qi, C Yuan, B Li, J Pu, Y Shan, X Qi, W Hu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
12024
EA-VTR: Event-Aware Video-Text Retrieval
Z Ma, Z Zhang, Y Chen, Z Qi, C Yuan, B Li, Y Luo, X Li, X Qi, Y Shan, ...
European Conference on Computer Vision, 76-94, 2024
2024
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval
H Liu, Y Shi, H Xu, C Yuan, Q Ye, C Li, M Yan, J Zhang, F Huang, B Li, ...
arXiv preprint arXiv:2402.16769, 2024
2024
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation
Y Chen, Z Zhang, Z Qi, C Yuan, J Wang, Y Shan, B Li, W Hu, X Qie, J Wu
IEEE Transactions on Circuits and Systems for Video Technology 34 (4), 2041-2055, 2023
2023
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–17