Valley: Video assistant with large language model enhanced ability R Luo, Z Zhao, M Yang, J Dong, D Li, P Lu, T Wang, L Hu, M Qiu, Z Wei arXiv preprint arXiv:2306.07207, 2023 | 159 | 2023 |
DOLG: Single-stage image retrieval with deep orthogonal fusion of local and global features M Yang, D He, M Fan, B Shi, X Xue, F Li, E Ding, J Huang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 151 | 2021 |
Real-time tunnel crack analysis system via deep learning Q Song, Y Wu, X Xin, L Yang, M Yang, H Chen, C Liu, M Hu, X Chai, J Li IEEE Access 7, 64186-64197, 2019 | 81 | 2019 |
CODER: Coupled diversity-sensitive momentum contrastive learning for image-text retrieval H Wang, D He, W Wu, B Xia, M Yang, F Li, Y Yu, Z Ji, E Ding, J Wang European Conference on Computer Vision, 700-716, 2022 | 31 | 2022 |
DALG: Deep attentive local and global modeling for image retrieval Y Song, R Zhu, M Yang, D He arXiv preprint arXiv:2207.00287, 2022 | 15 | 2022 |
It takes two: Masked appearance-motion modeling for self-supervised video transformer pre-training Y Song, M Yang, W Wu, D He, F Li, J Wang arXiv preprint arXiv:2210.05234, 2022 | 10 | 2022 |
Semi-supervised recognition under a noisy and fine-grained dataset C Cui, Z Ye, Y Li, X Li, M Yang, K Wei, B Dai, Y Zhao, Z Liu, R Pang arXiv preprint arXiv:2006.10702 (CVPR2020 workshop), 2020 | 6 | 2020 |
MtArtGPT: A multi-task art generation system with pre-trained transformer C Jin, R Zhu, Z Zhu, L Yang, M Yang, J Luo IEEE Transactions on Circuits and Systems for Video Technology 34 (8), 6901-6912, 2024 | 5 | 2024 |
Image recognition method, electronic device and storage medium M Yang US Patent 11,899,710, 2024 | 1 | 2024 |
FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings Z Wang, D Li, Y Su, M Yang, M Qiu, W Wang Proceedings of the 33rd ACM International Conference on Information and …, 2024 | | 2024 |
Method and apparatus for retrieving image, device, and medium H Ren, M Yang, X Xue US Patent 11,836,186, 2023 | | 2023 |
Method and apparatus for identifying key point locations in image, and medium X Xue, H Ren, M Yang US Patent 11,636,666, 2023 | | 2023 |
Method for training image search model and method for image search M Yang, R Zhu US Patent App. 17/742,994, 2022 | | 2022 |
2nd Place Solution to Google Landmark Retrieval 2020 M Yang, C Cui, X Xue, H Ren, K Wei arXiv preprint arXiv:2210.01624 (ECCV2020 workshop), 2022 | | 2022 |
Logo picture processing method, apparatus, device and medium C Cui, K Wei, M Yang EP Patent CN112580620A; EP4020311A1; US2022207286A1, 2022 | | 2022 |
Method and apparatus for analyzing video scenario X Xue, H Ren, M Yang US Patent App. 17/191,438, 2022 | | 2022 |
Image recognition method, apparatus, electronic device, storage medium and program product M Yang EP Patent CN112241764A; EP3869403A2; EP3869403A3; US2021326639A1, 2022 | | 2022 |
Recognition of Tunnel Cracks Based on Deep Convolutional Neural Network Classifier M Yang, Q Song, X Xin, L Yang Data Science: 4th International Conference of Pioneering Computer Scientists …, 2018 | | 2018 |