Songyang Zhang
Other names: 张 松阳
Shanghai AI Laboratory
Verified email at pjlab.org.cn
Title
Cited by
Year
MMBench: Is your multi-modal model an all-around player?
Y Liu, H Duan, Y Zhang, B Li, S Zhang, W Zhao, Y Yuan, J Wang, C He, ...
European conference on computer vision, 216-233, 2024
Cited by: 720, Year: 2024
Part-aware prototype network for few-shot semantic segmentation
Y Liu, X Zhang, S Zhang, X He
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
Cited by: 389, Year: 2020
Distribution Alignment: A Unified Framework for Long-tail Visual Recognition
S Zhang, Z Li, S Yan, X He, J Sun
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Cited by: 340, Year: 2021
Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation
R Li, S Zhang, B Wan, X He
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Cited by: 258, Year: 2021
InternLM2 technical report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
Cited by: 215, Year: 2024
InternLM-XComposer2: Mastering free-form text-image composition and comprehension in vision-language large model
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...
arXiv preprint arXiv:2401.16420, 2024
Cited by: 215, Year: 2024
OpenCompass: A universal evaluation platform for foundation models.
OC Contributors
https://github.com/open-compass/opencompass, 2023
Cited by: 197, Year: 2023
InternLM: A multilingual language model with progressively enhanced capabilities
ILM Team
(2023-01-06)[2023-09-27]. https://github.com/InternLM/InternLM, 2023
Cited by: 194, Year: 2023
InternLM-XComposer: A vision-language large model for advanced text-image comprehension and composition
P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, H Duan, ...
arXiv preprint arXiv:2309.15112, 2023
Cited by: 184, Year: 2023
SGTR: End-to-end Scene Graph Generation with Transformer
R Li, S Zhang, X He
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2022
Cited by: 117, Year: 2022
InternLM-XComposer2-4KHD: A pioneering large vision-language model handling resolutions from 336 pixels to 4K HD
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ...
arXiv preprint arXiv:2404.06512, 2024
Cited by: 107, Year: 2024
Dynamic context correspondence network for semantic alignment
S Huang, Q Wang, S Zhang, S Yan, X He
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
Cited by: 101, Year: 2019
LatentGNN: Learning Efficient Non-local Relations for Visual Recognition
S Zhang, S Yan, X He
Proceedings of the 36th International Conference on Machine Learning, 2019
Cited by: 93, Year: 2019
OpenMMLab’s image classification toolbox and benchmark
M Contributors
URL: https://github.com/open-mmlab/mmclassification 5, 2020
Cited by: 73, Year: 2020
LawBench: Benchmarking legal knowledge of large language models
Z Fei, X Shen, D Zhu, F Zhou, Z Han, S Zhang, K Chen, Z Shen, J Ge
arXiv preprint arXiv:2309.16289, 2023
Cited by: 72, Year: 2023
A Dual Attention Network with Semantic Embedding for Few-Shot Learning.
S Yan, S Zhang, X He
AAAI 33, 9079-9086, 2019
Cited by: 72, Year: 2019
Action Quality Assessment with Temporal Parsing Transformer
Y Bai, D Zhou, S Zhang, J Wang, E Ding, Y Guan, Y Long, J Wang
European Conference on Computer Vision, 2022
Cited by: 52, Year: 2022
An EM framework for online incremental learning of semantic segmentation
S Yan, J Zhou, J Xie, S Zhang, X He
Proceedings of the 29th ACM international conference on multimedia, 3052-3060, 2021
Cited by: 50, Year: 2021
Learning Implicit Temporal Alignment for Few-shot Video Classification
S Zhang, J Zhou, X He
International Joint Conferences on Artificial Intelligence, 2021
Cited by: 48, Year: 2021
InternLM-Math: Open math large language models toward verifiable reasoning
H Ying, S Zhang, L Li, Z Zhou, Y Shao, Z Fei, Y Ma, J Hong, K Liu, Z Wang, ...
arXiv preprint arXiv:2402.06332, 2024
Cited by: 44, Year: 2024