Følg
yue yang
yue yang
Shanghai Jiao Tong University, Shanghai AI Laboratory
Verificeret mail på sjtu.edu.cn - Startside
Titel
Citeret af
Citeret af
År
Mmt-bench: A comprehensive multimodal benchmark for evaluating large vision-language models towards multitask agi
K Ying, F Meng, J Wang, Z Li, H Lin, Y Yang, H Zhang, W Zhang, Y Lin, ...
arXiv preprint arXiv:2404.16006, 2024
602024
Secure federated learning model verification: A client-side backdoor triggered watermarking scheme
X Liu, S Shao, Y Yang, K Wu, W Yang, H Fang
2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC …, 2021
362021
Watermarking in secure federated learning: A verification framework based on client-side backdooring
W Yang, S Shao, Y Yang, X Liu, X Liu, Z Xia, G Schaefer, H Fang
ACM Transactions on Intelligent Systems and Technology 15 (1), 1-25, 2023
242023
Convbench: A multi-turn conversation evaluation benchmark with hierarchical capability for large vision-language models
S Liu, K Ying, H Zhang, Y Yang, Y Lin, T Zhang, C Li, Y Qiao, P Luo, ...
arXiv preprint arXiv:2403.20194, 2024
122024
Align, adapt and inject: Sound-guided unified image generation
Y Yang, K Zhang, Y Ge, W Shao, Z Xue, Y Qiao, P Luo
arXiv preprint arXiv:2306.11504, 2023
52023
PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models
F Meng, W Shao, L Luo, Y Wang, Y Chen, Q Lu, Y Yang, T Yang, K Zhang, ...
arXiv preprint arXiv:2406.11802, 2024
42024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality
T Zhang, L Ma, Y Yan, Y Zhang, K Wang, Y Yang, Z Guo, W Shao, Y You, ...
arXiv preprint arXiv:2406.08845, 2024
32024
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
L Zhao, Y Yang, K Zhang, W Shao, Y Zhang, Y Qiao, P Luo, R Ji
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
32024
Name Your Colour For the Task: Artificially Discover Colour Naming via Colour Quantisation Transformer
S Su, L Gu, Y Yang, Z Zhang, T Harada
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
32023
Position: Towards Implicit Prompt For Text-To-Image Models
Y Yang, Y Lin, H Liu, W Shao, R Chen, H Shang, Y Wang, Y Qiao, ...
Forty-first International Conference on Machine Learning, 0
2
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
W Peng, K Zhang, Y Yang, H Zhang, Y Qiao
Proceedings of the AAAI Conference on Artificial Intelligence 38 (5), 4506-4514, 2024
12024
Position Paper: Towards Implicit Prompt For Text-To-Image Models
Y Yang, H Liu, W Shao, R Chen, H Shang, Y Wang, Y Qiao, K Zhang, ...
arXiv preprint arXiv:2403.02118, 2024
12024
GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation
P Zhou, X Peng, J Song, C Li, Z Xu, Y Yang, Z Guo, H Zhang, Y Lin, Y He, ...
arXiv preprint arXiv:2411.18499, 2024
2024
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Y Yang, S Zhang, W Shao, K Zhang, Y Bin, Y Wang, P Luo
arXiv preprint arXiv:2410.08695, 2024
2024
Align, Adapt and Inject: Audio-Guided Image Generation, Editing and Stylization
Y Yang, K Zhang, Y Ge, W Shao, Z Xue, Y Qiao, P Luo
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
Systemet kan ikke foretage handlingen nu. Prøv igen senere.
Artikler 1–15