Shiyu Huang
Other names: 黄 世宇
Zhipu AI; Tsinghua University
Verified email at zhipuai.cn
Title
Cited by
Year
CogVideoX: Text-to-video diffusion models with an expert transformer
Z Yang, J Teng, W Zheng, M Ding, S Huang, J Xu, Y Yang, W Hong, ...
arXiv preprint arXiv:2408.06072, 2024
Cited by: 183 | Year: 2024
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
B Yuchen Lin, Y Fu, K Yang, F Brahman, S Huang, C Bhagavatula, ...
arXiv e-prints, arXiv: 2305.17390, 2023
Cited by: 138* | Year: 2023
Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters
S Huang, D Ramanan
Cited by: 62 | Year: 2017
CogVLM2: Visual language models for image and video understanding
W Hong, W Wang, M Ding, W Yu, Q Lv, Y Wang, Y Cheng, S Huang, J Ji, ...
arXiv preprint arXiv:2408.16500, 2024
Cited by: 57 | Year: 2024
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations
S Huang, W Chen, L Zhang, Z Li, F Zhu, D Ye, T Chen, J Zhu
arXiv preprint arXiv:2110.04507, 2021
Cited by: 28 | Year: 2021
LVBench: An extreme long video understanding benchmark
W Wang, Z He, W Hong, Y Cheng, X Zhang, J Qi, X Gu, S Huang, B Xu, ...
arXiv preprint arXiv:2406.08035, 2024
Cited by: 27 | Year: 2024
Deep reinforcement learning with credit assignment for combinatorial optimization
D Yan, J Weng, S Huang, C Li, Y Zhou, H Su, J Zhu
Pattern Recognition 124, 108466, 2022
Cited by: 27 | Year: 2022
Combo-action: Training agent for FPS game with auxiliary tasks
S Huang, H Su, J Zhu, T Chen
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 954-961, 2019
Cited by: 22 | Year: 2019
Uncertainty quantification via a memristor Bayesian deep neural network for risk-sensitive reinforcement learning
Y Lin, Q Zhang, B Gao, J Tang, P Yao, C Li, S Huang, Z Liu, Y Zhou, Y Liu, ...
Nature Machine Intelligence 5 (7), 714-723, 2023
Cited by: 21 | Year: 2023
TiZero: Mastering multi-agent football with curriculum learning and self-play
F Lin, S Huang, T Pearce, W Chen, WW Tu
arXiv preprint arXiv:2302.07515, 2023
Cited by: 19 | Year: 2023
Robustness and generalizability of deepfake detection: A study with diffusion models
H Song, S Huang, Y Dong, WW Tu
arXiv preprint arXiv:2309.02218, 2023
Cited by: 18 | Year: 2023
SVQN: Sequential variational soft Q-learning networks
S Huang, H Su, J Zhu, T Chen
International Conference on Learning Representations, 2019
Cited by: 14 | Year: 2019
LLMArena: Assessing capabilities of large language models in dynamic multi-agent environments
J Chen, X Hu, S Liu, S Huang, WW Tu, Z He, L Wen
arXiv preprint arXiv:2402.16499, 2024
Cited by: 13 | Year: 2024
DGPO: discovering multiple strategies with diversity-guided policy optimization
W Chen, S Huang, Y Chiang, T Pearce, WW Tu, T Chen, J Zhu
Proceedings of the AAAI Conference on Artificial Intelligence 38 (10), 11390 …, 2024
Cited by: 6 | Year: 2024
Learning graph-enhanced commander-executor for multi-agent navigation
X Yang, S Huang, Y Sun, Y Yang, C Yu, WW Tu, H Yang, Y Wang
arXiv preprint arXiv:2302.04094, 2023
Cited by: 6 | Year: 2023
AutoSAT: Automatically optimize SAT solvers via large language models
Y Sun, F Ye, X Zhang, S Huang, B Zhang, K Wei, S Cai
arXiv preprint arXiv:2402.10705, 2024
Cited by: 5 | Year: 2024
A Survey on Self-play Methods in Reinforcement Learning
R Zhang, Z Xu, C Ma, C Yu, WW Tu, S Huang, D Ye, W Ding, Y Yang, ...
arXiv preprint arXiv:2408.01072, 2024
Cited by: 3 | Year: 2024
ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models
J Chen, T Zhang, S Huang, Y Niu, L Zhang, L Wen, X Hu
arXiv preprint arXiv:2411.15268, 2024
Cited by: 2 | Year: 2024
WaterSeeker: Efficient detection of watermarked segments in large documents
L Pan, A Liu, Y Lu, Z Gao, Y Di, L Wen, I King, SY Philip
CoRR, 2024
Cited by: 2 | Year: 2024
OpenRL: A Unified Reinforcement Learning Framework
S Huang, W Chen, Y Sun, F Bie, WW Tu
arXiv preprint arXiv:2312.16189, 2023
Cited by: 2 | Year: 2023