St-p3: End-to-end vision-based autonomous driving via spatial-temporal feature learning S Hu, L Chen, P Wu, H Li, J Yan, D Tao European Conference on Computer Vision, 533-549, 2022 | 229 | 2022 |
On Transforming Reinforcement Learning With Transformers: The Development Trajectory S Hu, L Shen, Y Zhang, Y Chen, D Tao IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 | 34 | 2024 |
Learning Multi-Agent Communication from Graph Modeling Perspective S Hu, L Shen, Y Zhang, D Tao The Twelfth International Conference on Learning Representations (ICLR 2024), 2024 | 21 | 2024 |
Graph decision transformer S Hu, L Shen, Y Zhang, D Tao arXiv preprint arXiv:2303.03747, 2023 | 19 | 2023 |
Prompt-tuning decision transformer with preference ranking S Hu, L Shen, Y Zhang, D Tao arXiv preprint arXiv:2305.09648, 2023 | 16 | 2023 |
HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning S Hu, Z Fan, L Shen, Y Zhang, Y Wang, D Tao The 41st International Conference on Machine Learning (ICML 2024), 2024 | 9 | 2024 |
Q-value regularized transformer for offline reinforcement learning S Hu, Z Fan, C Huang, L Shen, Y Zhang, Y Wang, D Tao The 41st International Conference on Machine Learning (ICML 2024), 2024 | 9 | 2024 |
Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization Z Fan, S Hu, J Yao, G Niu, Y Zhang, M Sugiyama, Y Wang The 41st International Conference on Machine Learning (ICML 2024), 2024 | 8 | 2024 |
FastDARTSDet: Fast differentiable architecture joint search on backbone and FPN for object detection C Wang, X Wang, Y Wang, S Hu, H Chen, X Gu, J Yan, T He Applied Sciences 12 (20), 10530, 2022 | 6 | 2022 |
Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? Y Dai, O Ma, L Zhang, X Liang, S Hu, M Wang, S Ji, J Huang, L Shen The 38th Annual Conference on Neural Information Processing Systems (NeurIPS …, 2024 | 2 | 2024 |
Reconstruct the Pruned Model without Any Retraining P Wang, Z Fan, S Hu, Z Chen, Y Wang, Y Wang arXiv preprint arXiv:2407.13331, 2024 | 1 | 2024 |
Continual Task Learning through Adaptive Policy Self-Composition S Hu, Y Zhou, Z Fan, J Hu, L Shen, Y Zhang, D Tao arXiv preprint arXiv:2411.11364, 2024 | | 2024 |
Task-Aware Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning Z Fan, S Hu, Y Zhou, L Shen, Y Zhang, Y Wang, D Tao arXiv preprint arXiv:2411.01146, 2024 | | 2024 |
Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy Generalization S Hu, W Zhao, W Lin, L Shen, Y Zhang, D Tao arXiv preprint arXiv:2411.01168, 2024 | | 2024 |
Communication Learning in Multi-Agent Systems from Graph Modeling Perspective S Hu, L Shen, Y Zhang, D Tao arXiv preprint arXiv:2411.00382, 2024 | | 2024 |
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces J Hu, S Huang, L Shen, Z Yang, S Hu, S Tang, H Chen, Y Chang, D Tao, ... arXiv preprint arXiv:2410.15698, 2024 | | 2024 |