Nearest neighbor ensembles: An effective method for difficult problems in streaming classification with emerging new classes XQ Cai, P Zhao, KM Ting, X Mu, Y Jiang 2019 IEEE international conference on data mining (ICDM), 970-975, 2019 | 33 | 2019 |
Distributional Pareto-optimal multi-objective reinforcement learning XQ Cai, P Zhang, L Zhao, J Bian, M Sugiyama, A Llorens Advances in Neural Information Processing Systems 36, 2024 | 15 | 2024 |
Imitation learning from pixel-level demonstrations by HashReward XQ Cai, YX Ding, Y Jiang, ZH Zhou Proceedings of the 20th International Conference on Autonomous Agents and …, 2021 | 15* | 2021 |
Seeing Differently, Acting Similarly: Imitation Learning with Heterogeneous Observations XQ Cai, YX Ding, ZX Chen, Y Jiang, M Sugiyama, ZH Zhou The Eleventh International Conference on Learning Representations, 2023 | 9* | 2023 |
Leveraging Multi-lingual Positive Instances in Contrastive Learning to Improve Sentence Embedding K Zhao, Q Wu, XQ Cai, Y Tsuruoka Proceedings of the 8th Conference of the European Chapter of the Association …, 2023 | 7 | 2023 |
Anomaly Guided Policy Learning from Imperfect Demonstrations ZX Chen*, XQ Cai*, Y Jiang, ZH Zhou Proceedings of the 21st International Conference on Autonomous Agents and …, 2022 | 6 | 2022 |
Imitation learning from vague feedback XQ Cai, YJ Zhang, CK Chiang, M Sugiyama Advances in Neural Information Processing Systems 36, 48275-48292, 2023 | 5 | 2023 |
An Animation-based Augmentation Approach for Action Recognition from Discontinuous Video X Song, Z Li, S Chen, XQ Cai, K Demachi Proceedings of the 27th European Conference on Artificial Intelligence, 2024 | 2 | 2024 |
Reinforcement learning from bagged reward Y Tang, XQ Cai, YX Ding, Q Wu, G Liu, M Sugiyama ICML 2024 Workshop: Aligning Reinforcement Learning Experimentalists and …, 2024 | 2 | 2024 |
Soft-Label Integration for Robust Toxicity Classification Z Cheng, X Wu, J Yu, S Han, XQ Cai, X Xing arXiv preprint arXiv:2410.14894, 2024 | 1 | 2024 |
Reinforcement Learning from Bagged Reward: A Transformer-based Approach for Instance-Level Reward Redistribution Y Tang, XQ Cai, YX Ding, Q Wu, G Liu, M Sugiyama arXiv preprint arXiv:2402.03771, 2024 | 1 | 2024 |
Beyond Simple Sum of Delayed Rewards: Non-Markovian Reward Modeling for Reinforcement Learning Y Tang, XQ Cai, JC Pang, Q Wu, YX Ding, M Sugiyama arXiv preprint arXiv:2410.20176, 2024 | | 2024 |
Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains S Nishimori, XQ Cai, J Ackermann, M Sugiyama arXiv preprint arXiv:2404.07465, 2024 | | 2024 |
IG-Net: Image-Goal Network for Offline Visual Navigation on A Large-Scale Game Map B Zhu, P Zhang, XQ Cai, L Zhao, M Sugiyama, J Bian | | |
IG-Net: Image-Goal Network for Offline Visual Navigation on A Large-Scale Game Map P Zhang, B Zhu, XQ Cai, L Zhao, M Sugiyama, J Bian | | |