S3d: Stacking segmental p3d for action quality assessment X Xiang, Y Tian, A Reiter, GD Hager, TD Tran 2018 25th IEEE international conference on image processing (ICIP), 928-932, 2018 | 67 | 2018 |
Temporal self-ensembling teacher for semi-supervised object detection C Chen, S Dong, Y Tian, K Cao, L Liu, Y Guo IEEE Transactions on Multimedia 24, 3679-3692, 2021 | 29 | 2021 |
Dong Yu. Toward self-improvement of llms via imagination, searching, and criticizing Y Tian, B Peng, L Song, L Jin, D Yu, H Mi arXiv preprint arXiv:2404.12253, 2024 | 18 | 2024 |
Self-alignment for factuality: Mitigating hallucinations in llms via self-evaluation X Zhang, B Peng, Y Tian, J Zhou, L Jin, L Song, H Mi, H Meng arXiv preprint arXiv:2402.09267, 2024 | 17 | 2024 |
Quality-Similar Diversity via Population Based Reinforcement Learning S Wu, J Yao, H Fu, Y Tian, C Qian, Y Yang, Q FU, Y Wei The Eleventh International Conference on Learning Representations, 2023 | 17 | 2023 |
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing Y Tian, B Peng, L Song, L Jin, D Yu, L Han, H Mi, D Yu arXiv preprint arXiv:2404.12253, 2024 | 15 | 2024 |
Stabilizing RLHF through advantage model and selective rehearsal B Peng, L Song, Y Tian, L Jin, H Mi, D Yu arXiv preprint arXiv:2309.10202, 2023 | 15 | 2023 |
Greedy when sure and conservative when uncertain about the opponents H Fu, Y Tian, H Yu, W Liu, S Wu, J Xiong, Y Wen, K Li, J Xing, Q Fu, ... International Conference on Machine Learning, 6829-6848, 2022 | 12 | 2022 |
Fine-Grained Self-Endorsement Improves Factuality and Reasoning A Wang, L Song, B Peng, Y Tian, L Jin, H Mi, J Su, D Yu arXiv preprint arXiv:2402.15631, 2024 | 6 | 2024 |
Efficacious tree search for llm A Wang, L Song, Y Tian, B Peng, D Yu, H Mi, J Su, DY Litesearch arXiv preprint arXiv:2407.00320 3, 2024 | 5 | 2024 |
Dong Yu. Iterative nash policy optimization: Aligning llms with general preferences via no-regret learning Y Zhang, D Yu, B Peng, L Song, Y Tian, M Huo, N Jiang, H Mi arXiv preprint arXiv:2407.00617 12, 2024 | 5 | 2024 |
Litesearch: Efficacious tree search for llm A Wang, L Song, Y Tian, B Peng, D Yu, H Mi, J Su, D Yu arXiv preprint arXiv:2407.00320, 2024 | 3 | 2024 |
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching X Zhang, B Peng, Y Tian, J Zhou, Y Zhang, H Mi, H Meng arXiv preprint arXiv:2406.06326, 2024 | 3 | 2024 |
Self-Consistency Boosts Calibration for Math Reasoning A Wang, L Song, Y Tian, B Peng, L Jin, H Mi, J Su, D Yu arXiv preprint arXiv:2403.09849, 2024 | 3 | 2024 |
Collaborative decoding of critical tokens for boosting factuality of large language models L Jin, B Peng, L Song, H Mi, Y Tian, D Yu arXiv preprint arXiv:2402.17982, 2024 | 3 | 2024 |
Videgothink: Assessing egocentric video understanding capabilities for embodied ai S Cheng, K Fang, Y Yu, S Zhou, B Li, Y Tian, T Li, L Han, Y Liu arXiv preprint arXiv:2410.11623, 2024 | 2 | 2024 |
Resisting large data variations via introspective transformation network Y Zhao, Y Tian, C Fowlkes, W Shen, A Yuille Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2020 | 2 | 2020 |
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning X Wang, L Song, Y Tian, D Yu, B Peng, H Mi, F Huang, D Yu arXiv preprint arXiv:2410.06508, 2024 | 1 | 2024 |
Siam: Self-improving code-assisted mathematical reasoning of large language models D Yu, B Peng, Y Tian, L Song, H Mi, D Yu arXiv preprint arXiv:2408.15565, 2024 | 1 | 2024 |
Spatial Transformer Introspective Neural Network. Y Zhao, Y Tian, W Shen23, A Yuille arXiv preprint arXiv:1805.06447, 2018 | 1 | 2018 |