Jinyi Hu
MIT, Tsinghua University
Verified email at mit.edu
Title
Cited by
Year
RLHF-V: Towards trustworthy MLLMs via behavior alignment from fine-grained correctional human feedback
T Yu, Y Yao, H Zhang, T He, Y Han, G Cui, J Hu, Z Liu, HT Zheng, M Sun, ...
CVPR 2024, 2023
Cited by: 174; Year: 2023
OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems
C He, R Luo, Y Bai, S Hu, ZL Thai, J Shen, J Hu, X Han, Y Huang, ...
ACL 2024, 2024
Cited by: 66; Year: 2024
Large multilingual models pivot zero-shot multimodal learning across languages
J Hu, Y Yao, C Wang, S Wang, Y Pan, Q Chen, T Yu, H Wu, Y Zhao, ...
ICLR 2024 (spotlight), 2023
Cited by: 55; Year: 2023
Towards interpretable natural language understanding with explanations as latent variables
W Zhou*, J Hu*, H Zhang*, X Liang, M Sun, C Xiong, J Tang
NeurIPS 2020, 2020
Cited by: 43; Year: 2020
Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants
T Yu*, J Hu*, Y Yao, H Zhang, Y Zhao, C Wang, S Wang, Y Pan, J Xue, ...
arXiv preprint arXiv:2310.00653, 2023
Cited by: 21; Year: 2023
Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation
J Hu, X Yi, W Li, M Sun, X Xie
NAACL 2022, 2022
Cited by: 21; Year: 2022
GUICourse: From General Vision Language Models to Versatile GUI Agents
W Chen*, J Cui*, J Hu*, Y Qin, J Fang, Y Zhao, C Wang, J Liu, G Chen, ...
arXiv preprint arXiv:2406.11317, 2024
Cited by: 20; Year: 2024
Generating major types of Chinese classical poetry in a uniformed framework
J Hu, M Sun
arXiv preprint arXiv:2003.11528, 2020
Cited by: 18; Year: 2020
Revisiting non-autoregressive transformers for efficient image synthesis
Z Ni, Y Wang, R Zhou, J Guo, J Hu, Z Liu, S Song, Y Yao, G Huang
CVPR 2024, 2024
Cited by: 12; Year: 2024
scMulan: a multitask generative pre-trained language model for single-cell analysis
H Bian, Y Chen, X Dong, C Li, M Hao, S Chen, J Hu, M Sun, L Wei, ...
International Conference on Research in Computational Molecular Biology, 479-482, 2024
Cited by: 10; Year: 2024
Exploring Perceptual Limitation of Multimodal Large Language Models
J Zhang*, J Hu*, M Khayatkhoei, F Ilievski, M Sun
arXiv preprint arXiv:2402.07384, 2024
Cited by: 10; Year: 2024
Aspect-level sentiment-controllable review generation with mutual learning framework
H Chen, Y Lin, F Qi, J Hu, P Li, J Zhou, M Sun
AAAI 2021, 2021
Cited by: 10; Year: 2021
LEGENT: Open Platform for Embodied Agents
Z Cheng, J Hu, Z Wang, S Hu, A Liu, Y Tu, P Li, L Shi, Z Liu, M Sun
ACL 2024, 2024
Cited by: 7; Year: 2024
NVILA: Efficient frontier visual language models
Z Liu, L Zhu, B Shi, Z Zhang, Y Lou, S Yang, H Xi, S Cao, Y Gu, D Li, X Li, ...
arXiv preprint arXiv:2412.04468, 2024
Cited by: 6; Year: 2024
Efficient cross-lingual transfer for Chinese Stable Diffusion with images as pivots
J Hu, X Han, X Yi, Y Chen, W Li, Z Liu, M Sun
arXiv preprint arXiv:2305.11540, 2023
Cited by: 6; Year: 2023
AdaNAT: Exploring adaptive policy for token-based image generation
Z Ni, Y Wang, R Zhou, R Lu, J Guo, J Hu, Z Liu, Y Yao, G Huang
European Conference on Computer Vision, 302-319, 2024
Cited by: 5; Year: 2024
Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention
W Li, X Yi, J Hu, M Sun, X Xie
EMNLP 2022, 2022
Cited by: 1; Year: 2022
Recurrence Boosts Diversity! Revisiting Recurrent Latent Variable in Transformer-Based Variational AutoEncoder for Diverse Text Generation
J Hu, X Yi, W Li, M Sun, X Xie
Findings of EMNLP 2022, 2022
Cited by: 1; Year: 2022
EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents
Z Cheng, Y Tu, R Li, S Dai, J Hu, S Hu, J Li, Y Shi, T Yu, W Chen, L Shi, ...
arXiv preprint arXiv:2501.11858, 2025
Year: 2025
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions
J Zhang, O Liu, T Yu, J Hu, W Neiswanger
arXiv preprint arXiv:2412.08737, 2024
Year: 2024