Wenqi Zhang
Verified email at zju.edu.cn
Title
Cited by
Year
Videollama 2: Advancing spatial-temporal modeling and audio understanding in video-llms
Z Cheng, S Leng, H Zhang, Y Xin, X Li, G Chen, Y Zhu, W Zhang, Z Luo, ...
arXiv preprint arXiv:2406.07476, 2024
Cited by: 151, Year: 2024
Data-copilot: Bridging billions of data and humans with autonomous workflow
W Zhang, Y Shen, W Lu, Y Zhuang
ICLR 2024 Workshop on LLM Agents (Outstanding paper), 2023
Cited by: 48, Year: 2023
Self-contrast: Better reflection through inconsistent solving perspectives
W Zhang, Y Shen, L Wu, Q Peng, J Wang, Y Zhuang, W Lu
Proceedings of the ACL 2024 Main, 2024
Cited by: 39, Year: 2024
PromptNER: Prompt locating and typing for named entity recognition
Y Shen, Z Tan, S Wu, W Zhang, R Zhang, Y Xi, W Lu, Y Zhuang
ACL-2023-Main, 2023
Cited by: 39, Year: 2023
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
W Zhang, K Tang, H Wu, M Wang, Y Shen, G Hou, Z Tan, P Li, Y Zhuang, ...
Proceedings of the ACL 2024 Main, 2024
Cited by: 36, Year: 2024
Taskbench: Benchmarking large language models for task automation
Y Shen, K Song, X Tan, W Zhang, K Ren, S Yuan, W Lu, D Li, Y Zhuang
NeurIPS 2024, 2023
Cited by: 36, Year: 2023
Multi-View Reasoning: Consistent Contrastive Learning for Math Word Problem
W Zhang, Y Shen, Y Ma, X Cheng, Z Tan, Q Nong, W Lu
https://aclanthology.org/2022.findings-emnlp.79.pdf, 2022
Cited by: 22, Year: 2022
Deep Reinforcement Learning for Multi-contact Motion Planning of Hexapod Robots.
H Fu, K Tang, P Li, W Zhang, X Wang, G Deng, T Wang, C Chen
IJCAI, 2381-2388, 2021
Cited by: 15, Year: 2021
Multimodal self-instruct: Synthetic abstract image and visual reasoning instruction using language model
W Zhang, Z Cheng, Y He, M Wang, Y Shen, Z Tan, G Hou, M He, Y Ma, ...
EMNLP24 oral, 2024
Cited by: 10, Year: 2024
Query-based Instance Discrimination Network for Relational Triple Extraction
Z Tan, Y Shen, X Hu, W Zhang, X Cheng, W Lu, Y Zhuang
10.18653/v1/2022.emnlp-main.523, 2022
Cited by: 9, Year: 2022
An Expression Tree Decoding Strategy for Mathematical Equation Generation
W Zhang, Y Shen, Q Nong, Z Tan, Y Ma, W Lu
Proceedings of the 2023 Conference on EMNLP-main, 2023
Cited by: 8, Year: 2023
Learning to Navigate in a VUCA Environment: Hierarchical Multi-expert Approach
W Zhang, K Zhao, P Li, X Zhu, F Ye, W Jiang, H Fu, T Wang
2021 IROS (IEEE/RSJ International Conference on Intelligent Robots and …, 2021
Cited by: 8, Year: 2021
A Closed-Loop Perception, Decision-Making and Reasoning Mechanism for Human-Like Navigation
W Zhang, K Zhao, P Li, X Zhu, Y Shen, Y Ma, Y Chen, W Lu
Proceedings of the Thirty-First IJCAI Main Track, 2022
Cited by: 7, Year: 2022
TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind
G Hou, W Zhang, Y Shen, L Wu, W Lu
ACL-24-Findings, 2024
Cited by: 6, Year: 2024
Advancing process verification for large language models via tree-based preference learning
M He, Y Shen, W Zhang, Z Tan, W Lu
arXiv preprint arXiv:2407.00390, 2024
Cited by: 4, Year: 2024
Enhancing emotion recognition in conversation via multi-view feature alignment and memorization
G Hou, Y Shen, W Zhang, W Xue, W Lu
Findings of the association for computational linguistics: EMNLP 2023, 12651 …, 2023
Cited by: 4, Year: 2023
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding
B Zhang, K Li, Z Cheng, Z Hu, Y Yuan, G Chen, S Leng, Y Jiang, H Zhang, ...
arXiv preprint arXiv:2501.13106, 2025
Cited by: 1, Year: 2025
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
W Zhang, H Zhang, X Li, J Sun, Y Shen, W Lu, D Zhao, Y Zhuang, L Bing
arXiv preprint arXiv:2501.00958, 2025
Cited by: 1, Year: 2025
Entering Real Social World! Benchmarking the Theory of Mind and Socialization Capabilities of LLMs from a First-person Perspective
G Hou, W Zhang, Y Shen, Z Tan, S Shen, W Lu
arXiv preprint arXiv:2410.06195, 2024
Cited by: 1, Year: 2024
Specialized Mathematical Solving by a Step-By-Step Expression Chain Generation
W Zhang, Y Shen, G Hou, K Wang, W Lu
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Cited by: 1, Year: 2024
Articles 1–20