Follow
John Yang
Title
Cited by
Cited by
Year
Webshop: Towards scalable real-world web interaction with grounded language agents
S Yao, H Chen, J Yang, K Narasimhan
NeurIPS 2022, 2022
3462022
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
CE Jimenez, J Yang, A Wettig, S Yao, K Pei, O Press, K Narasimhan
ICLR 2024, 2023
3092023
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
J Yang, CE Jimenez, A Wettig, K Lieret, S Yao, K Narasimhan, O Press
NeurIPS 2024, 2024
1292024
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback
J Yang, A Prabhakar, K Narasimhan, S Yao
NeurIPS 2023 (Datasets & Benchmarks), 2023
852023
Language Agents as Hackers: Evaluating Cybersecurity Skills with Capture the Flag
J Yang, A Prabhakar, S Yao, K Pei, KR Narasimhan
Multi-Agent Security Workshop @ NeurIPS 2023, 2023
172023
Prompting Large Language Models to Tackle the Full Software Development Lifecycle: A Case Study
B Li, W Wu, Z Tang, L Shi, J Yang, J Li, S Yao, C Qian, B Hui, Q Zhang, ...
Proceedings of the 31st International Conference on Computational …, 2025
13*2025
Introducing SWE-bench Verified
N Chowdhury, J Aung, CJ Shern, O Jaffe, D Sherburn, G Starace, E Mays, ...
OpenAI, 2024
8*2024
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
J Yang, CE Jimenez, AL Zhang, K Lieret, J Yang, X Wu, O Press, ...
ICLR 2025, 2024
52024
Referral Augmentation for Zero-Shot Information Retrieval
M Tang, S Yao, J Yang, K Narasimhan
ACL 2024 (Findings), 2023
32023
Quartz: A framework for engineering secure smart contracts
J Kolb, J Yang, RH Katz, DE Culler
EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS …, 2020
32020
EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
T Abramovich, M Udeshi, M Shao, K Lieret, H Xi, K Milner, S Jancheska, ...
arXiv preprint arXiv:2409.16165, 2024
22024
Collaborative Gym: A Framework for Enabling and Evaluating Human-Agent Collaboration
Y Shao, V Samuel, Y Jiang, J Yang, D Yang
arXiv preprint arXiv:2412.15701, 2024
2024
Disentangled Prompt Learning for Transferable, Multimodal, Few-Shot Image Classification
J Yang, A Magnani, B Yang
2024 IEEE International Conference on Big Data (BigData), 3343-3352, 2024
2024
Learning Language through Interactions with the Digital World
JB Yang
Princeton University, 2023
2023
Towards an Enhanced, Faithful, and Adaptable Web Interaction Environment
J Yang, H Chen, KR Narasimhan
Second Workshop on Language and Reinforcement Learning @ NeurIPS 2022, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–15