Theo dõi
Carlos E. Jimenez
Carlos E. Jimenez
Email được xác minh tại princeton.edu - Trang chủ
Tiêu đề
Trích dẫn bởi
Trích dẫn bởi
Năm
Swe-bench: Can language models resolve real-world github issues?
CE Jimenez, J Yang, A Wettig, S Yao, K Pei, O Press, K Narasimhan
arXiv preprint arXiv:2310.06770, 2023
3542023
Swe-agent: Agent-computer interfaces enable automated software engineering
J Yang, CE Jimenez, A Wettig, K Lieret, S Yao, KR Narasimhan, O Press
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
1572024
C-STS: Conditional semantic textual similarity
A Deshpande, CE Jimenez, H Chen, V Murahari, V Graf, T Rajpurohit, ...
arXiv preprint arXiv:2305.15093, 2023
282023
Datamux: Data multiplexing for neural networks
V Murahari, C Jimenez, R Yang, K Narasimhan
Advances in Neural Information Processing Systems 35, 17515-17527, 2022
202022
Swe-bench: Can language models resolve real-world github issues?, 2024
CE Jimenez, J Yang, A Wettig, S Yao, K Pei, O Press, K Narasimhan
URL https://arxiv. org/abs/2310.06770, 2023
192023
Carets: A consistency and robustness evaluative test suite for vqa
CE Jimenez, O Russakovsky, K Narasimhan
arXiv preprint arXiv:2203.07613, 2022
162022
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
J Yang, CE Jimenez, AL Zhang, K Lieret, J Yang, X Wu, O Press, ...
arXiv preprint arXiv:2410.03859, 2024
92024
Mux-plms: Pre-training language models with data multiplexing
V Murahari, A Deshpande, C Jimenez, I Shafran, M Wang, Y Cao, ...
Proceedings of the 8th Workshop on Representation Learning for NLP (RepL4NLP …, 2023
62023
Swe-bench: Can language models resolve real-world github issues? CoRR, abs/2310.06770, 2023. doi: 10.48550
CE Jimenez, J Yang, A Wettig, S Yao, K Pei, O Press, K Narasimhan
arXiv preprint ARXIV.2310.06770, 0
6
Introducing swe-bench verified
N Chowdhury, J Aung, CJ Shern, O Jaffe, D Sherburn, G Starace, E Mays, ...
Aug, 2024
42024
EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
T Abramovich, M Udeshi, M Shao, K Lieret, H Xi, K Milner, S Jancheska, ...
arXiv preprint arXiv:2409.16165, 2024
22024
Mux-plms: Data multiplexing for high-throughput language models
V Murahari, A Deshpande, CE Jimenez, I Shafran, M Wang, Y Cao, ...
arXiv preprint arXiv:2302.12441, 2023
22023
Learning Physical Commonsense Knowledge
CE Jimenez
2020
Quartz: A tool for vectoring and fertility assessments in magmatic-hydrothermal ore deposits
L Zhang, D Cooke, E Orovan, N White, M Baker, H Chen, J Wilkinson, ...
University of Tasmania, 2018
2018
Hệ thống không thể thực hiện thao tác ngay bây giờ. Hãy thử lại sau.
Bài viết 1–14