Spremljaj
Alex Gu
Naslov
Navedeno
Navedeno
Leto
Starcoder: may the source be with you!
R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ...
arXiv preprint arXiv:2305.06161, 2023
1026*2023
The disagreement problem in explainable machine learning: A practitioner's perspective
S Krishna, T Han, A Gu, S Wu, S Jabbari, H Lakkaraju
arXiv preprint arXiv:2202.01602, 2022
2592022
SantaCoder: don't reach for the stars!
LB Allal, R Li, D Kocetkov, C Mou, C Akiki, CM Ferrandis, N Muennighoff, ...
arXiv preprint arXiv:2301.03988, 2023
246*2023
Leandojo: Theorem proving with retrieval-augmented language models
K Yang, A Swope, A Gu, R Chalamala, P Song, S Yu, S Godil, RJ Prenger, ...
Advances in Neural Information Processing Systems 36, 21573-21612, 2023
2392023
Starcoder 2 and the stack v2: The next generation
A Lozhkov, R Li, LB Allal, F Cassano, J Lamy-Poirier, N Tazi, A Tang, ...
arXiv preprint arXiv:2402.19173, 2024
2172024
Livecodebench: Holistic and contamination free evaluation of large language models for code
N Jain, K Han, A Gu, WD Li, F Yan, T Zhang, S Wang, A Solar-Lezama, ...
arXiv preprint arXiv:2403.07974, 2024
1302024
LINC: A neurosymbolic approach for logical reasoning by combining language models with first-order logic provers
TX Olausson, A Gu, B Lipkin, CE Zhang, A Solar-Lezama, JB Tenenbaum, ...
arXiv preprint arXiv:2310.15164, 2023
852023
Bigcodebench: Benchmarking code generation with diverse function calls and complex instructions
TY Zhuo, MC Vu, J Chim, H Hu, W Yu, R Widyasari, INB Yusuf, H Zhan, ...
arXiv preprint arXiv:2406.15877, 2024
692024
Cruxeval: A benchmark for code reasoning, understanding and execution
A Gu, B Rozière, H Leather, A Solar-Lezama, G Synnaeve, SI Wang
arXiv preprint arXiv:2401.03065, 2024
642024
Min-max bilevel multi-objective optimization with applications in machine learning
A Gu, S Lu, P Ram, L Weng
arXiv preprint arXiv:2203.01924, 2022
20*2022
The counterfeit conundrum: Can code language models grasp the nuances of their incorrect generations?
A Gu, WD Li, N Jain, TX Olausson, C Lee, K Sen, A Solar-Lezama
arXiv preprint arXiv:2402.19475, 2024
122024
Three operator splitting with subgradients, stochastic gradients, and adaptive learning rates
A Yurtsever, A Gu, S Sra
Advances in Neural Information Processing Systems 34, 19743-19756, 2021
62021
Mixture of parrots: Experts improve memorization more than reasoning
S Jelassi, C Mohri, D Brandfonbrener, A Gu, N Vyas, N Anand, ...
arXiv preprint arXiv:2410.19034, 2024
32024
Reproducibility report: La-maml: Look-ahead meta learning for continual learning
J Joseph, A Gu
arXiv preprint arXiv:2102.05824, 2021
3*2021
Language Agnostic Code Embeddings
S Utpala, A Gu, PY Chen
arXiv preprint arXiv:2310.16803, 2023
22023
Certified interpretability robustness for class activation mapping
A Gu, TW Weng, PY Chen, S Liu, L Daniel
arXiv preprint arXiv:2301.11324, 2023
22023
Obsynth: An interactive synthesis system for generating object models from natural language specifications
A Gu, T Mitrovska, D Velez, J Andreas, A Solar-Lezama
arXiv preprint arXiv:2210.11468, 2022
22022
Mixture of Parrots: Mixtures of experts improve memorization more than reasoning
S Jelassi, C Mohri, D Brandfonbrener, A Gu, N Vyas, N Anand, ...
NeurIPS 2024 Workshop on Mathematics of Modern Machine Learning, 0
Pruning CodeBERT for Improved Code-to-Text Efficiency
A Gu, R Sonecha, S Vedantam, B Runwal, D Misra
Sistem trenutno ne more izvesti postopka. Poskusite znova pozneje.
Članki 1–19