Mathvista: Evaluating mathematical reasoning of foundation models in visual contexts P Lu, H Bansal, T Xia, J Liu, C Li, H Hajishirzi, H Cheng, KW Chang, ... arXiv preprint arXiv:2310.02255, 2023 | 431 | 2023 |
Generated knowledge prompting for commonsense reasoning J Liu, A Liu, X Lu, S Welleck, P West, RL Bras, Y Choi, H Hajishirzi 60th Annual Meeting of the Association for Computational Linguistics (ACL …, 2021 | 327 | 2021 |
Draft, sketch, and prove: Guiding formal theorem provers with informal proofs AQ Jiang, S Welleck, JP Zhou, W Li, J Liu, M Jamnik, T Lacroix, Y Wu, ... 11th International Conference on Learning Representations (ICLR 2023), 2022 | 141 | 2022 |
Crossweigh: Training named entity tagger from imperfect annotations Z Wang, J Shang, L Liu, L Lu, J Liu, J Han The 2019 Conference on Empirical Methods in Natural Language Processing and …, 2019 | 122 | 2019 |
Inverse scaling: When bigger isn't better IR McKenzie, A Lyzhov, M Pieler, A Parrish, A Mueller, A Prabhu, ... arXiv preprint arXiv:2306.09479, 2023 | 89 | 2023 |
Naturalprover: Grounded mathematical proof generation with language models S Welleck, J Liu, L Ximing, H Hajishirzi, Y Choi 36th Conference on Neural Information Processing Systems (NeurIPS 2022), 2022 | 69 | 2022 |
Naturalproofs: Mathematical theorem proving in natural language S Welleck, J Liu, RL Bras, H Hajishirzi, Y Choi, K Cho 35th Conference on Neural Information Processing Systems (NeurIPS 2021 …, 2021 | 63 | 2021 |
Rainier: Reinforced knowledge introspector for commonsense question answering J Liu, S Hallinan, X Lu, P He, S Welleck, H Hajishirzi, Y Choi The 2022 Conference on Empirical Methods in Natural Language Processing …, 2022 | 56 | 2022 |
Vera: A general-purpose plausibility estimation model for commonsense statements J Liu, W Wang, D Wang, NA Smith, Y Choi, H Hajishirzi arXiv preprint arXiv:2305.03695, 2023 | 35 | 2023 |
Don't throw away your value model! Generating more preferable text with Value-Guided Monte-Carlo Tree Search decoding J Liu, A Cohen, R Pasunuru, Y Choi, H Hajishirzi, A Celikyilmaz arXiv preprint arXiv:2309.15028, 2023 | 34* | 2023 |
Unpacking dpo and ppo: Disentangling best practices for learning from preference feedback H Ivison, Y Wang, J Liu, Z Wu, V Pyatkin, N Lambert, NA Smith, Y Choi, ... arXiv preprint arXiv:2406.09279, 2024 | 32 | 2024 |
Crystal: Introspective reasoners reinforced with self-feedback J Liu, R Pasunuru, H Hajishirzi, Y Choi, A Celikyilmaz arXiv preprint arXiv:2310.04921, 2023 | 19 | 2023 |
Phrase grounding by soft-label chain conditional random field J Liu, J Hockenmaier The 2019 Conference on Empirical Methods in Natural Language Processing and …, 2019 | 11 | 2019 |
Towards grounded natural language proof generation S Welleck, J Liu, JM Han, Y Choi MathAI4Ed Workshop at NeurIPS 2021, 2021 | 10 | 2021 |
Infini-gram: Scaling unbounded n-gram language models to a trillion tokens J Liu, S Min, L Zettlemoyer, Y Choi, H Hajishirzi arXiv preprint arXiv:2401.17377, 2024 | 9 | 2024 |
Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification S Sanyal, T Xiao, J Liu, W Wang, X Ren arXiv preprint arXiv:2402.03686, 2024 | 4 | 2024 |
2 OLMo 2 Furious T OLMo, P Walsh, L Soldaini, D Groeneveld, K Lo, S Arora, A Bhagia, ... arXiv preprint arXiv:2501.00656, 2024 | 3 | 2024 |
Establishing Task Scaling Laws via Compute-Efficient Model Ladders A Bhagia, J Liu, A Wettig, D Heineman, O Tafjord, AH Jha, L Soldaini, ... arXiv preprint arXiv:2412.04403, 2024 | | 2024 |
NaturalProver: Grounded Natural Language Proof Generation with Language Models S Welleck, J Liu, X Lu, H Hajishirzi, Y Choi 7th Conference on Artificial Intelligence and Theorem Proving (AITP 2022), 2022 | | 2022 |
NaturalProofs: Mathematics meets Natural Language S Welleck, J Liu, R Le Bras, H Hajishirzi, Y Choi, K Cho 6th Conference on Artificial Intelligence and Theorem Proving (AITP 2021), 2021 | | 2021 |