LENS: A learnable evaluation metric for text simplification M Maddela, Y Dou, D Heineman, W Xu arXiv preprint arXiv:2212.09739, 2022 | 57 | 2022 |
Dancing between success and failure: Edit-level simplification evaluation using SALSA D Heineman, Y Dou, M Maddela, W Xu arXiv preprint arXiv:2305.14458, 2023 | 11 | 2023 |
Thresh: A unified, customizable and deployable platform for fine-grained text evaluation D Heineman, Y Dou, W Xu arXiv preprint arXiv:2308.06953, 2023 | 4 | 2023 |
Rethinking reasoning evaluation with theories of intelligence D Heineman | 2 | 2023 |
Improving Minimum Bayes Risk Decoding with Multi-Prompt D Heineman, Y Dou, W Xu arXiv preprint arXiv:2407.15343, 2024 | 1 | 2024 |
Establishing Task Scaling Laws via Compute-Efficient Model Ladders A Bhagia, J Liu, A Wettig, D Heineman, O Tafjord, AH Jha, L Soldaini, ... arXiv preprint arXiv:2412.04403, 2024 | | 2024 |
Towards a path dependent account of category fluency D Heineman, R Koenen, S Varma arXiv preprint arXiv:2405.06714, 2024 | | 2024 |