Efficient benchmarking of language models Y Perlitz, E Bandel, A Gera, O Arviv, L Ein-Dor, E Shnarch, N Slonim, ... arXiv preprint arXiv:2308.11696, 2023 | 31 | 2023 |
Genie: Achieving human parity in content-grounded datasets generation A Yehudai, B Carmeli, Y Mass, O Arviv, N Mills, A Toledo, E Shnarch, ... arXiv preprint arXiv:2401.14367, 2024 | 22* | 2024 |
Fine-grained analysis of cross-linguistic syntactic divergences D Nikolaev, O Arviv, T Karidi, N Kenneth, V Mitnik, LM Saeboe, O Abend arXiv preprint arXiv:2005.03436, 2020 | 21 | 2020 |
The benefits of bad advice: Autocontrastive decoding across model layers A Gera, R Friedman, O Arviv, C Gunasekara, B Sznajder, N Slonim, ... arXiv preprint arXiv:2305.01628, 2023 | 18 | 2023 |
HUJI-KU at MRP~ 2020: Two Transition-based Neural Parsers O Arviv, R Cui, D Hershcovich arXiv preprint arXiv:2010.05710, 2020 | 12 | 2020 |
TUPA at MRP 2019: A multi-task baseline system D Hershcovich, O Arviv Proceedings of the Shared Task on Cross-Framework Meaning Representation …, 2019 | 12 | 2019 |
Zero-shot topical text classification with llms-an experimental study S Gretz, A Halfon, I Shnayderman, O Toledo-Ronen, A Spector, L Dankin, ... Findings of the Association for Computational Linguistics: EMNLP 2023, 9647-9676, 2023 | 8 | 2023 |
Unitxt: Flexible, shareable and reusable data preparation and evaluation for generative ai E Bandel, Y Perlitz, E Venezian, R Friedman-Melamed, O Arviv, M Orbach, ... arXiv preprint arXiv:2401.14019, 2024 | 6 | 2024 |
On the relation between syntactic divergence and zero-shot performance O Arviv, D Nikolaev, T Karidi, O Abend arXiv preprint arXiv:2110.04644, 2021 | 4 | 2021 |
Do these LLM benchmarks agree? Fixing benchmark evaluation with BenchBench Y Perlitz, A Gera, O Arviv, A Yehudai, E Bandel, E Shnarch, ... arXiv preprint arXiv:2407.13696, 2024 | 3 | 2024 |
Benchmark agreement testing done right: A guide for llm benchmark evaluation Y Perlitz, A Gera, O Arviv, A Yehudai, E Bandel, E Shnarch, ... arXiv e-prints, arXiv: 2407.13696, 2024 | 3 | 2024 |
Autocontrastive Decoding Among Model Layers G Ariel, R Friedman-Melamed, O Arviv, B Sznajder, C Gunasekara, ... US Patent App. 18/224,497, 2025 | | 2025 |
Stay Tuned: An Empirical Study of the Impact of Hyperparameters on LLM Tuning in Real-World Applications A Halfon, S Gretz, O Arviv, A Spector, O Toledo-Ronen, Y Katz, L Ein-Dor, ... arXiv preprint arXiv:2407.18990, 2024 | | 2024 |
Improving Cross-Lingual Transfer through Subtree-Aware Word Reordering O Arviv, D Nikolaev, T Karidi, O Abend arXiv preprint arXiv:2310.13583, 2023 | | 2023 |