KMMLU: Measuring massive multitask language understanding in Korean G Son, H Lee, S Kim, S Kim, N Muennighoff, T Choi, C Park, KM Yoo, ... arXiv preprint arXiv:2402.11548, 2024 | 30 | 2024 |
Beyond classification: Financial reasoning in state-of-the-art language models G Son, H Jung, M Hahm, K Na, S Jin arXiv preprint arXiv:2305.01505, 2023 | 28* | 2023 |
HAE-RAE Bench: Evaluation of Korean knowledge in language models G Son, H Lee, S Kim, H Kim, J Lee, JW Yeom, J Jung, JW Kim, S Kim arXiv preprint arXiv:2309.02706, 2023 | 14* | 2023 |
The BiGGen Bench: A principled benchmark for fine-grained evaluation of language models with language models S Kim, J Suk, JY Cho, S Longpre, C Kim, D Yoon, G Son, Y Cho, ... arXiv preprint arXiv:2406.05761, 2024 | 13* | 2024 |
LLM-as-a-judge & reward model: What they can and cannot do G Son, H Ko, H Lee, Y Kim, S Hong arXiv preprint arXiv:2409.11239, 2024 | 9 | 2024 |
Removing Non-Stationary Knowledge From Pre-Trained Language Models for Entity-Level Sentiment Classification in Finance G Son, H Lee, N Kang, M Hahm to appear at Muffin@2023, 2023 | 9 | 2023 |
Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once? G Son, S Baek, S Nam, I Jeong, S Kim ACL 2024, 2024 | 7 | 2024 |
MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models G Son, D Yoon, J Suk, J Aula-Blasco, M Aslan, VT Kim, SB Islam, ... arXiv preprint arXiv:2410.17578, 2024 | 4* | 2024 |
KRX Bench: Automating Financial Benchmark Creation via Large Language Models G Son, H Jeon, C Hwang, H Jung Proceedings of the Joint Workshop of the 7th Financial Technology and …, 2024 | 2 | 2024 |
ESG classification by implicit rule learning via GPT-4 Y Hyojeong, K Chanyoung, M Hahm, K Kim, G Son Proceedings of the Joint Workshop of the 7th Financial Technology and …, 2024 | 1 | 2024 |
FINALE: Finance domain instruction-tuning dataset with high-quality rationales via chain-of-thought prompting S Lee, S Oh, S Park, G Son, P Kang Proceedings of the Eighth Financial Technology and Natural Language …, 2024 | 1 | 2024 |
Multi-Step Reasoning in Korean and the Emergent Mirage G Son, H Ko, D Choi arXiv preprint arXiv:2501.05712, 2025 | | 2025 |
Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap H Ko, G Son, D Choi arXiv preprint arXiv:2501.02448, 2025 | | 2025 |
Improving Fine-grained Visual Understanding in VLMs through Text-Only Training D Choi, G Son, SY Kim, G Paik, S Hong arXiv preprint arXiv:2412.12940, 2024 | | 2024 |
Neural Networks for Delta Hedging G Son, J Kim arXiv preprint arXiv:2112.10084, 2021 | | 2021 |
SG-MLP: Switch Gated Multi-Layer Perceptron Model for Natural Language Understanding G Son, S Kim, SJ Joo, W Cho, JE Nah Proceedings of the Korea Information Processing Society Conference, 1116-1119, 2021 | | 2021 |