JMedBench: A Benchmark for Evaluating Japanese Biomedical Large Language Models

J Jiang, J Huang, A Aizawa - arXiv preprint arXiv:2409.13317, 2024 - arxiv.org
Recent developments in Japanese large language models (LLMs) primarily focus on
general domains, with fewer advancements in Japanese biomedical LLMs. One obstacle is …