ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models

H Chen, K Lv, C Hu, Y Li, Y Yuan, Y He… - arxiv preprint arxiv …, 2025 - arxiv.org
With the increasing use of Large Language Models (LLMs) in fields such as e-commerce,
domain-specific concept evaluation benchmarks are crucial for assessing their domain …