From generation to judgment: Opportunities and challenges of llm-as-a-judge
Assessment and evaluation have long been critical challenges in artificial intelligence (AI)
and natural language processing (NLP). However, traditional methods, whether matching …
and natural language processing (NLP). However, traditional methods, whether matching …
Llms-as-judges: a comprehensive survey on llm-based evaluation methods
The rapid advancement of Large Language Models (LLMs) has driven their expanding
application across various fields. One of the most promising applications is their role as …
application across various fields. One of the most promising applications is their role as …
LegalAgentBench: Evaluating LLM Agents in Legal Domain
With the increasing intelligence and autonomy of LLM agents, their potential applications in
the legal domain are becoming increasingly apparent. However, existing general-domain …
the legal domain are becoming increasingly apparent. However, existing general-domain …