Lexeval: A comprehensive chinese legal benchmark for evaluating large language models
Large language models (LLMs) have made significant progress in natural language
processing tasks and demonstrate considerable potential in the legal domain. However …
processing tasks and demonstrate considerable potential in the legal domain. However …
A survey on large language models for critical societal domains: Finance, healthcare, and law
In the fast-evolving domain of artificial intelligence, large language models (LLMs) such as
GPT-3 and GPT-4 are revolutionizing the landscapes of finance, healthcare, and law …
GPT-3 and GPT-4 are revolutionizing the landscapes of finance, healthcare, and law …
A comprehensive survey on generative AI for metaverse: enabling immersive experience
Abstract Generative Artificial Intelligence models are Artificial Intelligence models that
generate new content based on a prompt or input. The output content can be in various …
generate new content based on a prompt or input. The output content can be in various …
A Survey on LLM-as-a-Judge
Accurate and consistent evaluation is crucial for decision-making across numerous fields,
yet it remains a challenging task due to inherent subjectivity, variability, and scale. Large …
yet it remains a challenging task due to inherent subjectivity, variability, and scale. Large …
[HTML][HTML] Pre-trained language models for keyphrase prediction: A review
Keyphrase Prediction (KP) is essential for identifying keyphrases in a document that can
summarize its content. However, recent Natural Language Processing (NLP) advances have …
summarize its content. However, recent Natural Language Processing (NLP) advances have …
Exploiting privacy vulnerabilities in open source llms using maliciously crafted prompts
G Choquet, A Aizier, G Bernollin - 2024 - researchsquare.com
The proliferation of AI technologies has brought to the forefront concerns regarding the
privacy and security of user data, particularly with the increasing deployment of powerful …
privacy and security of user data, particularly with the increasing deployment of powerful …
Multi-modal and multi-agent systems meet rationality: A survey
Rationality is characterized by logical thinking and decision-making that align with evidence
and logical rules. This quality is essential for effective problem-solving, as it ensures that …
and logical rules. This quality is essential for effective problem-solving, as it ensures that …
[HTML][HTML] Potential of multimodal large language models for data mining of medical images and free-text reports
Medical images and radiology reports are essential for physicians to diagnose medical
conditions. However, the vast diversity and cross-source heterogeneity inherent in these …
conditions. However, the vast diversity and cross-source heterogeneity inherent in these …
Rupbench: Benchmarking reasoning under perturbations for robustness evaluation in large language models
With the increasing use of large language models (LLMs), ensuring reliable performance in
diverse, real-world environments is essential. Despite their remarkable achievements, LLMs …
diverse, real-world environments is essential. Despite their remarkable achievements, LLMs …
Programming refusal with conditional activation steering
LLMs have shown remarkable capabilities, but precisely controlling their response behavior
remains challenging. Existing activation steering methods alter LLM behavior …
remains challenging. Existing activation steering methods alter LLM behavior …