Evaluating large language models: A comprehensive survey

Z Guo, R Jin, C Liu, Y Huang, D Shi, L Yu, Y Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated remarkable capabilities across a broad
spectrum of tasks. They have attracted significant attention and been deployed in numerous …

Beyond discrimination: Generative AI applications and ethical challenges in forensic psychiatry

L Tortora - Frontiers in Psychiatry, 2024 - frontiersin.org
The advent and growing popularity of generative artificial intelligence (GenAI) holds the
potential to revolutionise AI applications in forensic psychiatry and criminal justice, which …

Evaluating large language models in class-level code generation

X Du, M Liu, K Wang, H Wang, J Liu, Y Chen… - Proceedings of the …, 2024 - dl.acm.org
Recently, many large language models (LLMs) have been proposed, showing advanced
proficiency in code generation. Meanwhile, many efforts have been dedicated to evaluating …

Simple techniques to bypass GenAI text detectors: implications for inclusive education

M Perkins, J Roe, BH Vu, D Postma… - International Journal of …, 2024 - Springer
This study investigates the efficacy of six major Generative AI (GenAI) text detectors when
confronted with machine-generated content modified to evade detection (n = 805). We …

Explainable fake news detection with large language model via defense among competing wisdom

B Wang, J Ma, H Lin, Z Yang, R Yang, Y Tian… - Proceedings of the …, 2024 - dl.acm.org
Most fake news detection methods learn latent feature representations based on neural
networks, which makes them black boxes to classify a piece of news without giving any …

GenAI detection tools, adversarial techniques and implications for inclusivity in higher education

M Perkins, J Roe, BH Vu, D Postma… - arXiv preprint arXiv …, 2024 - arxiv.org
This study investigates the efficacy of six major Generative AI (GenAI) text detectors when
confronted with machine-generated content that has been modified using techniques …

Evaluating large language models in process mining: Capabilities, benchmarks, and evaluation strategies

A Berti, H Kourani, H Häfke, CY Li… - … Conference on Business …, 2024 - Springer
Using Large Language Models (LLMs) for Process Mining (PM) tasks is becoming
increasingly essential, and initial approaches yield promising results. However, little …

Why do we need to employ exemplars in moral education? Insights from recent advances in research on artificial intelligence

H Han - Ethics & Behavior, 2024 - Taylor & Francis
In this paper, I examine why moral exemplars are useful and even necessary in moral
education despite several critiques. To support my point, I review recent AI research …

Leveraging large language models for preliminary security risk analysis: A mission-critical case study

M Esposito, F Palagiano - … of the 28th International Conference on …, 2024 - dl.acm.org
Preliminary security risk analysis (PSRA) provides a quick approach to identify, evaluate,
and propose remediation to potential risks in specific scenarios. The extensive expertise …

Semantic communication: A survey of its theoretical development

G Xin, P Fan, KB Letaief - Entropy, 2024 - mdpi.com
In recent years, semantic communication has received significant attention from both
academia and industry, driven by the growing demands for ultra-low latency and high …