Evaluation of LLM-based chatbots for OSINT-based Cyber Threat Awareness

S Shafee, A Bessani, PM Ferreira - Expert Systems with Applications, 2024 - Elsevier
Abstract Knowledge sharing about emerging threats is crucial in the rapidly advancing field
of cybersecurity and forms the foundation of Cyber Threat Intelligence (CTI). In this context …

Assessing the medical reasoning skills of GPT-4 in complex ophthalmology cases

D Milad, F Antaki, J Milad, A Farah, T Khairy… - British Journal of …, 2024 - bjo.bmj.com
Background/aims This study assesses the proficiency of Generative Pre-trained Transformer
(GPT)-4 in answering questions about complex clinical ophthalmology cases. Methods We …

Low-cost language models: Survey and performance evaluation on Python code generation

JL Espejel, MSY Alassan, M Bouhandi… - … Applications of Artificial …, 2025 - Elsevier
Abstract Large Language Models (LLMs) have become a popular choice for many Natural
Language Processing (NLP) tasks due to their versatility and ability to produce high-quality …

Offline prompt polishing for low quality instructions

J Yu, Z Zhou, L Li, L Li, Y Yan, R Xu, Z Lan - Neurocomputing, 2024 - Elsevier
Instruction-tuning is an effective avenue for making large language models (LLMs) better at
following real users' instructions. However, it is challenging in aligning to human preference …