A systematic survey and critical review on evaluating large language models: Challenges, limitations, and recommendations
Abstract: Large Language Models (LLMs) have recently gained significant attention due to
their remarkable capabilities in performing diverse tasks across various domains. However …
A comprehensive evaluation of large language models on benchmark biomedical text processing tasks
Abstract: Recently, Large Language Models (LLMs) have demonstrated impressive
capability to solve a wide range of tasks. However, despite their success across various …
Investigating hallucinations in pruned large language models for abstractive summarization
Despite the remarkable performance of generative large language models (LLMs) on
abstractive summarization, they face two significant challenges: their considerable size and …
Cognitive overload: Jailbreaking large language models with overloaded logical thinking
While large language models (LLMs) have demonstrated increasing power, they have also
given rise to a wide range of harmful behaviors. As representatives, jailbreak attacks can …
CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization
Abstractive dialogue summarization is the task of distilling conversations into informative
and concise summaries. Although focused reviews have been conducted on this topic, there …
Akal Badi ya Bias: An Exploratory Study of Gender Bias in Hindi Language Technology
Existing research in measuring and mitigating gender bias predominantly centers on
English, overlooking the intricate challenges posed by non-English languages and the …
Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?
Large Language Models (LLMs) have demonstrated impressive capabilities to solve a wide
range of tasks without being explicitly fine-tuned on task-specific datasets. However …
What's Wrong? Refining Meeting Summaries with LLM Feedback
Meeting summarization has become a critical task since digital encounters have become a
common practice. Large language models (LLMs) show great potential in summarization …
Exploring the opportunities of large language models for summarizing palliative care consultations: A pilot comparative study
Introduction: Recent developments in the field of large language models have showcased
impressive achievements in their ability to perform natural language processing tasks …
TutoAI: a cross-domain framework for AI-assisted mixed-media tutorial creation on physical tasks
Mixed-media tutorials, which integrate videos, images, text, and diagrams to teach
procedural skills, offer more browsable alternatives than timeline-based videos. However …