- Academic Search

A systematic survey and critical review on evaluating large language models: Challenges, limitations, and recommendations

MTR Laskar, S Alqahtani, MS Bari… - Proceedings of the …, 2024 - aclanthology.org

Large Language Models (LLMs) have recently gained significant attention due to
their remarkable capabilities in performing diverse tasks across various domains. However …

A comprehensive evaluation of large language models on benchmark biomedical text processing tasks

I Jahan, MTR Laskar, C Peng, JX Huang - Computers in biology and …, 2024 - Elsevier

Recently, Large Language Models (LLMs) have demonstrated impressive
capability to solve a wide range of tasks. However, despite their success across various …

Investigating hallucinations in pruned large language models for abstractive summarization

G Chrysostomou, Z Zhao, M Williams… - Transactions of the …, 2024 - direct.mit.edu

Despite the remarkable performance of generative large language models (LLMs) on
abstractive summarization, they face two significant challenges: their considerable size and …

Cognitive overload: Jailbreaking large language models with overloaded logical thinking

N Xu, F Wang, B Zhou, BZ Li, C Xiao… - arXiv preprint arXiv …, 2023 - arxiv.org

While large language models (LLMs) have demonstrated increasing power, they have also
given rise to a wide range of harmful behaviors. As representatives, jailbreak attacks can …

CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization

F Kirstein, JP Wahle, B Gipp, T Ruas - Journal of Artificial Intelligence …, 2025 - jair.org

Abstractive dialogue summarization is the task of distilling conversations into informative
and concise summaries. Although focused reviews have been conducted on this topic, there …

Akal Badi ya Bias: An Exploratory Study of Gender Bias in Hindi Language Technology

R Hada, S Husain, V Gumma, H Diddee… - The 2024 ACM …, 2024 - dl.acm.org

Existing research in measuring and mitigating gender bias predominantly centers on
English, overlooking the intricate challenges posed by non-English languages and the …

Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

XY Fu, MTR Laskar, E Khasanova, C Chen… - arxiv preprint arxiv …, 2024 - arxiv.org

Large Language Models (LLMs) have demonstrated impressive capabilities to solve a wide
range of tasks without being explicitly fine-tuned on task-specific datasets. However …

What's Wrong? Refining Meeting Summaries with LLM Feedback

F Kirstein, T Ruas, B Gipp - arxiv preprint arxiv:2407.11919, 2024 - arxiv.org

Meeting summarization has become a critical task since digital encounters have become a
common practice. Large language models (LLMs) show great potential in summarization …

Exploring the opportunities of large language models for summarizing palliative care consultations: A pilot comparative study

X Chen, W Zhou, R Hoda, A Li, C Bain… - Digital Health, 2024 - journals.sagepub.com

Recent developments in the field of large language models have showcased
impressive achievements in their ability to perform natural language processing tasks …

TutoAI: a cross-domain framework for AI-assisted mixed-media tutorial creation on physical tasks

Y Chen, VI Morariu, A Truong, Z Liu - … of the CHI Conference on Human …, 2024 - dl.acm.org

Mixed-media tutorials, which integrate videos, images, text, and diagrams to teach
procedural skills, offer more browsable alternatives than timeline-based videos. However …

Building real-world meeting summarization systems using large language models: A practical...
