- Academic Search

A systematic survey and critical review on evaluating large language models: Challenges, limitations, and recommendations

MTR Laskar, S Alqahtani, MS Bari… - Proceedings of the …, 2024 - aclanthology.org

Abstract Large Language Models (LLMs) have recently gained significant attention due to
their remarkable capabilities in performing diverse tasks across various domains. However …

Enregistrer Citer Cité 12 fois Autres articles Les 4 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

A comprehensive survey on evaluating large language model applications in the medical industry

Y Huang, K Tang, M Chen, B Wang - ar** review of effectiveness, feasibility, and applications

M Casu, S Triscari, S Battiato, L Guarnera… - Appl. Sci, 2024 - mirkocasu.github.io

Mental health disorders are a leading cause of disability worldwide, and there is a global
shortage of mental health professionals. AI chatbots have emerged as a potential solution …

Enregistrer Citer Cité 17 fois Autres articles Les 3 versions Free GPT-4 Version HTML

Automated legal consulting in construction procurement using metaheuristically optimized large language models

CY Liu, JS Chou - Automation in Construction, 2025 - Elsevier

This paper introduces a hybrid optimization algorithm, Pilgrimage Walk Optimization-
Differential Evolution (PWO-DE), inspired by Taiwan's cultural traditions, to fine-tune large …

Enregistrer Citer Cité 1 fois Autres articles

[Free GPT-4]

[PDF] arxiv.org

Assessing and enhancing large language models in rare disease question-answering

G Wang, J Ran, R Tang, CY Chang, YN Chuang… - arxiv preprint arxiv …, 2024 - arxiv.org

Despite the impressive capabilities of Large Language Models (LLMs) in general medical
domains, questions remain about their performance in diagnosing rare diseases. To answer …

Enregistrer Citer Cité 6 fois Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] Exploring the effectiveness of instruction tuning in biomedical language processing

O Rohanian, M Nouriborji, S Kouchaki… - Artificial intelligence in …, 2024 - Elsevier

Abstract Large Language Models (LLMs), particularly those similar to ChatGPT, have
significantly influenced the field of Natural Language Processing (NLP). While these models …

Enregistrer Citer Cité 8 fois Autres articles Les 2 versions Free GPT-4

[Free GPT-4]

[PDF] aclanthology.org

Can large language models fix data annotation errors? an empirical study using debatepedia for query-focused text summarization

MTR Laskar, M Rahman, I Jahan… - Findings of the …, 2023 - aclanthology.org

Debatepedia is a publicly available dataset consisting of arguments and counter-arguments
on controversial topics that has been widely used for the single-document query-focused …

Enregistrer Citer Cité 6 fois Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

XY Fu, MTR Laskar, E Khasanova, C Chen… - arxiv preprint arxiv …, 2024 - arxiv.org

Large Language Models (LLMs) have demonstrated impressive capabilities to solve a wide
range of tasks without being explicitly fine-tuned on task-specific datasets. However …

Enregistrer Citer Cité 19 fois Autres articles Les 2 versions Free GPT-4 Version HTML

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

A comprehensive evaluation of large language models on benchmark biomedical text processing tasks

A systematic survey and critical review on evaluating large language models: Challenges, limitations, and recommendations

A comprehensive survey on evaluating large language model applications in the medical industry

Automated legal consulting in construction procurement using metaheuristically optimized large language models

Assessing and enhancing large language models in rare disease question-answering

[HTML][HTML] Exploring the effectiveness of instruction tuning in biomedical language processing

Can large language models fix data annotation errors? an empirical study using debatepedia for query-focused text summarization

Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?