[PDF][PDF] A comprehensive survey of small language models in the era of large language models: Techniques, enhancements, applications, collaboration with llms, and …

F Wang, Z Zhang, X Zhang, Z Wu, T Mo, Q Lu… - arxiv preprint arxiv …, 2024 - ai.radensa.ru
Large language models (LLM) have demonstrated emergent abilities in text generation,
question answering, and reasoning, facilitating various tasks and domains. Despite their …

Reinforcement Learning Enhanced LLMs: A Survey

S Wang, S Zhang, J Zhang, R Hu, X Li, T Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
This paper surveys research in the rapidly growing field of enhancing large language
models (LLMs) with reinforcement learning (RL), a technique that enables LLMs to improve …

Mastering Board Games by External and Internal Planning with Language Models

J Schultz, J Adamek, M Jusup, M Lanctot… - arxiv preprint arxiv …, 2024 - arxiv.org
While large language models perform well on a range of complex tasks (eg, text generation,
question answering, summarization), robust multi-step planning and reasoning remains a …

Large Language Models are In-context Teachers for Knowledge Reasoning

J Zhao, Z Yao, Z Yang, H Yu - Findings of the Association for …, 2024 - aclanthology.org
In this work, we study in-context teaching (ICT), where a teacher provides in-context
example rationales to teach a student to reasonover unseen cases. Human teachers are …

Mentor-KD: Making Small Language Models Better Multi-step Reasoners

H Lee, J Kim, SK Lee - arxiv preprint arxiv:2410.09037, 2024 - arxiv.org
Large Language Models (LLMs) have displayed remarkable performances across various
complex tasks by leveraging Chain-of-Thought (CoT) prompting. Recently, studies have …

InstructRAG: Instructing Retrieval-Augmented Generation with Explicit Denoising

Z Wei, WL Chen, Y Meng - arxiv preprint arxiv:2406.13629, 2024 - arxiv.org
Retrieval-augmented generation (RAG) has shown promising potential to enhance the
accuracy and factuality of language models (LMs). However, imperfect retrievers or noisy …

[PDF][PDF] Leveraging Advanced Prompting Strategies in Llama-8b for Enhanced Hyperpartisan News Detection

MJ Maggini, EB Marino, PG Otero - 2024 - ceur-ws.org
This paper explores advanced prompting strategies for hyperpartisan news detection using
the Llama3-8b-Instruct model, an open-source LLM developed by Meta AI. We evaluate zero …