[PDF][PDF] A comprehensive survey of small language models in the era of large language models: Techniques, enhancements, applications, collaboration with llms, and …
Large language models (LLM) have demonstrated emergent abilities in text generation,
question answering, and reasoning, facilitating various tasks and domains. Despite their …
question answering, and reasoning, facilitating various tasks and domains. Despite their …
Reinforcement Learning Enhanced LLMs: A Survey
This paper surveys research in the rapidly growing field of enhancing large language
models (LLMs) with reinforcement learning (RL), a technique that enables LLMs to improve …
models (LLMs) with reinforcement learning (RL), a technique that enables LLMs to improve …
Mastering Board Games by External and Internal Planning with Language Models
While large language models perform well on a range of complex tasks (eg, text generation,
question answering, summarization), robust multi-step planning and reasoning remains a …
question answering, summarization), robust multi-step planning and reasoning remains a …
Large Language Models are In-context Teachers for Knowledge Reasoning
In this work, we study in-context teaching (ICT), where a teacher provides in-context
example rationales to teach a student to reasonover unseen cases. Human teachers are …
example rationales to teach a student to reasonover unseen cases. Human teachers are …
Mentor-KD: Making Small Language Models Better Multi-step Reasoners
Large Language Models (LLMs) have displayed remarkable performances across various
complex tasks by leveraging Chain-of-Thought (CoT) prompting. Recently, studies have …
complex tasks by leveraging Chain-of-Thought (CoT) prompting. Recently, studies have …
InstructRAG: Instructing Retrieval-Augmented Generation with Explicit Denoising
Retrieval-augmented generation (RAG) has shown promising potential to enhance the
accuracy and factuality of language models (LMs). However, imperfect retrievers or noisy …
accuracy and factuality of language models (LMs). However, imperfect retrievers or noisy …
[PDF][PDF] Leveraging Advanced Prompting Strategies in Llama-8b for Enhanced Hyperpartisan News Detection
MJ Maggini, EB Marino, PG Otero - 2024 - ceur-ws.org
This paper explores advanced prompting strategies for hyperpartisan news detection using
the Llama3-8b-Instruct model, an open-source LLM developed by Meta AI. We evaluate zero …
the Llama3-8b-Instruct model, an open-source LLM developed by Meta AI. We evaluate zero …