Codexgraph: Bridging large language models and code repositories via code graph databases
Large Language Models (LLMs) excel in stand-alone code tasks like HumanEval and
MBPP, but struggle with handling entire code repositories. This challenge has prompted …
MBPP, but struggle with handling entire code repositories. This challenge has prompted …
SysBench: Can Large Language Models Follow System Messages?
Large Language Models (LLMs) have become instrumental across various applications,
with the customization of these models to specific scenarios becoming increasingly critical …
with the customization of these models to specific scenarios becoming increasingly critical …
Critic-cot: Boosting the reasoning abilities of large language model via chain-of-thoughts critic
Self-critic has become a crucial mechanism for enhancing the reasoning performance of
LLMs. However, current approaches mainly involve basic prompts for intuitive instance-level …
LLMs. However, current approaches mainly involve basic prompts for intuitive instance-level …
Scilitllm: How to adapt llms for scientific literature understanding
Scientific literature understanding is crucial for extracting targeted information and garnering
insights, thereby significantly advancing scientific discovery. Despite the remarkable …
insights, thereby significantly advancing scientific discovery. Despite the remarkable …
Training language models to critique with multi-agent feedback
Critique ability, a meta-cognitive capability of humans, presents significant challenges for
LLMs to improve. Recent works primarily rely on supervised fine-tuning (SFT) using critiques …
LLMs to improve. Recent works primarily rely on supervised fine-tuning (SFT) using critiques …
LLMCO2: Advancing Accurate Carbon Footprint Prediction for LLM Inferences
Throughout its lifecycle, a large language model (LLM) generates a substantially larger
carbon footprint during inference than training. LLM inference requests vary in batch size …
carbon footprint during inference than training. LLM inference requests vary in batch size …
A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications
With the rapid advancement of large language models (LLMs), aligning policy models with
human preferences has become increasingly critical. Direct Preference Optimization (DPO) …
human preferences has become increasingly critical. Direct Preference Optimization (DPO) …
PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning
Preference learning is a central component for aligning current LLMs, but this process can
be vulnerable to data poisoning attacks. To address this concern, we introduce …
be vulnerable to data poisoning attacks. To address this concern, we introduce …
TensorOpera Router: A Multi-Model Router for Efficient LLM Inference
With the rapid growth of Large Language Models (LLMs) across various domains, numerous
new LLMs have emerged, each possessing domain-specific expertise. This proliferation has …
new LLMs have emerged, each possessing domain-specific expertise. This proliferation has …
Aligning large language models via self-steering optimization
Automated alignment develops alignment systems with minimal human intervention. The
key to automated alignment lies in providing learnable and accurate preference signals for …
key to automated alignment lies in providing learnable and accurate preference signals for …