-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning
As LLMs become increasingly prevalent across various applications, it is critical to establish
safety guardrails to moderate input/output content of LLMs. Existing guardrail models treat …
safety guardrails to moderate input/output content of LLMs. Existing guardrail models treat …
C-rag: Certified generation risks for retrieval-augmented language models
Despite the impressive capabilities of large language models (LLMs) across diverse
applications, they still suffer from trustworthiness issues, such as hallucinations and …
applications, they still suffer from trustworthiness issues, such as hallucinations and …
Legend: Leveraging Representation Engineering to Annotate Safety Margin for Preference Datasets
The success of the reward model in distinguishing between responses with subtle safety
differences depends critically on the high-quality preference dataset, which should capture …
differences depends critically on the high-quality preference dataset, which should capture …
A Survey of Generative Techniques for Spatial-Temporal Data Mining
This paper focuses on the integration of generative techniques into spatial-temporal data
mining, considering the significant growth and diverse nature of spatial-temporal data. With …
mining, considering the significant growth and diverse nature of spatial-temporal data. With …
Adaptive Token Biaser: Knowledge Editing via Biasing Key Entities
The parametric knowledge memorized by large language models (LLMs) becomes outdated
quickly. In-context editing (ICE) is currently the most effective method for updating the …
quickly. In-context editing (ICE) is currently the most effective method for updating the …