SafetyPrompts: a systematic review of open datasets for evaluating and improving large language model safety

P Röttger, F Pernisi, B Vidgen, D Hovy - arXiv preprint arXiv:2404.05399, 2024 - arxiv.org
The last two years have seen a rapid growth in concerns around the safety of large
language models (LLMs). Researchers and practitioners have met these concerns by …

Towards bidirectional human-AI alignment: A systematic review for clarifications, framework, and future directions

H Shen, T Knearem, R Ghosh, K Alkiek… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advancements in general-purpose AI have highlighted the importance of guiding AI
systems towards the intended goals, ethical principles, and values of individuals and …

Political compass or spinning arrow? Towards more meaningful evaluations for values and opinions in large language models

P Röttger, V Hofmann, V Pyatkin, M Hinck… - arXiv preprint arXiv …, 2024 - arxiv.org
Much recent work seeks to evaluate values and opinions in large language models (LLMs)
using multiple-choice surveys and questionnaires. Most of this work is motivated by …

Open problems in technical AI governance

A Reuel, B Bucknall, S Casper, T Fist, L Soder… - arXiv preprint arXiv …, 2024 - arxiv.org
AI progress is creating a growing range of risks and opportunities, but it is often unclear how
they should be navigated. In many cases, the barriers and uncertainties faced are at least …

Conifer: Improving complex constrained instruction-following ability of large language models

H Sun, L Liu, J Li, F Wang, B Dong, R Lin… - arXiv preprint arXiv …, 2024 - arxiv.org
The ability of large language models (LLMs) to follow instructions is crucial to real-world
applications. Despite recent advances, several studies have highlighted that LLMs struggle …

Gender, race, and intersectional bias in resume screening via language model retrieval

K Wilson, A Caliskan - Proceedings of the AAAI/ACM Conference on AI …, 2024 - ojs.aaai.org
Artificial intelligence (AI) hiring tools have revolutionized resume screening, and large
language models (LLMs) have the potential to do the same. However, given the biases …

Beyond static AI evaluations: advancing human interaction evaluations for LLM harms and risks

L Ibrahim, S Huang, L Ahmad, M Anderljung - arXiv preprint arXiv …, 2024 - arxiv.org
Model evaluations are central to understanding the safety, risks, and societal impacts of AI
systems. While most real-world AI applications involve human-AI interaction, most current …

Structured chemistry reasoning with large language models

S Ouyang, Z Zhang, B Yan, X Liu, Y Choi, J Han… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) excel in diverse areas, yet struggle with complex scientific
reasoning, especially in the field of chemistry. Different from the simple chemistry tasks (e.g. …

Instruct and extract: Instruction tuning for on-demand information extraction

Y Jiao, M Zhong, S Li, R Zhao, S Ouyang, H Ji… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models with instruction-following capabilities open the door to a wider
group of users. However, when it comes to information extraction, a classic task in natural …

Dolomites: Domain-Specific Long-Form Methodical Tasks

C Malaviya, P Agrawal, K Ganchev… - Transactions of the …, 2025 - direct.mit.edu
Experts in various fields routinely perform methodical writing tasks to plan, organize, and
report their work. From a clinician writing a differential diagnosis for a patient, to a teacher …