- Academic Search

S Casper, X Davies, C Shi, TK Gilbert… - ar** to commit crimes or producing racist text. One approach to fine …‏

שמור צטט צוטט על ידי 24 מאמרים בנושא זה כל 16 הגרסאות פתיחה בתור HTML

[HTML][HTML] Fairness for machine learning software in education: A systematic map** study‏

N Pham, PN Hung, A Nguyen-Duc - Journal of Systems and Software, 2024‏ - Elsevier‏

The integration of machine learning (ML) systems into various sectors, notably education,
has great potential to transform business workflows and decision-making processes …‏

שמור צטט צוטט על ידי 3 מאמרים בנושא זה כל 3 הגרסאות

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Personalized language modeling from personalized human feedback‏

X Li, R Zhou, ZC Lipton, L Leqi - arxiv preprint arxiv:2402.05133, 2024‏ - arxiv.org‏

Personalized large language models (LLMs) are designed to tailor responses to individual
user preferences. While Reinforcement Learning from Human Feedback (RLHF) is a …‏

שמור צטט צוטט על ידי 23 מאמרים בנושא זה כל 4 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

Proportional aggregation of preferences for sequential decision making‏

N Chandak, S Goel, D Peters - Proceedings of the AAAI Conference on …, 2024‏ - ojs.aaai.org‏

We study the problem of fair sequential decision making given voter preferences. In each
round, a decision rule must choose a decision from a set of alternatives where each voter …‏

שמור צטט צוטט על ידי 15 מאמרים בנושא זה כל 11 הגרסאות פתיחה בתור HTML

יצירת התראה

צטט

חיפוש מתקדם

נשמר בספרייה שלי

Moral machine or Tyranny of the majority?

Open problems and fundamental limitations of reinforcement learning from human feedback‏

[HTML][HTML] Fairness for machine learning software in education: A systematic map** study‏

Personalized language modeling from personalized human feedback‏

Proportional aggregation of preferences for sequential decision making‏