Liwei Jiang
PhD Student, Paul G. Allen School of Computer Science & Engineering, University of Washington
Verified email at cs.washington.edu - Homepage
Title
Cited by
Year
Faith and Fate: Limits of Transformers on Compositionality
N Dziri, X Lu, M Sclar, XL Li, L Jiang, BY Lin, S Welleck, P West, ...
NeurIPS 2023, 2024
Cited by: 323; Year: 2024
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models
P West, C Bhagavatula, J Hessel, JD Hwang, L Jiang, RL Bras, X Lu, ...
NAACL 2022, 2021
Cited by: 309; Year: 2021
Investigating Machine Moral Judgement through the Delphi Experiment
L Jiang, JD Hwang, C Bhagavatula, RL Bras, JT Liang, S Levine, J Dodge, ...
Nature Machine Intelligence, 1-16, 2025
Cited by: 248*; Year: 2025
Quizbot: A Dialogue-Based Adaptive Learning System for Factual Knowledge
S Ruan, L Jiang, J Xu, BJK Tham, Z Qiu, Y Zhu, EL Murnane, E Brunskill, ...
CHI 2019, 2019
Cited by: 231; Year: 2019
Quark: Controllable Text Generation with Reinforced Unlearning
X Lu, S Welleck, J Hessel, L Jiang, L Qin, P West, P Ammanabrolu, Y Choi
NeurIPS 2022, 2022
Cited by: 181; Year: 2022
NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics
X Lu, S Welleck, P West, L Jiang, J Kasai, D Khashabi, RL Bras, L Qin, ...
NAACL 2022, 2021
Cited by: 160; Year: 2021
Soda: Million-Scale Dialogue Distillation with Social Commonsense Contextualization
H Kim, J Hessel, L Jiang, P West, X Lu, Y Yu, P Zhou, RL Bras, M Alikhani, ...
EMNLP 2023, 2022
Cited by: 127; Year: 2022
Bookbuddy: Turning Digital Materials into Interactive Foreign Language Lessons through a Voice Chatbot
S Ruan, A Willis, Q Xu, GM Davis, L Jiang, E Brunskill, JA Landay
Proceedings of the Sixth (2019) ACM Conference on Learning @ Scale, 1-4, 2019
Cited by: 117; Year: 2019
ProsocialDialog: A Prosocial Backbone for Conversational Agents
H Kim, Y Yu, L Jiang, X Lu, D Khashabi, G Kim, Y Choi, M Sap
EMNLP 2022, 2022
Cited by: 107; Year: 2022
A Roadmap to Pluralistic Alignment
T Sorensen, J Moore, J Fisher, M Gordon, N Mireshghallah, CM Rytting, ...
ICML 2024, 2024
Cited by: 87*; Year: 2024
Englishbot: An AI-Powered Conversational System for Second Language Learning
S Ruan*, L Jiang*, Q Xu*, Z Liu, GM Davis, E Brunskill, JA Landay
IUI 2021, 2021
Cited by: 86; Year: 2021
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
T Sorensen, L Jiang, JD Hwang, S Levine, V Pyatkin, P West, N Dziri, ...
AAAI 2024, 2024
Cited by: 81*; Year: 2024
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement
L Qiu, L Jiang, X Lu, M Sclar, V Pyatkin, C Bhagavatula, B Wang, Y Kim, ...
ICLR 2024, 2023
Cited by: 69*; Year: 2023
Aligning to Social Norms and Values in Interactive Narratives
P Ammanabrolu, L Jiang, M Sap, H Hajishirzi, Y Choi
NAACL 2022, 2022
Cited by: 41; Year: 2022
ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations
V Pyatkin, JD Hwang, V Srikumar, X Lu, L Jiang, Y Choi, C Bhagavatula
ACL 2023, 2022
Cited by: 40*; Year: 2022
"I'm Not Mad": Commonsense Implications of Negation and Contradiction
L Jiang, A Bosselut, C Bhagavatula, Y Choi
NAACL 2021, 2021
Cited by: 36; Year: 2021
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
S Han*, K Rao*, A Ettinger+, L Jiang+, BY Lin, N Lambert, Y Choi, N Dziri
NeurIPS D&B 2024, 2024
Cited by: 30; Year: 2024
The Generative AI Paradox: "What It Can Create, It May Not Understand"
P West, X Lu, N Dziri, F Brahman, L Li, JD Hwang, L Jiang, J Fisher, ...
ICLR 2024, 2023
Cited by: 19; Year: 2023
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models
L Jiang, K Rao, S Han, A Ettinger, F Brahman, S Kumar, N Mireshghallah, ...
NeurIPS 2024, 2024
Cited by: 17*; Year: 2024
CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
YY Chiu, L Jiang, M Antoniak, CY Park, SS Li, M Bhatia, S Ravi, ...
arXiv preprint arXiv:2404.06664, 2024
Cited by: 13; Year: 2024