Robust question answering against distribution shifts with test-time adaptation: An empirical study
A deployed question answering (QA) model can easily fail when the test data has a
distribution shift compared to the training data. Robustness tuning (RT) methods have been …