Lava: Data valuation without pre-specified learning algorithms HA Just*, F Kang*, JT Wang, Y Zeng, M Ko, M Jin, R Jia ICLR 2023 (Spotlight Presentation), 2023 | 58 | 2023 |
Data acquisition: A new frontier in data-centric AI L Chen, B Acun, N Ardalani, Y Sun, F Kang, H Lyu, Y Kwon, R Jia, CJ Wu, ... arXiv preprint arXiv:2311.13712, 2023 | 12 | 2023 |
Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources F Kang*, HA Just*, AK Sahu, R Jia NeurIPS 2023, 2023 | 11 | 2023 |
Towards robustness certification against universal perturbations Y Zeng, Z Shi, M Jin, F Kang, L Lyu, CJ Hsieh, R Jia ICLR 2023, 2022 | 10 | 2022 |
Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs F Kang, HA Just, Y Sun, H Jahagirdar, Y Zhang, R Du, AK Sahu, R Jia ICLR 2024, 2023 | 8 | 2023 |
Visual navigation with a 2-pixel camera---possibilities and limitations J Baillieul, F Kang IFAC World Congress 2020, 2021 | 4 | 2021 |
Prediction-based fast thermoelectric generator reconfiguration for energy harvesting from vehicle radiators H Yang*, F Kang*, C Ding, J Li, J Kim, D Baek, S Nazarian, X Lin, ... 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE), 877-880, 2018 | 3 | 2018 |
The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes M Ko, F Kang, W Shi, M Jin, Z Yu, R Jia CVPR 2024, 2024 | 2 | 2024 |
ASR data selection from multiple sources: A practical approach on performance scaling HA Just, IF Chen, F Kang, Y Zhang, AK Sahu, R Jia NeurIPS 2023 Workshop on Efficient Natural Language and Speech Processing …, 2023 | 2 | 2023 |
Data-Centric Defense: Shaping Loss Landscape with Augmentations to Counter Model Inversion S Chen, F Kang, N Abhyankar, M Jin, R Jia DMLR@ICML 2023, 2023 | 2 | 2023 |
Autoscale: Automatic prediction of compute-optimal data composition for training llms F Kang, Y Sun, B Wen, S Chen, D Song, R Mahmood, R Jia arXiv preprint arXiv:2407.20177, 2024 | 1 | 2024 |
FASTTRACK: Reliable Fact Tracing via Clustering and LLM-Powered Evidence Validation S Chen, F Kang, N Yu, R Jia Findings of the Association for Computational Linguistics: EMNLP 2024, 5821-5836, 2024 | | 2024 |
FASTTRACK: Fast and Accurate Fact Tracing for LLMs S Chen, F Kang, N Yu, R Jia arXiv preprint arXiv:2404.15157, 2024 | | 2024 |
2nd Workshop on Navigating and Addressing Data Problems for Foundation Models (DPFM) R Jia, PW Koh, D Song, J Andrews, HA Just, F Kang, JT Wang ICLR 2025 Workshop Proposals, 0 | | |
Navigating and Addressing Data Problems for Foundation Models (DPFM) R Jia, T Hashimoto, PW Koh, J Andrews, SM Xie, L Chen, F Kang ICLR 2024 Workshops, 0 | | |