Sentiment analysis based error detection for large-scale systems KA Alharthi, A Jhumka, S Di, F Cappello, E Chuah 2021 51st Annual IEEE/IFIP International Conference on Dependable Systems …, 2021 | 11 | 2021 |
A survey on error-bounded lossy compression for scientific datasets S Di, J Liu, K Zhao, X Liang, R Underwood, Z Zhang, M Shah, Y Huang, ... arXiv preprint arXiv:2404.02840, 2024 | 10 | 2024 |
Clairvoyant: a log-based transformer-decoder for failure prediction in large-scale systems KA Alharthi, A Jhumka, S Di, F Cappello Proceedings of the 36th ACM International Conference on Supercomputing, 1-14, 2022 | 7 | 2022 |
Time machine: Generative real-time model for failure (and lead time) prediction in HPC systems KA Alharthi, A Jhumka, S Di, L Gui, F Cappello, S McIntosh-Smith 2023 53rd Annual IEEE/IFIP International Conference on Dependable Systems …, 2023 | 6 | 2023 |
The terminator: an AI-based framework to handle dependability threats in large-scale distributed systems KA Alharthi University of Warwick, 2023 | 2 | 2023 |
FedFa: A Fully Asynchronous Training Paradigm for Federated Learning H Xu, Z Zhang, S Di, B Liu, KA Alharthi, J Cao https://dl.acm.org/doi/10.24963/ijcai.2024/584, 2024 | 1 | 2024 |