B Liu, MA Ojewale, Y Ding, M Canini - … of the 15th ACM SIGOPS Asia …, 2024 - dl.acm.org
We propose NeuronaBox, a flexible, user-friendly, and high-fidelity approach to emulate DNN training workloads. We argue that to accurately observe performance, it is possible to …
NK Jha, B Reagen - arXiv preprint arXiv:2501.03489, 2025 - arxiv.org
The pervasiveness of proprietary language models has raised critical privacy concerns, necessitating advancements in private inference (PI), where computations are performed …
NK Jha, B Reagen - arXiv preprint arXiv:2410.13060, 2024 - arxiv.org
The pervasiveness of proprietary language models has raised privacy concerns for users' sensitive data, emphasizing the need for private inference (PI), where inference is performed …
NK Jha, B Reagen - arXiv preprint arXiv:2410.09637, 2024 - arxiv.org
LayerNorm is a critical component in modern large language models (LLMs) for stabilizing training and ensuring smooth optimization. However, it introduces significant challenges in …
Outlier Features (OFs) are neurons whose activation magnitudes significantly exceed the average over a neural network's (NN) width. They are well known to emerge during standard …
The concept of knowledge distillation (KD) describes the training of a student model with a teacher model and is a widespread technique in deep learning. However, it is still not clear …
Deep neural networks (NNs) recently revolutionized the field of Artificial Intelligence, making great progress in computer vision, natural language processing, complex game play …