MIMONets: Multiple-Input-Multiple-Output Neural Networks Exploiting Computation in Superposition

N Menet, M Hersche, G Karunaratne… - Advances in …, 2023 - proceedings.neurips.cc
With the advent of deep learning, progressively larger neural networks have been designed
to solve complex tasks. We take advantage of these capacity-rich models to lower the cost of …

RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference

Y Xu, X Guo, Z Zeng, C Miao - arXiv preprint arXiv:2410.04519, 2024 - arxiv.org
Large language models (LLMs) have brought a great breakthrough to the natural language
processing (NLP) community, while leading the challenge of handling concurrent customer …