SCBench: A KV Cache-Centric Analysis of Long-Context Methods
Long-context LLMs have enabled numerous downstream applications but also introduced
significant challenges related to computational and memory efficiency. To address these …