State-of-the-art on thermal energy storage technologies in data center

L Liu, Q Zhang, ZJ Zhai, C Yue, X Ma - Energy and Buildings, 2020 - Elsevier
Data centers consume a great amount of energy and account for an increasing proportion
of global energy demand. Low efficiency of cooling systems leads to a cooling cost of about …

Dynamollm: Designing llm inference clusters for performance and energy efficiency

J Stojkovic, C Zhang, Í Goiri, J Torrellas… - arxiv preprint arxiv …, 2024 - arxiv.org
… to shave rare power peaks and add more servers to a
datacenter, thereby oversubscribing its resources and lowering capital costs. This works well …

AsymNVM: An efficient framework for implementing persistent data structures on asymmetric NVM architecture

T Ma, M Zhang, K Chen, Z Song, Y Wu… - Proceedings of the Twenty …, 2020 - dl.acm.org
The byte-addressable non-volatile memory (NVM) is a promising technology since it
simultaneously provides DRAM-like performance, disk-like capacity, and persistency. The …

Flex: High-availability datacenters with zero reserved power

C Zhang, AG Kumbhare, I Manousakis… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Cloud providers, like Amazon and Microsoft, must guarantee high availability for a large
fraction of their workloads. For this reason, they build datacenters with redundant …

Thunderbolt: Throughput-Optimized, Quality-of-Service-Aware Power Capping at Scale

S Li, X Wang, F Kalim, X Zhang, SA Jyothi… - … USENIX Symposium on …, 2020 - usenix.org

CuttleSys: Data-driven resource management for interactive services on reconfigurable multicores

N Kulkarni, G Gonzalez-Pumariega… - 2020 53rd Annual …, 2020 - ieeexplore.ieee.org
Multi-tenancy for latency-critical applications leads to resource interference and
unpredictable performance. Core reconfiguration opens up more opportunities for …

TAPAS: Thermal-and Power-Aware Scheduling for LLM Inference in Cloud Platforms

J Stojkovic, C Zhang, Í Goiri, E Choukse, H Qiu… - arxiv preprint arxiv …, 2025 - arxiv.org
The rising demand for generative large language models (LLMs) poses challenges for
thermal and power management in cloud datacenters. Traditional techniques are often …