Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Characterizing power management opportunities for llms in the cloud
Recent innovation in large language models (LLMs), and their myriad use cases have
rapidly driven up the compute demand for datacenter GPUs. Several cloud providers and …
rapidly driven up the compute demand for datacenter GPUs. Several cloud providers and …
Designing cloud servers for lower carbon
To mitigate climate change, we must reduce carbon emissions from hyperscale cloud
computing. We find that cloud compute servers cause the majority of emissions in a general …
computing. We find that cloud compute servers cause the majority of emissions in a general …
Dynamollm: Designing llm inference clusters for performance and energy efficiency
The rapid evolution and widespread adoption of generative large language models (LLMs)
have made them a pivotal workload in various applications. Today, LLM inference clusters …
have made them a pivotal workload in various applications. Today, LLM inference clusters …
Hyrax:{Fail-in-Place} server operation in cloud platforms
Today's cloud platforms handle server hardware failures by shutting down the affected
server and only turning it back online once it has been repaired by a technician. At cloud …
server and only turning it back online once it has been repaired by a technician. At cloud …
Cost-efficient overclocking in immersion-cooled datacenters
Cloud providers typically use air-based solutions for cooling servers in datacenters.
However, increasing transistor counts and the end of Dennard scaling will result in chips …
However, increasing transistor counts and the end of Dennard scaling will result in chips …
Peeling back the carbon curtain: Carbon optimization challenges in cloud computing
The increasing carbon emissions from cloud computing requires new methods to reduce its
environmental impact. We explore extending data center server lifetimes to reduce …
environmental impact. We explore extending data center server lifetimes to reduce …
Flex: High-availability datacenters with zero reserved power
Cloud providers, like Amazon and Microsoft, must guarantee high availability for a large
fraction of their workloads. For this reason, they build datacenters with redundant …
fraction of their workloads. For this reason, they build datacenters with redundant …
SmartOClock: Workload-and risk-aware overclocking in the cloud
Operating server components beyond their voltage and power design limit (ie, overclocking)
enables improving performance and lowering cost for cloud workloads. However …
enables improving performance and lowering cost for cloud workloads. However …
Towards improved power management in cloud gpus
As modern server GPUs are increasingly power intensive, better power management
mechanisms can significantly reduce the power consumption, capital costs, and carbon …
mechanisms can significantly reduce the power consumption, capital costs, and carbon …
Redesigning data centers for renewable energy
Renewable energy is becoming an important power source for data centers, especially with
the zero-carbon waste pledges made by big cloud providers. However, one of the main …
the zero-carbon waste pledges made by big cloud providers. However, one of the main …