- Academic Search

{Information-Agnostic} flow scheduling for commodity data centers

Turnitin 降AI改写早检测系统早降重系统 Turnitin-UK版万方检测-期刊版维普编辑部版 Grammarly检测 Paperpass检测 checkpass检测 PaperYY检测

Load balancing in data center networks: A survey

J Zhang, FR Yu, S Wang, T Huang… - … Surveys & Tutorials, 2018 - ieeexplore.ieee.org

Data center networks usually employ the scale-out model to provide high bisection
bandwidth for applications. A large amount of data is required to be transferred frequently …

บันทึก อ้างอิง อ้างโดย203 บทความที่เกี่ยวข้อง

Networking for big data: A survey

S Yu, M Liu, W Dou, X Liu… - … Communications Surveys & …, 2016 - ieeexplore.ieee.org

Complementary to the fancy big data applications, networking for big data is an
indispensable supporting platform for these applications in practice. This emerging research …

บันทึก อ้างอิง อ้างโดย240 บทความที่เกี่ยวข้อง ทั้งหมด 6 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

HPCC: High precision congestion control

Y Li, R Miao, HH Liu, Y Zhuang, F Feng… - Proceedings of the …, 2019 - dl.acm.org

Congestion control (CC) is the key to achieving ultra-low latency, high bandwidth and
network stability in high-speed networks. From years of experience operating large-scale …

บันทึก อ้างอิง อ้างโดย647 บทความที่เกี่ยวข้อง ทั้งหมด 14 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Fast distributed inference serving for large language models

B Wu, Y Zhong, Z Zhang, S Liu, F Liu, Y Sun… - arxiv preprint arxiv …, 2023 - arxiv.org

Large language models (LLMs) power a new generation of interactive AI applications
exemplified by ChatGPT. The interactive nature of these applications demands low latency …

บันทึก อ้างอิง อ้างโดย88 บทความที่เกี่ยวข้อง ทั้งหมด 3 ฉบับ ดูในรูปแบบ HTML

[Free GPT-4]
[DeepSeek]

[PDF] usenix.org

Tiresias: A {GPU} cluster manager for distributed deep learning

J Gu, M Chowdhury, KG Shin, Y Zhu, M Jeon… - … USENIX Symposium on …, 2019 - usenix.org

Deep learning (DL) training jobs bring some unique challenges to existing cluster
managers, such as unpredictable training times, an all-or-nothing execution model, and …

บันทึก อ้างอิง อ้างโดย448 บทความที่เกี่ยวข้อง ทั้งหมด 14 ฉบับ ดูในรูปแบบ HTML

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Homa: A receiver-driven low-latency transport protocol using network priorities

B Montazeri, Y Li, M Alizadeh… - Proceedings of the 2018 …, 2018 - dl.acm.org

Homa is a new transport protocol for datacenter networks. It provides exceptionally low
latency, especially for workloads with a high volume of very short messages, and it also …

บันทึก อ้างอิง อ้างโดย467 บทความที่เกี่ยวข้อง ทั้งหมด 10 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Hula: Scalable load balancing using programmable data planes

N Katta, M Hira, C Kim, A Sivaraman… - Proceedings of the …, 2016 - dl.acm.org

Datacenter networks employ multi-rooted topologies (eg, Leaf-Spine, Fat-Tree) to provide
large bisection bandwidth. These topologies use a large degree of multipathing, and need a …

บันทึก อ้างอิง อ้างโดย484 บทความที่เกี่ยวข้อง ทั้งหมด 11 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Netllm: Adapting large language models for networking

D Wu, X Wang, Y Qiao, Z Wang, J Jiang, S Cui… - Proceedings of the …, 2024 - dl.acm.org

Many networking tasks now employ deep learning (DL) to solve complex prediction and
optimization problems. However, current design philosophy of DL-based algorithms entails …

บันทึก อ้างอิง อ้างโดย29 บทความที่เกี่ยวข้อง ทั้งหมด 4 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] usenix.org

Shinjuku: Preemptive Scheduling for {μsecond-scale} Tail Latency

K Kaffes, T Chong, JT Humphries, A Belay… - … USENIX Symposium on …, 2019 - usenix.org

The recently proposed dataplanes for microsecond scale applications, such as IX and
ZygOS, use non-preemptive policies to schedule requests to cores. For the many real-world …

บันทึก อ้างอิง อ้างโดย241 บทความที่เกี่ยวข้อง ทั้งหมด 12 ฉบับ ดูในรูปแบบ HTML

[Free GPT-4]
[DeepSeek]

[PDF] usenix.org

Bolt:{Sub-RTT} congestion control for {Ultra-Low} latency

S Arslan, Y Li, G Kumar, N Dukkipati - 20th USENIX Symposium on …, 2023 - usenix.org

Data center networks are inclined towards increasing line rates to 200Gbps and beyond to
satisfy the performance requirements of applications such as NVMe and distributed ML. With …

บันทึก อ้างอิง อ้างโดย46 บทความที่เกี่ยวข้อง ทั้งหมด 4 ฉบับ ดูในรูปแบบ HTML

สร้างการแจ้งเตือน

อ้างอิง

การค้นหาขั้นสูง

บันทึกไปยังคลังของฉันแล้ว

{Information-Agnostic} flow scheduling for commodity data centers

Load balancing in data center networks: A survey

Networking for big data: A survey

HPCC: High precision congestion control

Fast distributed inference serving for large language models

Tiresias: A {GPU} cluster manager for distributed deep learning

Homa: A receiver-driven low-latency transport protocol using network priorities

Hula: Scalable load balancing using programmable data planes

Netllm: Adapting large language models for networking

Shinjuku: Preemptive Scheduling for {μsecond-scale} Tail Latency

Bolt:{Sub-RTT} congestion control for {Ultra-Low} latency