Deep learning workload scheduling in GPU datacenters: A survey

Z Ye, W Gao, Q Hu, P Sun, X Wang, Y Luo… - ACM Computing …, 2024 - dl.acm.org
Deep learning (DL) has demonstrated its remarkable success in a wide variety of fields. The
development of a DL model is a time-consuming and resource-intensive procedure. Hence …

Current issues and perspectives in nanosensors-based artificial olfactory systems for breath diagnostics and environmental exposure monitoring

C Kim, MS Kang, IS Raja, JW Oh, YK Joung… - TrAC Trends in Analytical …, 2024 - Elsevier
Artificial olfactory systems that can provide sustainable monitoring and non-invasive
diagnostics are emerging for environmental exposure detection and exhaled breath …

MLaaS in the wild: Workload analysis and scheduling in large-scale heterogeneous GPU clusters

Q Weng, W Xiao, Y Yu, W Wang, C Wang, J He… - … USENIX Symposium on …, 2022 - usenix.org
With the sustained technological advances in machine learning (ML) and the availability of
massive datasets recently, tech companies are deploying large ML-as-a-Service (MLaaS) …

Oort: Efficient federated learning via guided participant selection

F Lai, X Zhu, HV Madhyastha… - 15th USENIX Symposium …, 2021 - usenix.org
Federated Learning (FL) is an emerging direction in distributed machine learning (ML) that
enables in-situ model training and testing on edge data. Despite having the same end goals …

INFaaS: Automated model-less inference serving

F Romero, Q Li, NJ Yadwadkar… - 2021 USENIX Annual …, 2021 - usenix.org
Despite existing work in machine learning inference serving, ease-of-use and cost efficiency
remain challenges at large scales. Developers must manually search through thousands of …

A unified architecture for accelerating distributed DNN training in heterogeneous GPU/CPU clusters

Y Jiang, Y Zhu, C Lan, B Yi, Y Cui, C Guo - 14th USENIX Symposium on …, 2020 - usenix.org
Data center clusters that run DNN training jobs are inherently heterogeneous. They have
GPUs and CPUs for computation and network bandwidth for distributed training. However …

LoongServe: Efficiently serving long-context large language models with elastic sequence parallelism

B Wu, S Liu, Y Zhong, P Sun, X Liu, X Jin - Proceedings of the ACM …, 2024 - dl.acm.org
The context window of large language models (LLMs) is rapidly increasing, leading to a
huge variance in resource usage between different requests as well as between different …

Fast distributed inference serving for large language models

B Wu, Y Zhong, Z Zhang, S Liu, F Liu, Y Sun… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) power a new generation of interactive AI applications
exemplified by ChatGPT. The interactive nature of these applications demands low latency …

Characterization of large language model development in the datacenter

Q Hu, Z Ye, Z Wang, G Wang, M Zhang… - … USENIX Symposium on …, 2024 - usenix.org
Large Language Models (LLMs) have presented impressive performance across several
transformative tasks. However, it is non-trivial to efficiently utilize large-scale cluster …

Parrot: Efficient Serving of LLM-based Applications with Semantic Variable

C Lin, Z Han, C Zhang, Y Yang, F Yang… - … USENIX Symposium on …, 2024 - usenix.org
The rise of large language models (LLMs) has enabled LLM-based applications (aka AI
agents or co-pilots), a new software paradigm that combines the strength of LLM and …