A comprehensive survey on trustworthy recommender systems

W Fan, X Zhao, X Chen, J Su, J Gao, L Wang… - arxiv preprint arxiv …, 2022 - arxiv.org
As one of the most successful AI-powered applications, recommender systems aim to help
people make appropriate decisions in an effective and efficient way, by providing …

Splitwise: Efficient generative llm inference using phase splitting

P Patel, E Choukse, C Zhang, A Shah… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Generative large language model (LLM) applications are growing rapidly, leading to large-
scale deployments of expensive and power-hungry GPUs. Our characterization of LLM …

Deep learning workload scheduling in gpu datacenters: A survey

Z Ye, W Gao, Q Hu, P Sun, X Wang, Y Luo… - ACM Computing …, 2024 - dl.acm.org
Deep learning (DL) has demonstrated its remarkable success in a wide variety of fields. The
development of a DL model is a time-consuming and resource-intensive procedure. Hence …

Spatten: Efficient sparse attention architecture with cascade token and head pruning

H Wang, Z Zhang, S Han - 2021 IEEE International Symposium …, 2021 - ieeexplore.ieee.org
The attention mechanism is becoming increasingly popular in Natural Language Processing
(NLP) applications, showing superior performance than convolutional and recurrent …

Chasing carbon: The elusive environmental footprint of computing

U Gupta, YG Kim, S Lee, J Tse, HHS Lee… - … Symposium on High …, 2021 - ieeexplore.ieee.org
Given recent algorithm, software, and hardware innovation, computing has enabled a
plethora of new applications. As computing becomes increasingly ubiquitous, however, so …

Hardware architecture and software stack for PIM based on commercial DRAM technology: Industrial product

S Lee, S Kang, J Lee, H Kim, E Lee… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Emerging applications such as deep neural network demand high off-chip memory
bandwidth. However, under stringent physical constraints of chip packages and system …

Elliot: A comprehensive and rigorous framework for reproducible recommender systems evaluation

VW Anelli, A Bellogín, A Ferrara, D Malitesta… - Proceedings of the 44th …, 2021 - dl.acm.org
Recommender Systems have shown to be an effective way to alleviate the over-choice
problem and provide accurate and tailored recommendations. However, the impressive …

Faa $ T: A transparent auto-scaling cache for serverless applications

F Romero, GI Chaudhry, Í Goiri, P Gopa… - Proceedings of the …, 2021 - dl.acm.org
Function-as-a-Service (FaaS) has become an increasingly popular way for users to deploy
their applications without the burden of managing the underlying infrastructure. However …

Planaria: Dynamic architecture fission for spatial multi-tenant acceleration of deep neural networks

S Ghodrati, BH Ahn, JK Kim, S Kinzer… - 2020 53rd Annual …, 2020 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) have reinvigorated real-world applications that rely on
learning patterns of data and are permeating into different industries and markets. Cloud …

RecSSD: near data processing for solid state drive based recommendation inference

M Wilkening, U Gupta, S Hsia, C Trippel… - Proceedings of the 26th …, 2021 - dl.acm.org
Neural personalized recommendation models are used across a wide variety of datacenter
applications including search, social media, and entertainment. State-of-the-art models …