- Academic Search

M Khairy, Z Shen, TM Aamodt… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org

In computer architecture, significant innovation frequently comes from industry. However, the
simulation tools used by industry are often not released for open use, and even when they …

Save Cite Cited by 315 Related articles All 10 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] princeton.edu

Llmcompass: Enabling efficient hardware design for large language model inference

H Zhang, A Ning, RB Prabhakar… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org

The past year has witnessed the increasing popularity of Large Language Models (LLMs).
Their unprecedented scale and associated high hardware cost have impeded their broader …

Save Cite Cited by 12 Related articles All 7 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] nsf.gov

Ferroelectric ternary content addressable memories for energy-efficient associative search

X Yin, Y Qian, M Imani, K Ni, C Li… - … on Computer-Aided …, 2022 - ieeexplore.ieee.org

A fast and efficient search function across the database has been a core component for a
number of data-intensive tasks in machine learning, IoT applications, and inference …

Save Cite Cited by 30 Related articles All 4 versions Free GPT-4 DeepSeek

Need for speed: Experiences building a trustworthy system-level gpu simulator

O Villa, D Lustig, Z Yan, E Bolotin, Y Fu… - … Symposium on High …, 2021 - ieeexplore.ieee.org

The demands of high-performance computing (HPC) and machine learning (ML) workloads
have resulted in the rapid architectural evolution of GPUs over the last decade. The growing …

Save Cite Cited by 46 Related articles All 3 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A hardware evaluation framework for large language model inference

H Zhang, A Ning, R Prabhakar, D Wentzlaff - arxiv preprint arxiv …, 2023 - arxiv.org

The past year has witnessed the increasing popularity of Large Language Models (LLMs).
Their unprecedented scale and associated high hardware cost have impeded their broader …

Save Cite Cited by 9 Related articles All 4 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] github.io

Navisim: A highly accurate gpu simulator for amd rdna gpus

Y Bao, Y Sun, Z Feric, MT Shen, M Weston… - Proceedings of the …, 2022 - dl.acm.org

As GPUs continue to grow in popularity for accelerating demanding applications, such as
high-performance computing and machine learning, GPU architects need to deliver more …

Save Cite Cited by 14 Related articles All 6 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] supercomputing.org

Cuda flux: A lightweight instruction profiler for cuda applications

L Braun, H Fröning - 2019 IEEE/ACM Performance Modeling …, 2019 - ieeexplore.ieee.org

GPUs are powerful, massively parallel processors, which require a vast amount of thread
parallelism to keep their thousands of execution units busy, and to tolerate latency when …

Save Cite Cited by 32 Related articles All 5 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Exploring modern GPU memory system design challenges through accurate modeling

M Khairy, J Akshay, T Aamodt, TG Rogers - arxiv preprint arxiv …, 2018 - arxiv.org

This paper explores the impact of simulator accuracy on architecture design decisions in the
general-purpose graphics processing unit (GPGPU) space. We perform a detailed …

Save Cite Cited by 38 Related articles All 4 versions Free GPT-4 DeepSeek View as HTML

GPUCloudSim: an extension of CloudSim for modeling and simulation of GPUs in cloud data centers

A Siavashi, M Momtazpour - The Journal of Supercomputing, 2019 - Springer

Recent years have witnessed an increasing growth in the usage of GPUs in cloud data
centers. It is known that conventional virtualization techniques are not directly applicable to …

Save Cite Cited by 31 Related articles All 3 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Daisen: A framework for visualizing detailed gpu execution

Y Sun, Y Zhang, A Mosallaei, MD Shah… - Computer Graphics …, 2021 - Wiley Online Library

Abstract Graphics Processing Units (GPUs) have been widely used to accelerate artificial
intelligence, physics simulation, medical imaging, and information visualization applications …

Save Cite Cited by 15 Related articles All 12 versions Free GPT-4 DeepSeek

Create alert

Cite

Advanced search

Saved to My library

Multi2sim kepler: A detailed architectural gpu simulator

Accel-sim: An extensible simulation framework for validated gpu modeling

Llmcompass: Enabling efficient hardware design for large language model inference

Ferroelectric ternary content addressable memories for energy-efficient associative search

Need for speed: Experiences building a trustworthy system-level gpu simulator

A hardware evaluation framework for large language model inference

Navisim: A highly accurate gpu simulator for amd rdna gpus

Cuda flux: A lightweight instruction profiler for cuda applications

Exploring modern GPU memory system design challenges through accurate modeling

GPUCloudSim: an extension of CloudSim for modeling and simulation of GPUs in cloud data centers

Daisen: A framework for visualizing detailed gpu execution