A locality-aware memory hierarchy for energy-efficient GPU architectures

M Rhu, M Sullivan, J Leng, M Erez - … of the 46th Annual IEEE/ACM …, 2013 - dl.acm.org
As GPU's compute capabilities grow, their memory hierarchy increasingly becomes a
bottleneck. Current GPU memory hierarchies use coarse-grained memory accesses to …

A 3.2 Gbps/pin 8 Gbit 1.0 V LPDDR4 SDRAM with integrated ECC engine for sub-1 V DRAM core operation

TY Oh, H Chung, JY Park, KW Lee, S Oh… - IEEE Journal of Solid …, 2014 - ieeexplore.ieee.org
A 1.0 V 8 Gbit LPDDR4 SDRAM with 3.2 Gbps/pin speed and integrated ECC engine for sub-
1 V DRAM core is presented. DRAM internal read-modify-write operation for data masked …

A 1.2 V 8 Gb 8-channel 128 GB/s high-bandwidth memory (HBM) stacked DRAM with effective I/O test circuits

DU Lee, KW Kim, KW Kim, KS Lee… - IEEE Journal of Solid …, 2014 - ieeexplore.ieee.org
Motivated by a graphics memory system that achieves multiplied bandwidth by the number
of memories per system, HBM DRAM adopts a brand new architecture, with many technical …

A 16-Gb, 18-Gb/s/pin GDDR6 DRAM with per-bit trainable single-ended DFE and PLL-less clocking

YJ Kim, HJ Kwon, SY Doo, M Ahn… - IEEE Journal of Solid …, 2018 - ieeexplore.ieee.org
The graphic DRAM standard GDDR6 is developed to overcome the limitation of previous
standards GDDR5/5X for achieving high-speed operation. This paper introduces 16-Gb …

In-dram near-data approximate acceleration for gpus

A Yazdanbakhsh, C Song, J Sacks… - Proceedings of the 27th …, 2018 - dl.acm.org
GPUs are bottlenecked by the off-chip communication bandwidth and its energy cost; hence
near-data acceleration is particularly attractive for GPUs. Integrating the accelerators within …

A 16-Gb T-coil-based GDDR6 DRAM with merged-MUX TX, optimized WCK operation, and alternative-data-bus achieving 27-Gb/s/pin in NRZ

D Lee, J Baek, HJ Kwon, DH Kwon… - IEEE Journal of Solid …, 2022 - ieeexplore.ieee.org
This article introduces a 16-Gb T-coil-based graphics double-data-rate 6 (GDDR6) dynamic
random access memory (DRAM) with merged-multiplexer (MUX) transmitter (TX), optimized …

Lossless frame memory compression using pixel-grain prediction and dynamic order entropy coding

X Lian, Z Liu, W Zhou, Z Duan - IEEE Transactions on Circuits …, 2015 - ieeexplore.ieee.org
Power constraints constitute a critical design issue for the portable video codec system, in
which the external dynamic random access memory (DRAM) accounts for more than half of …

Design considerations of HBM stacked DRAM and the memory architecture extension

DU Lee, KS Lee, Y Lee, KW Kim… - 2015 IEEE Custom …, 2015 - ieeexplore.ieee.org
Recently, the 3D stacked memory, which is known as HBM (high bandwidth memory), using
TSV process has been developed. The stacked memory structure provides increased …

[書籍][B] Dynamic RAM: technology advancements

MA Siddiqi - 2017 - books.google.com
Because of their widespread use in mainframes, PCs, and mobile audio and video devices,
DRAMs are being manufactured in ever increasing volume, both in stand-alone and in …

A 1.2 V 38nm 2.4 Gb/s/pin 2Gb DDR4 SDRAM with bank group and× 4 half-page architecture

K Koo, S Ok, Y Kang, S Kim, C Song… - … Solid-State Circuits …, 2012 - ieeexplore.ieee.org
DDR4 SDRAM is expected to realize low power consumption and high bandwidth using a
1.2 V nominal supply voltage and to be a cost-effective solution for various applications. In …