- Academic Search

A survey of processors with explicit multithreading

T Ungerer, B Robič, J Šilc - ACM Computing Surveys (CSUR), 2003 - dl.acm.org

Hardware multithreading is becoming a generally applied technique in the next generation
of microprocessors. Several multithreaded processors are announced by industry or already …

Cited by 349

Multithreaded processors

T Ungerer, B Robič, J Šilc - The Computer Journal, 2002 - academic.oup.com

The instruction-level parallelism found in a conventional instruction stream is limited. Studies
have shown the limits of processor utilization even for today's superscalar microprocessors …

Cited by 130

The wavescalar architecture

S Swanson, A Schwerin, M Mercaldi… - ACM Transactions on …, 2007 - dl.acm.org

Silicon technology will continue to provide an exponential increase in the availability of raw
transistors. Effectively translating this resource into application performance, however, is an …

Cited by 241

Handling long-latency loads in a simultaneous multithreading processor

DM Tullsen, JA Brown - Proceedings. 34th ACM/IEEE …, 2001 - ieeexplore.ieee.org

Simultaneous multithreading architectures have been defined previously with fully shared
execution resources. When one thread in such an architecture experiences a very long …

Cited by 340

Decoupled software pipelining with the synchronization array

R Rangan, N Vachharajani… - … , 2004. PACT 2004., 2004 - ieeexplore.ieee.org

Despite the success of instruction-level parallelism (ILP) optimizations in increasing the
performance of microprocessors, certain codes remain elusive. In particular, codes …

Cited by 217

Persistent processor architecture

J Zeng, J Jeong, C Jung - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org

This paper presents PPA (Persistent Processor Architecture), simple microarchitectural
support for lightweight yet performant whole-system persistence. PPA offers fully transparent …

Cited by 9

Initial observations of the simultaneous multithreading Pentium 4 processor

N Tuck, DM Tullsen - 2003 12th International Conference on …, 2003 - ieeexplore.ieee.org

We analyze an Intel Pentium 4 hyper-threading processor. The focus is to understand its
performance and the underlying reasons behind that performance. Particular attention is …

Cited by 185

Synchronization state buffer: supporting efficient fine-grain synchronization on many-core architectures

W Zhu, VC Sreedhar, Z Hu, GR Gao - Proceedings of the 34th annual …, 2007 - dl.acm.org

Efficient fine-grain synchronization is extremely important to effectively harness the
computational power of many-core architectures. However, designing and implementing …

Cited by 139

Physical experimentation with prefetching helper threads on Intel's hyper-threaded processors

D Kim, SSW Liao, PH Wang… - … Symposium on Code …, 2004 - ieeexplore.ieee.org

Pre-execution techniques have received much attention as an effective way of prefetching
cache blocks to tolerate the ever-increasing memory latency. A number of pre-execution …

Cited by 149

Exploiting fine-grained data parallelism with chip multiprocessors and fast barriers

J Sampson, R Gonzalez, JF Collard… - 2006 39th Annual …, 2006 - ieeexplore.ieee.org

We examine the ability of CMPs, due to their lower on-chip communication latencies, to
exploit data parallelism at inner-loop granularities similar to that commonly targeted by …

Cited by 116

Supporting fine-grained synchronization on a simultaneous multithreading processor

A survey of processors with explicit multithreading