Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
[BOK][B] Embedded system design: embedded systems foundations of cyber-physical systems, and the internet of things
P Marwedel - 2021 - library.oapen.org
A unique feature of this open access textbook is to provide a comprehensive introduction to
the fundamental knowledge in embedded systems, with applications in cyber-physical …
the fundamental knowledge in embedded systems, with applications in cyber-physical …
Ithemal: Accurate, portable and fast basic block throughput estimation using deep neural networks
Predicting the number of clock cycles a processor takes to execute a block of assembly
instructions in steady state (the throughput) is important for both compiler designers and …
instructions in steady state (the throughput) is important for both compiler designers and …
Rethinking SIMD vectorization for in-memory databases
Analytical databases are continuously adapting to the underlying hardware in order to
saturate all sources of parallelism. At the same time, hardware evolves in multiple directions …
saturate all sources of parallelism. At the same time, hardware evolves in multiple directions …
Neurovectorizer: End-to-end vectorization with deep reinforcement learning
One of the key challenges arising when compilers vectorize loops for today's SIMD-
compatible architectures is to decide if vectorization or interleaving is beneficial. Then, the …
compatible architectures is to decide if vectorization or interleaving is beneficial. Then, the …
Cg: A system for programming graphics hardware in a C-like language
The latest real-time graphics architectures include programmable floating-point vertex and
fragment processors, with support for data-dependent control flow in the vertex processor …
fragment processors, with support for data-dependent control flow in the vertex processor …
The architecture of the DIVA processing-in-memory chip
The DIVA (Data IntensiVe Architecture) system incorporates a collection of Processing-In-
Memory (PIM) chips as smart-memory co-processors to a conventional microprocessor. We …
Memory (PIM) chips as smart-memory co-processors to a conventional microprocessor. We …
Synergistic processing in cell's multicore architecture
Eight synergistic processor units enable the Cell Broadband Engine's breakthrough
performance. The SPU architecture implements a novel, pervasively data-parallel …
performance. The SPU architecture implements a novel, pervasively data-parallel …
Auto-vectorization of interleaved data for SIMD
D Nuzman, I Rosen, A Zaks - ACM SIGPLAN Notices, 2006 - dl.acm.org
Most implementations of the Single Instruction Multiple Data (SIMD) model available today
require that data elements be packed in vector registers. Operations on disjoint vector …
require that data elements be packed in vector registers. Operations on disjoint vector …
Vectorization for SIMD architectures with alignment constraints
When vectorizing for SIMD architectures that are commonly employed by today's multimedia
extensions, one of the new challenges that arise is the handling of memory alignment. Prior …
extensions, one of the new challenges that arise is the handling of memory alignment. Prior …
When polyhedral transformations meet SIMD code generation
Data locality and parallelism are critical optimization objectives for performance on modern
multi-core machines. Both coarse-grain parallelism (eg, multi-core) and fine-grain …
multi-core machines. Both coarse-grain parallelism (eg, multi-core) and fine-grain …