Systems and methods for systolic array design from a high-level program
Systems and methods for automated systolic array design from a high-level program are
disclosed. One implementation of a systolic array design supporting a convolutional neural …
Pipelined approach to fused kernels for optimization of machine learning workloads on graphical processing units
A method for optimization of machine learning (ML) workloads on a
graphics processor unit (GPU). The method includes identifying a computation having a …
Polyhedral optimization of tensorflow computation graphs
We present R-Stream·TF, a polyhedral optimization tool for neural network
computations. R-Stream·TF transforms computations performed in a neural network graph …
Data-parallel computation management
Data-parallel computation programs may be improved by, for example, determining the
functional properties of user-defined functions (UDFs), eliminating unnecessary data-shuffling …
Change data capturing during an upgrade
A Engelko, W Hoprich, D Debertin… - US Patent …, 2015 - Google Patents
In one general aspect, a computer system can include instructions stored on a
non-transitory computer-readable storage medium. The computer system can include a logging …
Systems and methods for generating code for parallel processing units
G Venkataramani, RP Kokku, J Shankar… - US Patent …, 2021 - Google Patents
Systems and methods generate code from a source program where the generated code may
be compiled and executed on a Graphics Processing Unit (GPU). A parallel loop analysis …
Optimization of loops and data flow sections in multi-core processor environment
M Vorbach - US Patent 9,672,188, 2017 - Google Patents
The present invention relates to a method for compiling code for a multi-core processor,
comprising: detecting and optimizing a loop, partitioning the loop into partitions executable …
Automatic optimization for programming of many-core architectures
The present invention extends to methods, systems, and computer program products for
automatically optimizing memory accesses by kernel functions executing on parallel …
Performing a compiler optimization pass as a transaction
I D'souza - US Patent 10,289,395, 2019 - Google Patents
a process virtual machine is generally understood to be a platform dependent engine that
executes platform independent code written in a high-level programming language for …
Method for optimizing loop processing under constraint on processors to be used
M Arai - US Patent App. 15/151,611, 2016 - Google Patents
A program that a computer is to be caused to execute is created, for
example, by using a high-level language, and is transformed into a computer-executable …