Sparsity-aware hardware accelerators
A special-purpose, hardware-based accelerator may include an input subsystem configured
to receive first and second vectors as operands of a full dot-product operation. The …
to receive first and second vectors as operands of a full dot-product operation. The …
Neural network architecture using convolution engine filter weight buffers
C Martin - US Patent 11,182,668, 2021 - Google Patents
Hardware for implementing a Deep Neural Network (DNN) having a convolution layer, the
hardware comprising a plurality of convolution engines each configured to perform …
hardware comprising a plurality of convolution engines each configured to perform …
Implementation of MobileNet in a CNN based digital integrated circuit
L Yang, PZ Dong, JZ Dong, B Sun - US Patent 10,360,470, 2019 - Google Patents
Method and systems of replacing operations of depthwise separable filters with first and
second replacement convolutional layers are disclosed. Depthwise separable filters …
second replacement convolutional layers are disclosed. Depthwise separable filters …
Dataflow accelerator architecture for general matrix-matrix multiplication and tensor computation in deep learning
A general matrix-matrix multiplication (GEMM) dataflow accelerator circuit is disclosed that
includes a smart 3D stacking DRAM architecture. The accelerator circuit includes a memory …
includes a smart 3D stacking DRAM architecture. The accelerator circuit includes a memory …
Array-based inference engine for machine learning
A Sodani, U Hanebutte, S Durakovic… - US Patent …, 2020 - Google Patents
An array-based inference engine includes a plurality of processing tiles arranged in a two-
dimensional array of a plurality of rows and a plurality of columns. Each processing tile …
dimensional array of a plurality of rows and a plurality of columns. Each processing tile …
Architecture for dense operations in machine learning inference engine
A Sodani, U Hanebutte, S Durakovic… - US Patent …, 2021 - Google Patents
(57) ABSTRACT A processing unit of an inference engine for machine learning (ML)
includes a first, a second, and a third register, and a matrix multiplication block. The first …
includes a first, a second, and a third register, and a matrix multiplication block. The first …
Networked motion control
KA Hay, AH Henning - US Patent App. 16/453,317, 2020 - Google Patents
The presently disclosed technology relates to networked control of machine tools. An
example system can use messages as means for transmitting intention and status in a …
example system can use messages as means for transmitting intention and status in a …
Systems and methods for assigning tasks in a neural network processor
L Fishel, EK Norden - US Patent App. 15/971,872, 2019 - Google Patents
Embodiments relate to managing tasks that when executed by a neural processor circuit
instantiates a neural network. The neural processor circuit includes neural engine circuits …
instantiates a neural network. The neural processor circuit includes neural engine circuits …
Combined world-space pipeline shader stages
MP Nijasure, RW Ramsey, T Martin - US Patent 10,460,513, 2019 - Google Patents
Improvements to graphics processing pipelines are disclosed. More specifically, the vertex
shader stage, which performs vertex transformations, and the hull or geometry shader …
shader stage, which performs vertex transformations, and the hull or geometry shader …
Hardware environment and method of performing matrix multiplication in artificial intelligence applications
A plurality of hardware accelerators are interconnected and include a special processing
unit and accelerator memory. At least one host computer is coupled to each of the plurality of …
unit and accelerator memory. At least one host computer is coupled to each of the plurality of …