Parallel convolutional processing using an integrated photonic tensor core
With the proliferation of ultrahigh-speed mobile networks and internet-connected devices,
along with the rise of artificial intelligence (AI), the world is generating exponentially …
LegoOS: A disseminated, distributed OS for hardware resource disaggregation
The monolithic server model where a server is the unit of deployment, operation, and failure
is meeting its limits in the face of several recent hardware and application trends. To improve …
Photonic multiply-accumulate operations for neural networks
It has long been known that photonic communication can alleviate the data movement
bottlenecks that plague conventional microelectronic processors. More recently, there has …
Profiling hyperscale big data processing
Computing demand continues to grow exponentially, largely driven by "big data" processing
on hyperscale data stores. At the same time, the slowdown in Moore's law is leading the …
Photonic neural networks and optics-informed deep learning fundamentals
The recent explosive compute growth, mainly fueled by the boost of artificial intelligence (AI)
and deep neural networks (DNNs), is currently instigating the demand for a novel computing …
A case study of Processing-in-Memory in off-the-Shelf systems
We evaluate a new processing-in-memory (PIM) architecture from UPMEM that was built
and deployed in an off-the-shelf server. Systems designed to perform computing in or near …
Aquoman: An analytic-query offloading machine
Analytic workloads on terabyte data-sets are often run in the cloud, where application and
storage servers are separate and connected via network. In order to saturate the storage …
Jumpgate: In-Network Processing as a Service for Data Analytics
In-network processing, where data is processed by special-purpose devices as it passes
over the network, is showing great promise at improving application performance, in …
Accelerating database analytic query workloads using an associative processor
Database analytic query workloads are heavy consumers of data-center cycles, and there is
constant demand to improve their performance. Associative processors (AP) have re …
Efficient generation of machine code for query compilers
Query compilation can make query execution extremely efficient, but it introduces additional
compilation time. The compilation time causes a relatively high overhead especially for short …