The deep learning compiler: A comprehensive survey

M Li, Y Liu, X Liu, Q Sun, X You, H Yang… - … on Parallel and …, 2020 - ieeexplore.ieee.org
The difficulty of deploying various deep learning (DL) models on diverse DL hardware has
boosted the research and development of DL compilers in the community. Several DL …

Implementing spiking neural networks on neuromorphic architectures: A review

PK Huynh, ML Varshika, A Paul, M Isik, A Balaji… - arXiv preprint arXiv …, 2022 - arxiv.org
Recently, both industry and academia have proposed several different neuromorphic
systems to execute machine learning applications that are designed using Spiking Neural …

Pruning and quantization for deep neural network acceleration: A survey

T Liang, J Glossner, L Wang, S Shi, X Zhang - Neurocomputing, 2021 - Elsevier
Deep neural networks have been applied in many applications exhibiting extraordinary
abilities in the field of computer vision. However, complex network architectures challenge …

On the tool manipulation capability of open-source large language models

Q Xu, F Hong, B Li, C Hu, Z Chen, J Zhang - arXiv preprint arXiv …, 2023 - arxiv.org
Recent studies on software tool manipulation with large language models (LLMs) mostly rely
on closed model APIs. The industrial adoption of these models is substantially constrained …

DNNFusion: accelerating deep neural networks execution with advanced operator fusion

W Niu, J Guan, Y Wang, G Agrawal, B Ren - Proceedings of the 42nd …, 2021 - dl.acm.org
Deep Neural Networks (DNNs) have emerged as the core enabler of many major
applications on mobile devices. To achieve high accuracy, DNN models have become …

Neuromorphic intermediate representation: A unified instruction set for interoperable brain-inspired computing

JE Pedersen, S Abreu, M Jobst, G Lenz, V Fra… - Nature …, 2024 - nature.com
Spiking neural networks and neuromorphic hardware platforms that simulate neuronal
dynamics are getting wide attention and are being applied to many relevant problems using …

ScaleHLS: A new scalable high-level synthesis framework on multi-level intermediate representation

H Ye, C Hao, J Cheng, H Jeong… - … symposium on high …, 2022 - ieeexplore.ieee.org
High-level synthesis (HLS) has been widely adopted as it significantly improves the
hardware design productivity and enables efficient design space exploration (DSE). Existing …

Data movement is all you need: A case study on optimizing transformers

A Ivanov, N Dryden, T Ben-Nun, S Li… - … of Machine Learning …, 2021 - proceedings.mlsys.org
Transformers are one of the most important machine learning workloads today. Training one
is a very compute-intensive task, often taking days or weeks, and significant attention has …

BrainPy, a flexible, integrative, efficient, and extensible framework for general-purpose brain dynamics programming

C Wang, T Zhang, X Chen, S He, S Li, S Wu - eLife, 2023 - elifesciences.org
Elucidating the intricate neural mechanisms underlying brain functions requires integrative
brain dynamics modeling. To facilitate this process, it is crucial to develop a general-purpose …

tf.data: A machine learning data processing framework

DG Murray, J Simsa, A Klimovic, I Indyk - arXiv preprint arXiv:2101.12127, 2021 - arxiv.org
Training machine learning models requires feeding input data for models to ingest. Input
pipelines for machine learning jobs are often challenging to implement efficiently as they …