Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Enabling resource-efficient aiot system with cross-level optimization: A survey
The emerging field of artificial intelligence of things (AIoT, AI+ IoT) is driven by the
widespread use of intelligent infrastructures and the impressive success of deep learning …
widespread use of intelligent infrastructures and the impressive success of deep learning …
Enable deep learning on mobile devices: Methods, systems, and applications
Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial
intelligence (AI), including computer vision, natural language processing, and speech …
intelligence (AI), including computer vision, natural language processing, and speech …
Gemmini: Enabling systematic deep-learning architecture evaluation via full-stack integration
DNN accelerators are often developed and evaluated in isolation without considering the
cross-stack, system-level effects in real-world environments. This makes it difficult to …
cross-stack, system-level effects in real-world environments. This makes it difficult to …
An overview of sparsity exploitation in CNNs for on-device intelligence with software-hardware cross-layer optimizations
This paper presents a detailed overview of sparsity exploitation in deep neural network
(DNN) accelerators. Despite the algorithmic advancements which drove DNNs to become …
(DNN) accelerators. Despite the algorithmic advancements which drove DNNs to become …
Efficient N:M Sparse DNN Training Using Algorithm, Architecture, and Dataflow Co-Design
Sparse training is one of the promising techniques to reduce the computational cost of deep
neural networks (DNNs) while retaining high accuracy. In particular, N: M fine-grained …
neural networks (DNNs) while retaining high accuracy. In particular, N: M fine-grained …
DDC-PIM: Efficient algorithm/architecture co-design for doubling data capacity of SRAM-based processing-in-memory
Processing-in-memory (PIM), as a novel computing paradigm, provides significant
performance benefits from the aspect of effective data movement reduction. SRAM-based …
performance benefits from the aspect of effective data movement reduction. SRAM-based …
Efficient-grad: Efficient training deep convolutional neural networks on edge devices with grad ient optimizations
With the prospering of mobile devices, the distributed learning approach, enabling model
training with decentralized data, has attracted great interest from researchers. However, the …
training with decentralized data, has attracted great interest from researchers. However, the …
Hw-adam: Fpga-based accelerator for adaptive moment estimation
The selection of the optimizer is critical for convergence in the field of on-chip training. As
one second moment optimizer, adaptive moment estimation (ADAM) shows a significant …
one second moment optimizer, adaptive moment estimation (ADAM) shows a significant …
Energy-efficient DNN training processors on micro-AI systems
Many edge/mobile devices are now able to utilize deep neural networks (DNNs) thanks to
the development of mobile DNN accelerators. Mobile DNN accelerators overcame the …
the development of mobile DNN accelerators. Mobile DNN accelerators overcame the …
THETA: A high-efficiency training accelerator for DNNs with triple-side sparsity exploration
Training deep neural networks (DNNs) on edge devices has attracted increasing attention in
real-world applications for domain adaption and privacy protection. However, deploying …
real-world applications for domain adaption and privacy protection. However, deploying …