Neural architecture generator optimization

R Ru, P Esperanca, FM Carlucci - Advances in Neural …, 2020 - proceedings.neurips.cc
Abstract Neural Architecture Search (NAS) was first proposed to achieve state-of-the-art
performance through the discovery of new architecture patterns, without human intervention …

APapo: An asynchronous parallel optimization method for DNN models

S Liu, T Ju - Future Generation Computer Systems, 2024 - Elsevier
To address the challenges related to segmentation complexity, high memory usage,
extended training duration, and low equipment utilization in parallel optimization of large …

Random topology and random multiscale map**: An automated design of multiscale and lightweight neural network for remote-sensing image recognition

J Li, M Weinmann, X Sun, W Diao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
With the proposal of neural architecture search (NAS), automated network architecture
design gradually becomes a new way in deep learning research. Due to its high capability …

深度神经网络模型任务切分及并行优化方法

巨涛, 刘帅, 王志**, **林娟 - 北京航空航天大学学报, 2022 - bhxb.buaa.edu.cn
为解决传统手工切分神经网络模型计算任务并行化方法面临的并行化难度大, 训练耗时长,
设备利用率低等问题, 提出了一种基于深度神经网络(DNN) 模型特性感知的任务切分及并行优化 …

Task segmentation and parallel optimization of DNN model

T JU, S LIU, Z WANG, L LI - 北京航空航天大学学报, 2022 - bhxb.buaa.edu.cn
In order to solve the problems of difficult parallelization, long training time, and low
equipment utilization in the traditional parallelization method of manually partitioning …

Dynamic graph: Learning instance-aware connectivity for neural networks

K Yuan, Q Li, D Chen, A Zhou, J Yan - arxiv preprint arxiv:2010.01097, 2020 - arxiv.org
One practice of employing deep neural networks is to apply the same architecture to all the
input instances. However, a fixed architecture may not be representative enough for data …

Automatic Cross-Domain Transfer Learning for Linear Regression

L **nshun, H **n, M Hui, L **g, L Weizhong… - arxiv preprint arxiv …, 2020 - arxiv.org
Transfer learning research attempts to make model induction transferable across different
domains. This method assumes that specific information regarding to which domain each …