Applications and techniques for fast machine learning in science

AMC Deiana, N Tran, J Agar, M Blott… - Frontiers in big …, 2022 - frontiersin.org
In this community review report, we discuss applications and techniques for fast machine
learning (ML) in science—the concept of integrating powerful ML methods into the real-time …

Inceptionnext: When inception meets convnext

W Yu, P Zhou, S Yan, X Wang - Proceedings of the IEEE/cvf …, 2024 - openaccess.thecvf.com
Inspired by the long-range modeling ability of ViTs large-kernel convolutions are widely
studied and adopted recently to enlarge the receptive field and improve model performance …

Metaformer baselines for vision

W Yu, C Si, P Zhou, M Luo, Y Zhou… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
MetaFormer, the abstracted architecture of Transformer, has been found to play a significant
role in achieving competitive performance. In this paper, we further explore the capacity of …

Metaformer is actually what you need for vision

W Yu, M Luo, P Zhou, C Si, Y Zhou… - Proceedings of the …, 2022 - openaccess.thecvf.com
Transformers have shown great potential in computer vision tasks. A common belief is their
attention-based token mixer module contributes most to their competence. However, recent …

Inception transformer

C Si, W Yu, P Zhou, Y Zhou… - Advances in Neural …, 2022 - proceedings.neurips.cc
Recent studies show that transformer has strong capability of building long-range
dependencies, yet is incompetent in capturing high frequencies that predominantly convey …

A survey of quantization methods for efficient neural network inference

A Gholami, S Kim, Z Dong, Z Yao… - Low-power computer …, 2022 - taylorfrancis.com
This chapter provides approaches to the problem of quantizing the numerical values in deep
Neural Network computations, covering the advantages/disadvantages of current methods …

Mambaout: Do we really need mamba for vision?

W Yu, X Wang - arxiv preprint arxiv:2405.07992, 2024 - arxiv.org
Mamba, an architecture with RNN-like token mixer of state space model (SSM), was recently
introduced to address the quadratic complexity of the attention mechanism and …

[HTML][HTML] A machine learning method for defect detection and visualization in selective laser sintering based on convolutional neural networks

E Westphal, H Seitz - Additive Manufacturing, 2021 - Elsevier
Part defects and irregularities that influence the part quality is an especially large problem in
additive manufacturing (AM) processes such as selective laser sintering (SLS). Destructive …

Deep separable convolutional network for remaining useful life prediction of machinery

B Wang, Y Lei, N Li, T Yan - Mechanical systems and signal processing, 2019 - Elsevier
Deep learning is gaining attention in data-driven remaining useful life (RUL) prediction of
machinery because of its powerful representation learning ability. With the help of deep …

Xception: Deep learning with depthwise separable convolutions

F Chollet - Proceedings of the IEEE conference on …, 2017 - openaccess.thecvf.com
We present an interpretation of Inception modules in convolutional neural networks as being
an intermediate step in-between regular convolution and the depthwise separable …