Run, don't walk: chasing higher FLOPS for faster neural networks

J Chen, S Kao, H He, W Zhuo, S Wen… - Proceedings of the …, 2023 - openaccess.thecvf.com
To design fast neural networks, many works have been focusing on reducing the number of
floating-point operations (FLOPs). We observe that such reduction in FLOPs, however, does …

SCConv: Spatial and channel reconstruction convolution for feature redundancy

J Li, Y Wen, L He - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com
Convolutional Neural Networks (CNNs) have achieved remarkable performance in
various computer vision tasks but this comes at the cost of tremendous computational …

MALUNet: A multi-attention and light-weight UNet for skin lesion segmentation

J Ruan, S Xiang, M Xie, T Liu… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Recently, some pioneering works have preferred applying more complex modules to
improve segmentation performances. However, it is not friendly for actual clinical …

DCAM-NET: A novel domain generalization optic cup and optic disc segmentation pipeline with multi-region and multi-scale convolution attention mechanism

K Hua, X Fang, Z Tang, Y Cheng, Z Yu - Computers in Biology and …, 2023 - Elsevier
Fundus images are an essential basis for diagnosing ocular diseases, and using
convolutional neural networks has shown promising results in achieving accurate fundus …

MEW-UNet: Multi-axis representation learning in frequency domain for medical image segmentation

J Ruan, M Xie, S Xiang, T Liu, Y Fu - arXiv preprint arXiv:2210.14007, 2022 - arxiv.org
Recently, Visual Transformer (ViT) has been widely used in various fields of computer vision
due to applying self-attention mechanism in the spatial domain to modeling global …

Learning multi-axis representation in frequency domain for medical image segmentation

J Ruan, J Gao, M Xie, S Xiang - Machine Learning, 2025 - Springer
Recently, Visual Transformer (ViT) has been extensively used in medical image
segmentation (MIS) due to applying self-attention mechanism in the spatial domain to …

Dypa: a machine learning dyslexia prescreening mobile application for Chinese children

S Zhong, S Song, T Tang, F Nie, X Zhou… - Proceedings of the …, 2023 - dl.acm.org
Identifying early a person with dyslexia, a learning disorder with reading and writing, is
critical for effective treatment. As accredited specialists for clinical diagnosis of dyslexia are …

HV-YOLOv8 by HDPconv: Better lightweight detectors for small object detection

W Wang, Y Meng, S Li, C Zhang - Image and Vision Computing, 2024 - Elsevier
Accurately identifying and localising small objects within images or videos is a critical
challenge in the field of computer vision. It is mostly applied in scenarios that require high …

Expression-guided deep joint learning for facial expression recognition

B Fang, Y Zhao, G Han, J He - Sensors, 2023 - mdpi.com
In recent years, convolutional neural networks (CNNs) have played a dominant role in facial
expression recognition. While CNN-based methods have achieved remarkable success …

Learning generalizable visual representation via adaptive spectral random convolution for medical image segmentation

Z Zhang, Y Li, BS Shin - Computers in Biology and Medicine, 2023 - Elsevier
Medical image segmentation models often fail to generalize well when applied to new
datasets, hindering their usage in clinical practice. Existing random-convolution-based …