Unireplknet: A universal perception large-kernel convnet for audio video point cloud time-series and image recognition

X Ding, Y Zhang, Y Ge, S Zhao… - Proceedings of the …, 2024 - openaccess.thecvf.com
Large-kernel convolutional neural networks (ConvNets) have recently received extensive
research attention but two unresolved and critical issues demand further investigation. 1) …

PIXART-: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

J Chen, C Ge, E ** Transformer and mixer MLP
S Liu, L Wang, W Yue - Applied Soft Computing, 2024 - Elsevier
In recent years, medical image classification techniques based on deep learning have made
remarkable achievements, but most of the current models sacrifice the efficiency of the …

Cascade residual multiscale convolution and mamba-structured unet for advanced brain tumor image segmentation

R Zhou, J Wang, G ** cascade attention for medical image classification
S Liu, W Yue, Z Guo, L Wang - Scientific Reports, 2024 - nature.com
Abstract Visual Transformers (ViT) have made remarkable achievements in the field of
medical image analysis. However, ViT-based methods have poor classification results on …

Multimodal pathway: Improve transformers with irrelevant data from other modalities

Y Zhang, X Ding, K Gong, Y Ge… - Proceedings of the …, 2024 - openaccess.thecvf.com
We propose to improve transformers of a specific modality with irrelevant data from other
modalities eg improve an ImageNet model with audio or point cloud datasets. We would like …

Multi-Scale Group Agent Attention-based Graph Convolutional Decoding Networks for 2D Medical Image Segmentation

Z Wang, L Guo, S Zhao, S Zhang… - IEEE Journal of …, 2024 - ieeexplore.ieee.org
Automated medical image segmentation plays a crucial role in assisting doctors in
diagnosing diseases. Feature decoding is a critical yet challenging issue for medical image …

WoodGLNet: A multi-scale network integrating global and local information for real-time classification of wood images

Z Zheng, Z Ge, Z Tian, X Yang, Y Zhou - Journal of Real-Time Image …, 2024 - Springer
Current research on image classification has combined convolutional neural networks
(CNNs) and transformers to introduce inductive biases to the model, enhancing its ability to …

Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations

Y Zhang, X Ding, X Yue - arxiv preprint arxiv:2410.08049, 2024 - arxiv.org
This paper proposes the paradigm of large convolutional kernels in designing modern
Convolutional Neural Networks (ConvNets). We establish that employing a few large …

[HTML][HTML] Spatiotemporal predictive learning for radar-based precipitation nowcasting

X Wang, H Zhao, G Zhang, Q Guan, Y Zhu - Atmosphere, 2024 - mdpi.com
Based on C-band weather radar and ground precipitation data from the Helan Mountain
area in Yinchuan between 2017 to 2020, we evaluated the forecasting performances of 15 …