Efficientvit: Memory efficient vision transformer with cascaded group attention

X Liu, H Peng, N Zheng, Y Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Vision transformers have shown great success due to their high model capabilities.
However, their remarkable performance is accompanied by heavy computation costs, which …

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

CY Wang, A Bochkovskiy… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Real-time object detection is one of the most important research topics in computer vision.
As new approaches regarding architecture optimization and training optimization are …

FastViT: A fast hybrid vision transformer using structural reparameterization

PKA Vasu, J Gabriel, J Zhu, O Tuzel… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent amalgamation of transformer and convolutional designs has led to steady
improvements in accuracy and efficiency of the models. In this work, we introduce FastViT, a …

Adaptive frequency filters as efficient global token mixers

Z Huang, Z Zhang, C Lan, ZJ Zha… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent vision transformers, large-kernel CNNs and MLPs have attained remarkable
successes in broad vision tasks thanks to their effective information fusion in the global …

SeaFormer++: Squeeze-enhanced axial transformer for mobile visual recognition

Q Wan, Z Huang, J Lu, G Yu, L Zhang - International Journal of Computer …, 2025 - Springer
Since the introduction of Vision Transformers, the landscape of many computer vision tasks
(eg, semantic segmentation), which has been overwhelmingly dominated by CNNs, recently …

Make repvgg greater again: A quantization-aware approach

X Chu, L Li, B Zhang - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
The tradeoff between performance and inference speed is critical for practical applications.
Architecture reparameterization obtains better tradeoffs and it is becoming an increasingly …

Underwater target detection based on improved YOLOv7

K Liu, Q Sun, D Sun, L Peng, M Yang… - Journal of Marine Science …, 2023 - mdpi.com
Underwater target detection is a crucial aspect of ocean exploration. However, conventional
underwater target detection methods face several challenges such as inaccurate feature …

A cooperative vehicle-infrastructure system for road hazards detection with edge intelligence

C Chen, G Yao, L Liu, Q Pei, H Song… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Road hazards (RH) have always been the cause of many serious traffic accidents. These
have posed a threat to the safety of drivers, passengers, and pedestrians, and have also …

Repghost: A hardware-efficient ghost module via re-parameterization

C Chen, Z Guo, H Zeng, P **ong, J Dong - arxiv preprint arxiv:2211.06088, 2022 - arxiv.org
Feature reuse has been a key technique in light-weight convolutional neural networks
(CNNs) design. Current methods usually utilize a concatenation operator to keep large …

Efficientrep: An efficient Repvgg-style convnets with hardware-aware neural network design

K Weng, X Chu, X Xu, J Huang, X Wei - arxiv preprint arxiv:2302.00386, 2023 - arxiv.org
We present a hardware-efficient architecture of convolutional neural network, which has a
repvgg-like architecture. Flops or parameters are traditional metrics to evaluate the efficiency …