When object detection meets knowledge distillation: A survey

Z Li, P Xu, X Chang, L Yang, Y Zhang… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Object detection (OD) is a crucial computer vision task that has seen the development of
many algorithms and models over the years. While the performance of current OD models …

Q-vit: Accurate and fully quantized low-bit vision transformer

Y Li, S Xu, B Zhang, X Cao, P Gao… - Advances in neural …, 2022 - proceedings.neurips.cc
The large pre-trained vision transformers (ViTs) have demonstrated remarkable
performance on various visual tasks, but suffer from expensive computational and memory …

Q-detr: An efficient low-bit quantized detection transformer

S Xu, Y Li, M Lin, P Gao, G Guo… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent detection transformer (DETR) has advanced object detection, but its application
on resource-constrained devices requires massive computation and memory resources …

Resilient binary neural network

S Xu, Y Li, T Ma, M Lin, H Dong, B Zhang… - Proceedings of the …, 2023 - ojs.aaai.org
Binary neural networks (BNNs) have received ever-increasing popularity for their great
capability of reducing storage burden as well as quickening inference time. However, there …

DCP–NAS: Discrepant Child–Parent Neural Architecture Search for 1-bit CNNs

Y Li, S Xu, X Cao, L Zhuo, B Zhang, T Wang… - International Journal of …, 2023 - Springer
Neural architecture search (NAS) proves to be among the effective approaches for many
tasks by generating an application-adaptive neural architecture, which is still challenged by …

BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection

J Li, M Lu, J Liu, Y Guo, Y Du, L Du… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Recently, the Bird's-Eye-View (BEV) representation has gained increasing attention in multi-
view 3D object detection, demonstrating promising applications in autonomous driving …

Semantic RGB-D Image Synthesis

S Li, R Li, J Gall - Proceedings of the IEEE/CVF International …, 2023 - openaccess.thecvf.com
Collecting diverse sets of training images for RGB-D semantic image segmentation is not
always possible. In particular, when robots need to operate in privacy-sensitive areas like …

Bi-ViT: Pushing the Limit of Vision Transformer Quantization

Y Li, S Xu, M Lin, X Cao, C Liu, X Sun… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Vision transformers (ViTs) quantization offers a promising prospect to facilitate deploying
large pre-trained networks on resource-limited devices. Fully-binarized ViTs (Bi-ViT) that …

Binaryvit: Towards efficient and accurate binary vision transformers

J **ao, Z Li, J Li, L Yang, Q Gu - IEEE Transactions on Circuits …, 2024 - ieeexplore.ieee.org
Vision Transformers (ViTs) have emerged as the new fundamental architecture for most
computer vision fields. However, the considerable memory and computation costs also …

Heterogeneous Binary Pixel Difference Networks For Remote Sensing Object Detection

J Zhan, L Bai, J Zhang, T Liu, F Shi… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Recent research in remote-sensing object detection (RSOD) has significantly advanced the
development of vision foundation models. However, deploying these models on resource …