[HTML][HTML] Deep learning and transfer learning for device-free human activity recognition: A survey

J Yang, Y Xu, H Cao, H Zou, L **e - Journal of Automation and Intelligence, 2022 - Elsevier
Device-free activity recognition plays a crucial role in smart building, security, and human–
computer interaction, which shows its strength in its convenience and cost-efficiency …

A survey of methods for brain tumor segmentation-based MRI images

YMA Mohammed, S El Garouani… - … of Computational Design …, 2023 - academic.oup.com
Brain imaging techniques play an important role in determining the causes of brain cell
injury. Therefore, earlier diagnosis of these diseases can be led to give rise to bring huge …

Bevt: Bert pretraining of video transformers

R Wang, D Chen, Z Wu, Y Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper studies the BERT pretraining of video transformers. It is a straightforward but
worth-studying extension given the recent success from BERT pretraining of image …

X3d: Expanding architectures for efficient video recognition

C Feichtenhofer - Proceedings of the IEEE/CVF conference …, 2020 - openaccess.thecvf.com
This paper presents X3D, a family of efficient video networks that progressively expand a
tiny 2D image classification architecture along multiple network axes, in space, time, width …

Vidtr: Video transformer without convolutions

Y Zhang, X Li, C Liu, B Shuai, Y Zhu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract We introduce Video Transformer (VidTr) with separable-attention for video
classification. Comparing with commonly used 3D networks, VidTr is able to aggregate …

Refining activation downsampling with SoftPool

A Stergiou, R Poppe… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Abstract Convolutional Neural Networks (CNNs) use pooling to decrease the size of
activation maps. This process is crucial to increase the receptive fields and to reduce …

Video classification with channel-separated convolutional networks

D Tran, H Wang, L Torresani… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Group convolution has been shown to offer great computational savings in various 2D
convolutional architectures for image classification. It is natural to ask: 1) if group convolution …

Drop an octave: Reducing spatial redundancy in convolutional neural networks with octave convolution

Y Chen, H Fan, B Xu, Z Yan… - Proceedings of the …, 2019 - openaccess.thecvf.com
In natural images, information is conveyed at different frequencies where higher frequencies
are usually encoded with fine details and lower frequencies are usually encoded with global …

Video understanding with large language models: A survey

Y Tang, J Bi, S Xu, L Song, S Liang, T Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
With the burgeoning growth of online video platforms and the escalating volume of video
content, the demand for proficient video understanding tools has intensified markedly. Given …

A^ 2-nets: Double attention networks

Y Chen, Y Kalantidis, J Li, S Yan… - Advances in neural …, 2018 - proceedings.neurips.cc
Learning to capture long-range relations is fundamental to image/video recognition. Existing
CNN models generally rely on increasing depth to model such relations which is highly …