Google Académico

[HTML][HTML] Deep learning and transfer learning for device-free human activity recognition: A survey

J Yang, Y Xu, H Cao, H Zou, L **e - Journal of Automation and Intelligence, 2022 - Elsevier

Device-free activity recognition plays a crucial role in smart building, security, and human–
computer interaction, which shows its strength in its convenience and cost-efficiency …

Guardar Citar Citado por 44 Artículos relacionados

[Free GPT-4]

[PDF] oup.com Full View

A survey of methods for brain tumor segmentation-based MRI images

YMA Mohammed, S El Garouani… - … of Computational Design …, 2023 - academic.oup.com

Brain imaging techniques play an important role in determining the causes of brain cell
injury. Therefore, earlier diagnosis of these diseases can be led to give rise to bring huge …

Guardar Citar Citado por 36 Artículos relacionados Las 5 versiones

[Free GPT-4]

[PDF] thecvf.com

Bevt: Bert pretraining of video transformers

R Wang, D Chen, Z Wu, Y Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com

This paper studies the BERT pretraining of video transformers. It is a straightforward but
worth-studying extension given the recent success from BERT pretraining of image …

Guardar Citar Citado por 257 Artículos relacionados Las 6 versiones Versión en HTML

[Free GPT-4]

[PDF] thecvf.com

X3d: Expanding architectures for efficient video recognition

C Feichtenhofer - Proceedings of the IEEE/CVF conference …, 2020 - openaccess.thecvf.com

This paper presents X3D, a family of efficient video networks that progressively expand a
tiny 2D image classification architecture along multiple network axes, in space, time, width …

Guardar Citar Citado por 1270 Artículos relacionados Las 7 versiones Versión en HTML

[Free GPT-4]

[PDF] thecvf.com

Vidtr: Video transformer without convolutions

Y Zhang, X Li, C Liu, B Shuai, Y Zhu… - Proceedings of the …, 2021 - openaccess.thecvf.com

Abstract We introduce Video Transformer (VidTr) with separable-attention for video
classification. Comparing with commonly used 3D networks, VidTr is able to aggregate …

Guardar Citar Citado por 213 Artículos relacionados Las 11 versiones Versión en HTML

[Free GPT-4]

[PDF] thecvf.com

Refining activation downsampling with SoftPool

A Stergiou, R Poppe… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Abstract Convolutional Neural Networks (CNNs) use pooling to decrease the size of
activation maps. This process is crucial to increase the receptive fields and to reduce …

Guardar Citar Citado por 271 Artículos relacionados Las 9 versiones Versión en HTML

[Free GPT-4]

[PDF] thecvf.com

Video classification with channel-separated convolutional networks

D Tran, H Wang, L Torresani… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

Group convolution has been shown to offer great computational savings in various 2D
convolutional architectures for image classification. It is natural to ask: 1) if group convolution …

Guardar Citar Citado por 755 Artículos relacionados Las 6 versiones Versión en HTML

[Free GPT-4]

[PDF] thecvf.com

Drop an octave: Reducing spatial redundancy in convolutional neural networks with octave convolution

Y Chen, H Fan, B Xu, Z Yan… - Proceedings of the …, 2019 - openaccess.thecvf.com

In natural images, information is conveyed at different frequencies where higher frequencies
are usually encoded with fine details and lower frequencies are usually encoded with global …

Guardar Citar Citado por 766 Artículos relacionados Las 8 versiones Versión en HTML

[Free GPT-4]

[PDF] arxiv.org

Video understanding with large language models: A survey

Y Tang, J Bi, S Xu, L Song, S Liang, T Wang… - arxiv preprint arxiv …, 2023 - arxiv.org

With the burgeoning growth of online video platforms and the escalating volume of video
content, the demand for proficient video understanding tools has intensified markedly. Given …

Guardar Citar Citado por 60 Artículos relacionados Las 2 versiones Versión en HTML

[Free GPT-4]

[PDF] neurips.cc

A^ 2-nets: Double attention networks

Y Chen, Y Kalantidis, J Li, S Yan… - Advances in neural …, 2018 - proceedings.neurips.cc

Learning to capture long-range relations is fundamental to image/video recognition. Existing
CNN models generally rely on increasing depth to model such relations which is highly …

Guardar Citar Citado por 709 Artículos relacionados Las 8 versiones Versión en HTML

Crear alerta

Citar

Búsqueda avanzada

Guardado en Mi biblioteca

Multi-fiber networks for video recognition

[HTML][HTML] Deep learning and transfer learning for device-free human activity recognition: A survey

A survey of methods for brain tumor segmentation-based MRI images

Bevt: Bert pretraining of video transformers

X3d: Expanding architectures for efficient video recognition

Vidtr: Video transformer without convolutions

Refining activation downsampling with SoftPool

Video classification with channel-separated convolutional networks

Drop an octave: Reducing spatial redundancy in convolutional neural networks with octave convolution

Video understanding with large language models: A survey

A^ 2-nets: Double attention networks