[HTML][HTML] Transparency of deep neural networks for medical image analysis: A review of interpretability methods

Z Salahuddin, HC Woodruff, A Chatterjee… - Computers in biology and …, 2022 - Elsevier
Artificial Intelligence (AI) has emerged as a useful aid in numerous clinical applications for
diagnosis and treatment decisions. Deep neural networks have shown the same or better …

[HTML][HTML] Perturbation-based methods for explaining deep neural networks: A survey

M Ivanovs, R Kadikis, K Ozols - Pattern Recognition Letters, 2021 - Elsevier
Deep neural networks (DNNs) have achieved state-of-the-art results in a broad range of
tasks, in particular the ones dealing with the perceptual data. However, full-scale application …

Automatic translation of sign language with multi-stream 3D CNN and generation of artificial depth maps

GZ De Castro, RR Guerra, FG Guimarães - Expert Systems with …, 2023 - Elsevier
Sign languages play an essential role in the cognitive and social development of the deaf,
consisting of a natural form of communication and being a symbol of identity and culture …

Visual interpretability in 3D brain tumor segmentation network

H Saleem, AR Shahid, B Raza - Computers in Biology and Medicine, 2021 - Elsevier
Medical image segmentation is a complex yet one of the most essential tasks for diagnostic
procedures such as brain tumor detection. Several 3D Convolutional Neural Network (CNN) …

Ct-net: Channel tensorization network for video classification

K Li, X Li, Y Wang, J Wang, Y Qiao - arxiv preprint arxiv:2106.01603, 2021 - arxiv.org
3D convolution is powerful for video classification but often computationally expensive,
recent studies mainly focus on decomposing it on spatial-temporal and/or channel …

Spatial-temporal interleaved network for efficient action recognition

S Jiang, H Zhang, Y Qi, Q Liu - IEEE Transactions on Industrial …, 2024 - ieeexplore.ieee.org
The decomposition of 3D convolution will considerably reduce the computing complexity of
3D convolutional neural networks, yet simple stacking restricts the performance of neural …

Sign language recognition based on R (2+ 1) D with spatial–temporal–channel attention

X Han, F Lu, J Yin, G Tian, J Liu - IEEE Transactions on Human …, 2022 - ieeexplore.ieee.org
Previous work utilized three-dimensional (3-D) convolutional neural networks (CNNs)
tomodel the spatial appearance and temporal evolution concurrently for sign language …

Visually explaining 3D-CNN predictions for video classification with an adaptive occlusion sensitivity analysis

T Uchiyama, N Sogi, K Niinuma… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper proposes a method for visually explaining the decision-making process of 3D
convolutional neural networks (CNN) with a temporal extension of occlusion sensitivity …

Gate-shift-fuse for video action recognition

S Sudhakaran, S Escalera… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Convolutional Neural Networks are the de facto models for image recognition. However 3D
CNNs, the straight forward extension of 2D CNNs for video recognition, have not achieved …

Multitask learning to improve egocentric action recognition

G Kapidis, R Poppe, E Van Dam… - Proceedings of the …, 2019 - openaccess.thecvf.com
In this work we employ multitask learning to capitalize on the structure that exists in related
supervised tasks to train complex neural networks. It allows training a network for multiple …