The multi-modal fusion in visual question answering: a review of attention mechanisms
Abstract Visual Question Answering (VQA) is a significant cross-disciplinary issue in the
fields of computer vision and natural language processing that requires a computer to output …
fields of computer vision and natural language processing that requires a computer to output …
A review of deep learning techniques for speech processing
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …
learning. The use of multiple processing layers has enabled the creation of models capable …
Obtaining genetics insights from deep learning via explainable artificial intelligence
Artificial intelligence (AI) models based on deep learning now represent the state of the art
for making functional predictions in genomics research. However, the underlying basis on …
for making functional predictions in genomics research. However, the underlying basis on …