Attention, please! A survey of neural attention models in deep learning
In humans, attention is a core property of all perceptual and cognitive operations. Given our
limited ability to process competing sources, attention mechanisms select, modulate, and …
Attention-based interpretable neural network for building cooling load prediction
Machine learning has gained increasing popularity in building energy management
due to its powerful capability and flexibility in model development as well as the rich data …
Video captioning by adversarial LSTM
In this paper, we propose a novel approach to video captioning based on adversarial
learning and long short-term memory (LSTM). With this solution concept, we aim at …
Hierarchically structured reinforcement learning for topically coherent visual story generation
We propose a hierarchically structured reinforcement learning approach to address the
challenges of planning for generating coherent multi-sentence stories for the visual …
Violin: A large-scale dataset for video-and-language inference
We introduce a new task, Video-and-Language Inference, for joint multimodal
understanding of video and text. Given a video clip with aligned subtitles as premise, paired …
Video captioning: a comparative review of where we are and which could be the route
Video captioning is the process of describing the content of a sequence of images capturing
its semantic relationships and meanings. Dealing with this task with a single image is …
Denoising-based multiscale feature fusion for remote sensing image captioning
With the benefits from deep learning technology, generating captions for remote sensing
images has become achievable, and great progress has been made in this field in the …
Adaptive hierarchical graph reasoning with semantic coherence for video-and-language inference
Video-and-Language Inference is a recently proposed task for joint video-and-language
understanding. This new task requires a model to draw inference on whether a …
Study on key factors affecting the high-order building model order reduction for model predictive control application
Q Chen, N Li - Energy and Buildings, 2023 - Elsevier
The reduced-order model (ROM) can highly reduce computation costs while maintaining
high-fidelity performance for model predictive control's application by applying the …
SCA-Net: A spatial and channel attention network for medical image segmentation
T Shan, J Yan - IEEE Access, 2021 - ieeexplore.ieee.org
Automatic medical image segmentation is a critical tool for medical image analysis and
disease treatment. In recent years, convolutional neural networks (CNNs) have played an …