[HTML][HTML] Recurrent neural networks: A comprehensive review of architectures, variants, and applications

ID Mienye, TG Swart, G Obaido - Information, 2024 - mdpi.com
Recurrent neural networks (RNNs) have significantly advanced the field of machine learning
(ML) by enabling the effective processing of sequential data. This paper provides a …

A comprehensive survey on model compression and acceleration

T Choudhary, V Mishra, A Goswami… - Artificial Intelligence …, 2020 - Springer
In recent years, machine learning (ML) and deep learning (DL) have shown remarkable
improvement in computer vision, natural language processing, stock prediction, forecasting …

R-drop: Regularized dropout for neural networks

L Wu, J Li, Y Wang, Q Meng, T Qin… - Advances in …, 2021 - proceedings.neurips.cc
Dropout is a powerful and widely used technique to regularize the training of deep neural
networks. Though effective and performing well, the randomness introduced by dropout …

[HTML][HTML] Temporal fusion transformers for interpretable multi-horizon time series forecasting

B Lim, SÖ Arık, N Loeff, T Pfister - International Journal of Forecasting, 2021 - Elsevier
Multi-horizon forecasting often contains a complex mix of inputs–including static (ie time-
invariant) covariates, known future inputs, and other exogenous time series that are only …

[CITAT][C] An introduction to variational autoencoders

DP Kingma, M Welling - Foundations and Trends® in …, 2019 - nowpublishers.com
An Introduction to Variational Autoencoders Page 1 An Introduction to Variational Autoencoders
Page 2 Other titles in Foundations and Trends R in Machine Learning Computational Optimal …

Deep equilibrium models

S Bai, JZ Kolter, V Koltun - Advances in neural information …, 2019 - proceedings.neurips.cc
We present a new approach to modeling sequential data: the deep equilibrium model
(DEQ). Motivated by an observation that the hidden layers of many existing deep sequence …

Rat-sql: Relation-aware schema encoding and linking for text-to-sql parsers

B Wang, R Shin, X Liu, O Polozov… - arxiv preprint arxiv …, 2019 - arxiv.org
When translating natural language questions into SQL queries to answer questions from a
database, contemporary semantic parsing models struggle to generalize to unseen …

Dropblock: A regularization method for convolutional networks

G Ghiasi, TY Lin, QV Le - Advances in neural information …, 2018 - proceedings.neurips.cc
Deep neural networks often work well when they are over-parameterized and trained with a
massive amount of noise and regularization, such as weight decay and dropout. Although …

From recognition to cognition: Visual commonsense reasoning

R Zellers, Y Bisk, A Farhadi… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Visual understanding goes well beyond object recognition. With one glance at an image, we
can effortlessly imagine the world beyond the pixels: for instance, we can infer people's …

Stacked bidirectional and unidirectional LSTM recurrent neural network for forecasting network-wide traffic state with missing values

Z Cui, R Ke, Z Pu, Y Wang - Transportation Research Part C: Emerging …, 2020 - Elsevier
Short-term traffic forecasting based on deep learning methods, especially recurrent neural
networks (RNN), has received much attention in recent years. However, the potential of RNN …