[HTML][HTML] Recurrent neural networks: A comprehensive review of architectures, variants, and applications
Recurrent neural networks (RNNs) have significantly advanced the field of machine learning
(ML) by enabling the effective processing of sequential data. This paper provides a …
(ML) by enabling the effective processing of sequential data. This paper provides a …
A comprehensive survey on model compression and acceleration
In recent years, machine learning (ML) and deep learning (DL) have shown remarkable
improvement in computer vision, natural language processing, stock prediction, forecasting …
improvement in computer vision, natural language processing, stock prediction, forecasting …
R-drop: Regularized dropout for neural networks
Dropout is a powerful and widely used technique to regularize the training of deep neural
networks. Though effective and performing well, the randomness introduced by dropout …
networks. Though effective and performing well, the randomness introduced by dropout …
[HTML][HTML] Temporal fusion transformers for interpretable multi-horizon time series forecasting
Multi-horizon forecasting often contains a complex mix of inputs–including static (ie time-
invariant) covariates, known future inputs, and other exogenous time series that are only …
invariant) covariates, known future inputs, and other exogenous time series that are only …
[CITAT][C] An introduction to variational autoencoders
An Introduction to Variational Autoencoders Page 1 An Introduction to Variational Autoencoders
Page 2 Other titles in Foundations and Trends R in Machine Learning Computational Optimal …
Page 2 Other titles in Foundations and Trends R in Machine Learning Computational Optimal …
Deep equilibrium models
We present a new approach to modeling sequential data: the deep equilibrium model
(DEQ). Motivated by an observation that the hidden layers of many existing deep sequence …
(DEQ). Motivated by an observation that the hidden layers of many existing deep sequence …
Rat-sql: Relation-aware schema encoding and linking for text-to-sql parsers
When translating natural language questions into SQL queries to answer questions from a
database, contemporary semantic parsing models struggle to generalize to unseen …
database, contemporary semantic parsing models struggle to generalize to unseen …
Dropblock: A regularization method for convolutional networks
Deep neural networks often work well when they are over-parameterized and trained with a
massive amount of noise and regularization, such as weight decay and dropout. Although …
massive amount of noise and regularization, such as weight decay and dropout. Although …
From recognition to cognition: Visual commonsense reasoning
Visual understanding goes well beyond object recognition. With one glance at an image, we
can effortlessly imagine the world beyond the pixels: for instance, we can infer people's …
can effortlessly imagine the world beyond the pixels: for instance, we can infer people's …
Stacked bidirectional and unidirectional LSTM recurrent neural network for forecasting network-wide traffic state with missing values
Short-term traffic forecasting based on deep learning methods, especially recurrent neural
networks (RNN), has received much attention in recent years. However, the potential of RNN …
networks (RNN), has received much attention in recent years. However, the potential of RNN …