Neural machine translation: A review
F Stahlberg - Journal of Artificial Intelligence Research, 2020 - jair.org
The field of machine translation (MT), the automatic translation of written text from one
natural language into another, has experienced a major paradigm shift in recent years …
natural language into another, has experienced a major paradigm shift in recent years …
Complete dictionary recovery over the sphere I: Overview and the geometric picture
We consider the problem of recovering a complete (ie, square and invertible) matrix A 0,
from Y∈ R n× p with Y= A 0 X 0, provided X 0 is sufficiently sparse. This recovery problem is …
from Y∈ R n× p with Y= A 0 X 0, provided X 0 is sufficiently sparse. This recovery problem is …
A survey on vision transformer
Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …
network mainly based on the self-attention mechanism. Thanks to its strong representation …
A survey on visual transformer
Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …
network mainly based on the self-attention mechanism. Thanks to its strong representation …
Optimizing federated learning in distributed industrial IoT: A multi-agent approach
In this paper, we aim to make the best joint decision of device selection and computing and
spectrum resource allocation for optimizing federated learning (FL) performance in …
spectrum resource allocation for optimizing federated learning (FL) performance in …
How neural networks extrapolate: From feedforward to graph neural networks
We study how neural networks trained by gradient descent extrapolate, ie, what they learn
outside the support of the training distribution. Previous works report mixed empirical results …
outside the support of the training distribution. Previous works report mixed empirical results …
Learning and generalization in overparameterized neural networks, going beyond two layers
Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers
Page 1 Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two …
Page 1 Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two …
PPINN: Parareal physics-informed neural network for time-dependent PDEs
Physics-informed neural networks (PINNs) encode physical conservation laws and prior
physical knowledge into the neural networks, ensuring the correct physics is represented …
physical knowledge into the neural networks, ensuring the correct physics is represented …
Learning overparameterized neural networks via stochastic gradient descent on structured data
Neural networks have many successful applications, while much less theoretical
understanding has been gained. Towards bridging this gap, we study the problem of …
understanding has been gained. Towards bridging this gap, we study the problem of …
Secureml: A system for scalable privacy-preserving machine learning
Machine learning is widely used in practice to produce predictive models for applications
such as image processing, speech and text recognition. These models are more accurate …
such as image processing, speech and text recognition. These models are more accurate …