A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …
massive model sizes that require significant computational and storage resources. To …
[HTML][HTML] Review of image classification algorithms based on convolutional neural networks
L Chen, S Li, Q Bai, J Yang, S Jiang, Y Miao - Remote Sensing, 2021 - mdpi.com
Image classification has always been a hot research direction in the world, and the
emergence of deep learning has promoted the development of this field. Convolutional …
emergence of deep learning has promoted the development of this field. Convolutional …
Infogcn: Representation learning for human skeleton-based action recognition
Human skeleton-based action recognition offers a valuable means to understand the
intricacies of human behavior because it can handle the complex relationships between …
intricacies of human behavior because it can handle the complex relationships between …
Birth of a transformer: A memory viewpoint
Large language models based on transformers have achieved great empirical successes.
However, as they are deployed more widely, there is a growing need to better understand …
However, as they are deployed more widely, there is a growing need to better understand …
[HTML][HTML] Pre-trained models: Past, present and future
Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved
great success and become a milestone in the field of artificial intelligence (AI). Owing to …
great success and become a milestone in the field of artificial intelligence (AI). Owing to …
Single-source domain expansion network for cross-scene hyperspectral image classification
Currently, cross-scene hyperspectral image (HSI) classification has drawn increasing
attention. It is necessary to train a model only on source domain (SD) and directly …
attention. It is necessary to train a model only on source domain (SD) and directly …
[HTML][HTML] Embracing change: Continual learning in deep neural networks
Artificial intelligence research has seen enormous progress over the past few decades, but it
predominantly relies on fixed datasets and stationary environments. Continual learning is an …
predominantly relies on fixed datasets and stationary environments. Continual learning is an …
Wavegrad: Estimating gradients for waveform generation
This paper introduces WaveGrad, a conditional model for waveform generation which
estimates gradients of the data density. The model is built on prior work on score matching …
estimates gradients of the data density. The model is built on prior work on score matching …
Is quantum advantage the right goal for quantum machine learning?
Machine learning is frequently listed among the most promising applications for quantum
computing. This is in fact a curious choice: the machine-learning algorithms of today are …
computing. This is in fact a curious choice: the machine-learning algorithms of today are …
[KSIĄŻKA][B] The principles of deep learning theory
This textbook establishes a theoretical framework for understanding deep learning models
of practical relevance. With an approach that borrows from theoretical physics, Roberts and …
of practical relevance. With an approach that borrows from theoretical physics, Roberts and …