Image recognition algorithm based on artificial intelligence

H Chen, L Geng, H Zhao, C Zhao, A Liu - Neural Computing and …, 2022 - Springer
Convolutional neural networks also encountered some problems in the development of
image recognition. The most prominent problem is that it is costly and time-consuming to …

Sibs: A sparse encoder utilizing self-inspired bases for efficient image representation

AN Omara, MA Hebaishy, MS Abdallah… - Knowledge-Based Systems, 2024 - Elsevier
Addressing the limitations of pre-defined dictionaries in image processing, this study
introduces Self-Inspired Bases-based Sparse Encoder (SIBS), a novel approach that …

Exploiting low-dimensional structures to enhance dnn based acoustic modeling in speech recognition

P Dighe, G Luyet, A Asaei… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
We propose to model the acoustic space of deep neural network (DNN) class-conditional
posterior probabilities as a union of low-dimensional subspaces. To that end, the training …

Sparse subspace modeling for query by example spoken term detection

D Ram, A Asaei, H Bourlard - IEEE/ACM Transactions on Audio …, 2018 - ieeexplore.ieee.org
This paper focuses on the problem of query by example spoken term detection (QbE-STD) in
zero-resource scenario. Current state-of-the-art approaches to tackle this problem rely on …

What can phone attractors in RPS tell us? A study of dynamic information in speech signals for phone classification purposes

Y Shekofteh - Applied Acoustics, 2023 - Elsevier
The speech production system is time-varying, multidimensional, and nonlinear. Most
techniques for spoken feature extraction (SFE), which are tools for extracting information …

Reliable recovery of hierarchically sparse signals for Gaussian and Kronecker product measurements

I Roth, M Kliesch, A Flinth, G Wunder… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
We propose and analyze a solution to the problem of recovering a block sparse signal with
sparse blocks from linear measurements. Such problems naturally emerge inter alia in the …

Subspace detection of DNN posterior probabilities via sparse representation for query by example spoken term detection

D Ram, A Asaei, H Bourlard - 2016 - infoscience.epfl.ch
We cast the query by example spoken term detection (QbE-STD) problem as subspace
detection where query and background subspaces are modeled as union of low …

Phonetic subspace features for improved query by example spoken term detection

D Ram, A Asaei, H Bourlard - Speech Communication, 2018 - Elsevier
This paper addresses the problem of detecting speech utterances from a large audio archive
using a simple spoken query, hence referring to this problem as “Query by Example Spoken …

On quantifying the quality of acoustic models in hybrid DNN-HMM ASR

P Dighe, A Asaei, H Bourlard - Speech Communication, 2020 - Elsevier
We propose an information theoretic framework for quantitative assessment of acoustic
models used in hidden Markov model (HMM) based automatic speech recognition (ASR) …

[PDF][PDF] Sparse modeling of posterior exemplars for keyword detection

D Ram, A Asaei, P Dighe… - Proceedings of …, 2015 - infoscience.epfl.ch
Sparse representation has been shown to be a powerful modeling framework for
classification and detection tasks. In this paper, we propose a new keyword detection …