Deep hybrid architectures and DenseNet35 in speaker-dependent visual speech recognition

PJ Seegehalli, BN Krupa - Signal, Image and Video Processing, 2024 - Springer
Visual speech recognition (VSR) translates the visual speech cues into transcription.
Speaker-dependent VSR (SD-VSR) can be used for authentication and secure human …

Bengali Word Detection from Lip Movements Using Mask RCNN and Generalized Linear Model

AB Bhuiyan, J Uddin - Indonesian Journal of Electrical …, 2024 - section.iaesonline.com
Speech processing with the help of lip detection and lip reading is an advancing field. For
this, we need proper algorithms and techniques to detect lips and movements of lips …