Google Scholar

Multilevel feature representation for hybrid transformers-based emotion recognition

Enhanced human motion detection with hybrid RDA-WOA-based RNN and multiple hypothesis tracking for occlusion handling

JN Cheltha, C Sharma, D Prashar, AA Khan… - Image and Vision …, 2024 - Elsevier

Human motion detection in complex scenarios poses challenges due to occlusions. This
paper presents an integrated approach for accurate human motion detections by combining …

Cited by 2

An improved anchor-free object detection method applied in complex scenes based on SDA-DLA34

K Sun, Y Zhen, B Zhang, Z Song - Multimedia Tools and Applications, 2024 - Springer

The anchor-free object detection CenterNet has the problems that the utilization rate of
detected object features is low, which is difficult to detect morphological changes and …

Cited by 3

SMTDKD: A Semantic-Aware Multimodal Transformer Fusion Decoupled Knowledge Distillation Method for Action Recognition

Z Quan, Q Chen, W Wang, M Zhang, X Li… - IEEE Sensors …, 2023 - ieeexplore.ieee.org

Multimodal sensors, including vision sensors and wearable sensors, offer valuable
complementary information for accurate recognition tasks. Nonetheless, the heterogeneity …

Cited by 5

GCD-JFSE: Graph-based class-domain knowledge joint feature selection and ensemble learning for EEG-based emotion recognition

G Luo, Y Han, W Xie, F Tian, L Zhu, K Qian, X Li… - Knowledge-Based …, 2025 - Elsevier

Feature selection has demonstrated strong performance in emotion recognition using
intrasubject electroencephalography (EEG) data. However, it faces challenges due to …

Musical instrument classifier for early childhood percussion instruments

B Rufino, A Khan, T Dutta, E Biddiss - Plos one, 2024 - journals.plos.org

While the musical instrument classification task is well-studied, there remains a gap in
identifying non-pitched percussion instruments which have greater overlaps in frequency …

Cited by 2

Classification and study of music genres with multimodal Spectro-Lyrical Embeddings for Music (SLEM)

A Mehra, A Mehra, P Narang - Multimedia Tools and Applications, 2024 - Springer

The essence of music is inherently multi-modal–with audio and lyrics going hand in hand.
However, there is very less research done to study the intricacies of the multi-modal nature …

Cited by 3

Action knowledge graph for violence detection using audiovisual features

M Khan, M Saad, A Khan, W Gueaieb… - 2024 IEEE …, 2024 - ieeexplore.ieee.org

Detecting violent content in video frames is a crucial aspect of violence detection.
Combining visual and audio cues is often the most effective way to identify violent behavior …

Cited by 2

Multi-Head Attention-Enhanced Speech Recognition for Reduced Data Requirements

Y Li, Y Zhou, Z Qiu, Y Wang, J Wang, G Huang - Electronics, 2024 - mdpi.com

Automatic speech recognition (ASR) technology has reached a mature level, and improving
performance in data-scarce scenarios has become a key research focus. In this study, we …

Facial Biosignals Time–Series Dataset (FBioT): A Visual–Temporal Facial Expression Recognition (VT-FER) Approach

JMS Souza, CSM Alves, JJF Cerqueira, WLA Oliveira… - Electronics, 2024 - mdpi.com

Visual biosignals can be used to analyze human behavioral activities and serve as a
primary resource for Facial Expression Recognition (FER). FER computational systems face …

Post-Stroke Dysarthria Voice Recognition based on Fusion Feature MSA and 1D

Y Wujian, Z Yingcong, C Yuehai, L Yijun… - Computer Methods in …, 2024 - Taylor & Francis

Post-stroke Dysarthria (PSD) is one of the common sequelae of stroke. PSD can harm
patients' quality of life and, in severe cases, be life-threatening. Most of the existing methods …
