Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Fixation prediction through multimodal analysis
In this article, we propose to predict human eye fixation through incorporating both audio
and visual cues. Traditional visual attention models generally make the utmost of stimuli's …
and visual cues. Traditional visual attention models generally make the utmost of stimuli's …
Fusion of magnetic and visual sensors for indoor localization: Infrastructure-free and more effective
Accurate and infrastructure-free indoor positioning can be very useful in a variety of
applications. However, most existing approaches (eg, WiFi and infrared-based methods) for …
applications. However, most existing approaches (eg, WiFi and infrared-based methods) for …
Look&listen: Multi-modal correlation learning for active speaker detection and speech enhancement
Active speaker detection and speech enhancement have become two increasingly attractive
topics in audio-visual scenario understanding. According to their respective characteristics …
topics in audio-visual scenario understanding. According to their respective characteristics …
Auxiliary classifier generative adversarial network with soft labels in imbalanced acoustic event detection
In acoustic event detection, the training data size of some acoustic events is often small and
imbalanced. To deal with this, this paper proposes generating the virtual training data …
imbalanced. To deal with this, this paper proposes generating the virtual training data …
Enhancement in speaker recognition for optimized speech features using GMM, SVM and 1-D CNN
S Nainan, V Kulkarni - International Journal of Speech Technology, 2021 - Springer
Contemporary automatic speaker recognition (ASR) systems do not provide 100% accuracy
making it imperative to explore different techniques to improve it. Easy access to mobile …
making it imperative to explore different techniques to improve it. Easy access to mobile …
Introduction of SVM algorithms and recent applications about fault diagnosis and other aspects
Support vector machine has obtained more and more attentions as a new method of
machine learning based on the statistic learning theory. At the same time, there are …
machine learning based on the statistic learning theory. At the same time, there are …
Multimodal multi-channel on-line speaker diarization using sensor fusion through SVM
Speaker diarization (SD) is the process of assigning speech segments of an audio stream to
its corresponding speakers, thus comprising the problem of voice activity detection (VAD) …
its corresponding speakers, thus comprising the problem of voice activity detection (VAD) …
Sound source localization in wide-range outdoor environment using distributed sensor network
Sound source localization has always been one of the most challenging subjects in different
fields of engineering, one of the most important of which being tracking of flying objects. This …
fields of engineering, one of the most important of which being tracking of flying objects. This …
Audio-Visual Speaker Diarization: Current Databases, Approaches and Challenges
Nowadays, the large amount of audio-visual content available has fostered the need to
develop new robust automatic speaker diarization systems to analyse and characterise it …
develop new robust automatic speaker diarization systems to analyse and characterise it …
Multimodal fusion refiner networks
Tasks that rely on multi-modal information typically include a fusion module that combines
information from different modalities. In this work, we develop a Refiner Fusion Network …
information from different modalities. In this work, we develop a Refiner Fusion Network …