Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Learning in audio-visual context: A review, analysis, and new perspective
Sight and hearing are two senses that play a vital role in human communication and scene
understanding. To mimic human perception ability, audio-visual learning, aimed at …
understanding. To mimic human perception ability, audio-visual learning, aimed at …
Multimodal fusion on low-quality data: A comprehensive survey
Multimodal fusion focuses on integrating information from multiple modalities with the goal of
more accurate prediction, which has achieved remarkable progress in a wide range of …
more accurate prediction, which has achieved remarkable progress in a wide range of …
Dynamic neural networks: A survey
Dynamic neural network is an emerging research topic in deep learning. Compared to static
models which have fixed computational graphs and parameters at the inference stage …
models which have fixed computational graphs and parameters at the inference stage …
Multimodal dynamics: Dynamical fusion for trustworthy multimodal classification
Integration of heterogeneous and high-dimensional data (eg, multiomics) is becoming
increasingly important. Existing multimodal classification algorithms mainly focus on …
increasingly important. Existing multimodal classification algorithms mainly focus on …
Modality competition: What makes joint training of multi-modal network fail in deep learning?(provably)
Despite the remarkable success of deep multi-modal learning in practice, it has not been
well-explained in theory. Recently, it has been observed that the best uni-modal network …
well-explained in theory. Recently, it has been observed that the best uni-modal network …
On uni-modal feature learning in supervised multi-modal learning
We abstract the features (ie learned representations) of multi-modal data into 1) uni-modal
features, which can be learned from uni-modal training, and 2) paired features, which can …
features, which can be learned from uni-modal training, and 2) paired features, which can …
Dynamic multimodal fusion
Deep multimodal learning has achieved great progress in recent years. However, current
fusion approaches are static in nature, ie, they process and fuse multimodal inputs with …
fusion approaches are static in nature, ie, they process and fuse multimodal inputs with …
Skeleton graph-neural-network-based human action recognition: A survey
M Feng, J Meunier - Sensors, 2022 - mdpi.com
Human action recognition has been applied in many fields, such as video surveillance and
human computer interaction, where it helps to improve performance. Numerous reviews of …
human computer interaction, where it helps to improve performance. Numerous reviews of …
Efficient deep visual and inertial odometry with adaptive visual modality selection
In recent years, deep learning-based approaches for visual-inertial odometry (VIO) have
shown remarkable performance outperforming traditional geometric methods. Yet, all …
shown remarkable performance outperforming traditional geometric methods. Yet, all …
Curriculum-listener: Consistency-and complementarity-aware audio-enhanced temporal sentence grounding
Temporal Sentence Grounding aims to retrieve a video moment given a natural language
query. Most existing literature merely focuses on visual information in videos without …
query. Most existing literature merely focuses on visual information in videos without …