- Academic Search

[HTML][HTML] Deep learning for object detection and scene perception in self-driving cars: Survey, challenges, and open issues

A Gupta, A Anpalagan, L Guan, AS Khwaja - Array, 2021 - Elsevier

This article presents a comprehensive survey of deep learning applications for object
detection and scene perception in autonomous vehicles. Unlike existing review papers, we …

保存引用被引用数: 548 関連記事全 2 バージョン

3D Human Action Recognition: Through the eyes of researchers

A Sarkar, A Banerjee, PK Singh, R Sarkar - Expert Systems with …, 2022 - Elsevier

Abstract Human Action Recognition (HAR) has remained one of the most challenging tasks
in computer vision. With the surge in data-driven methodologies, the depth modality has …

保存引用被引用数: 45 関連記事全 2 バージョン

[Free GPT-4]

[PDF] springer.com

A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets

K Bayoudh, R Knani, F Hamdaoui, A Mtibaa - The Visual Computer, 2022 - Springer

The research progress in multimodal learning has grown rapidly over the last decade in
several areas, especially in computer vision. The growing potential of multimodal data …

保存引用被引用数: 356 関連記事全 7 バージョン

[Free GPT-4]

[PDF] ieee.org

Object detection recognition and robot gras** based on machine learning: A survey

Q Bai, S Li, J Yang, Q Song, Z Li, X Zhang - IEEE access, 2020 - ieeexplore.ieee.org

With the rapid development of machine learning, its powerful function in the machine vision
field is increasingly reflected. The combination of machine vision and robotics to achieve the …

保存引用被引用数: 137 関連記事全 3 バージョン

[Free GPT-4]

[PDF] port.ac.uk

Gesture recognition based on multi‐modal feature weight

H Duan, Y Sun, W Cheng, D Jiang… - Concurrency and …, 2021 - Wiley Online Library

With the continuous development of sensor technology, the acquisition cost of RGB‐D
images is getting lower and lower, and gesture recognition based on depth images and Red …

保存引用被引用数: 74 関連記事全 3 バージョン

[Free GPT-4]

[PDF] arxiv.org

P4contrast: Contrastive learning with pairs of point-pixel pairs for rgb-d scene understanding

Y Liu, L Yi, S Zhang, Q Fan, T Funkhouser… - ar** of objects with uncertain information: A review

C Wang, X Zhang, X Zang, Y Liu, G Ding, W Yin, J Zhao - Sensors, 2020 - mdpi.com

As there come to be more applications of intelligent robots, their task object is becoming
more varied. However, it is still a challenge for a robot to handle unfamiliar objects. We …

保存引用被引用数: 60 関連記事全 8 バージョンキャッシュ

CCAFFMNet: Dual-spectral semantic segmentation network with channel-coordinate attention feature fusion module

S Yi, J Li, X Liu, X Yuan - Neurocomputing, 2022 - Elsevier

Abstract Dual-spectral (RGB-thermal) semantic segmentation is a fundamental task for
visual perception of autonomous driving in harsh imaging environments (such as darkness …

保存引用被引用数: 39 関連記事全 2 バージョン

RGB-D fusion models for construction and demolition waste detection

J Li, H Fang, L Fan, J Yang, T Ji, Q Chen - Waste Management, 2022 - Elsevier

The development of urbanization has brought a large amount of construction and demolition
waste (CDW), which occupy land and cause adverse ecological effects. To effectively solve …

保存引用被引用数: 32 関連記事全 5 バージョン

[Free GPT-4]

[PDF] acm.org

Deep Multimodal Data Fusion

F Zhao, C Zhang, B Geng - ACM Computing Surveys, 2024 - dl.acm.org

Multimodal Artificial Intelligence (Multimodal AI), in general, involves various types of data
(eg, images, texts, or data collected from different sensors), feature engineering (eg …

保存引用被引用数: 34 関連記事

アラートを作成

引用

検索オプション

マイライブラリに保存しました

RGB-D-based object recognition using multimodal convolutional neural networks: a survey

[HTML][HTML] Deep learning for object detection and scene perception in self-driving cars: Survey, challenges, and open issues

3D Human Action Recognition: Through the eyes of researchers

A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets

Object detection recognition and robot gras** based on machine learning: A survey

Gesture recognition based on multi‐modal feature weight

P4contrast: Contrastive learning with pairs of point-pixel pairs for rgb-d scene understanding

CCAFFMNet: Dual-spectral semantic segmentation network with channel-coordinate attention feature fusion module

RGB-D fusion models for construction and demolition waste detection

Deep Multimodal Data Fusion