Deep learning for event-based vision: A comprehensive survey and benchmarks
Event cameras are bio-inspired sensors that capture the per-pixel intensity changes
asynchronously and produce event streams encoding the time, pixel position, and polarity …
asynchronously and produce event streams encoding the time, pixel position, and polarity …
Deep learning-based depth estimation methods from monocular image and videos: A comprehensive survey
Estimating depth from single RGB images and videos is of widespread interest due to its
applications in many areas, including autonomous driving, 3D reconstruction, digital …
applications in many areas, including autonomous driving, 3D reconstruction, digital …
Spiking transformers for event-based single object tracking
Event-based cameras bring a unique capability to tracking, being able to function in
challenging real-world conditions as a direct result of their high temporal resolution and high …
challenging real-world conditions as a direct result of their high temporal resolution and high …
Delivering arbitrary-modal semantic segmentation
Multimodal fusion can make semantic segmentation more robust. However, fusing an
arbitrary number of modalities remains underexplored. To delve into this problem, we create …
arbitrary number of modalities remains underexplored. To delve into this problem, we create …
CMX: Cross-modal fusion for RGB-X semantic segmentation with transformers
Scene understanding based on image segmentation is a crucial component of autonomous
vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting …
vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting …
Cmda: Cross-modality domain adaptation for nighttime semantic segmentation
Most nighttime semantic segmentation studies are based on domain adaptation approaches
and image input. However, limited by the low dynamic range of conventional cameras …
and image input. However, limited by the low dynamic range of conventional cameras …
Ess: Learning event-based semantic segmentation from still images
Retrieving accurate semantic information in challenging high dynamic range (HDR) and
high-speed conditions remains an open challenge for image-based algorithms due to …
high-speed conditions remains an open challenge for image-based algorithms due to …
Spike transformer: Monocular depth estimation for spiking camera
Spiking camera is a bio-inspired vision sensor that mimics the sampling mechanism of the
primate fovea, which has shown great potential for capturing high-speed dynamic scenes …
primate fovea, which has shown great potential for capturing high-speed dynamic scenes …
Vista 2.0: An open, data-driven simulator for multimodal sensing and policy learning for autonomous vehicles
Simulation has the potential to transform the development of robust algorithms for mobile
agents deployed in safety-critical scenarios. However, the poor photorealism and lack of …
agents deployed in safety-critical scenarios. However, the poor photorealism and lack of …
Brain-inspired computing: A systematic survey and future trends
Brain-inspired computing (BIC) is an emerging research field that aims to build fundamental
theories, models, hardware architectures, and application systems toward more general …
theories, models, hardware architectures, and application systems toward more general …