Single-model and any-modality for video object tracking
In the realm of video object tracking auxiliary modalities such as depth thermal or event data
have emerged as valuable assets to complement the RGB trackers. In practice most existing …
have emerged as valuable assets to complement the RGB trackers. In practice most existing …
Blinkvision: A benchmark for optical flow, scene flow and point tracking estimation using rgb frames and events
Recent advances in event-based vision suggest that they complement traditional cameras
by providing continuous observation without frame rate limitations and high dynamic range …
by providing continuous observation without frame rate limitations and high dynamic range …
Muvo: A multimodal generative world model for autonomous driving with geometric representations
D Bogdoll, Y Yang, JM Zöllner - arxiv preprint arxiv:2311.11762, 2023 - arxiv.org
Learning unsupervised world models for autonomous driving has the potential to improve
the reasoning capabilities of today's systems dramatically. However, most work neglects the …
the reasoning capabilities of today's systems dramatically. However, most work neglects the …
Segment Any Event Streams via Weighted Adaptation of Pivotal Tokens
In this paper we delve into the nuanced challenge of tailoring the Segment Anything Models
(SAMs) for integration with event data with the overarching objective of attaining robust and …
(SAMs) for integration with event data with the overarching objective of attaining robust and …
Efficient Meshflow and Optical Flow Estimation from Event Cameras
In this paper we explore the problem of event-based meshflow estimation a novel task that
involves predicting a spatially smooth sparse motion field from event cameras. To start we …
involves predicting a spatially smooth sparse motion field from event cameras. To start we …
Temporal event stereo via joint learning with stereoscopic flow
Event cameras are dynamic vision sensors inspired by the biological retina, characterized
by their high dynamic range, high temporal resolution, and low power consumption. These …
by their high dynamic range, high temporal resolution, and low power consumption. These …
Bring Event into RGB and LiDAR: Hierarchical Visual-Motion Fusion for Scene Flow
Single RGB or LiDAR is the mainstream sensor for the challenging scene flow which relies
heavily on visual features to match motion features. Compared with single modality existing …
heavily on visual features to match motion features. Compared with single modality existing …
[HTML][HTML] High-Performance Grape Disease Detection Method Using Multimodal Data and Parallel Activation Functions
R Li, J Liu, B Shi, H Zhao, Y Li, X Zheng, C Peng, C Lv - Plants, 2024 - pmc.ncbi.nlm.nih.gov
This paper introduces a novel deep learning model for grape disease detection that
integrates multimodal data and parallel heterogeneous activation functions, significantly …
integrates multimodal data and parallel heterogeneous activation functions, significantly …
Steering Prediction via a Multi-Sensor System for Autonomous Racing
Autonomous racing has rapidly gained research attention. Traditionally, racing cars rely on
2D LiDAR as their primary visual system. In this work, we explore the integration of an event …
2D LiDAR as their primary visual system. In this work, we explore the integration of an event …
Video Frame Prediction from a Single Image and Events
Recently, the task of Video Frame Prediction (VFP), which predicts future video frames from
previous ones through extrapolation, has made remarkable progress. However, the …
previous ones through extrapolation, has made remarkable progress. However, the …