Online episodic memory visual query localization with egocentric streaming object memory

Z Manigrasso, M Dunnhofer, A Furnari… - arxiv preprint arxiv …, 2024 - arxiv.org
Episodic memory retrieval aims to enable wearable devices with the ability to recollect from
past video observations objects or events that have been observed (eg," where did I last see …

Hier-EgoPack: Hierarchical Egocentric Video Understanding with Diverse Task Perspectives

SA Peirone, F Pistilli, A Alliegro, T Tommasi… - arxiv preprint arxiv …, 2025 - arxiv.org
Our comprehension of video streams depicting human activities is naturally multifaceted: in
just a few moments, we can grasp what is happening, identify the relevance and interactions …

Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs

Z Huang, Y Ji, X Wang, N Mehta, T **ao, D Lee… - arxiv preprint arxiv …, 2025 - arxiv.org
Long-form video understanding with Large Vision Language Models is challenged by the
need to analyze temporally dispersed yet spatially concentrated key moments within limited …