Vision-Based Multimodal Interfaces: A Survey and Taxonomy for Enhanced Context-Aware System Design
The recent surge in artificial intelligence, particularly in multimodal processing technology,
has advanced human-computer interaction, by altering how intelligent systems perceive …
has advanced human-computer interaction, by altering how intelligent systems perceive …
Empowering smart glasses with large language models: Towards ubiquitous AGI
Smart glasses, augmented by advances in multimodal Large Language Models (LLMs), are
at the forefront of creating ubiquitous Artificial General Intelligence (AGI). This short literature …
at the forefront of creating ubiquitous Artificial General Intelligence (AGI). This short literature …
NoTeeline: Supporting Real-Time, Personalized Notetaking with LLM-Enhanced Micronotes
Taking notes quickly while effectively capturing key information can be challenging,
especially when watching videos that present simultaneous visual and auditory streams …
especially when watching videos that present simultaneous visual and auditory streams …
[PDF][PDF] Vision-Based Multimodal Interfaces: A Survey and Taxonomy for Enhanced Context-Aware System Design
J Tang, X Gong, Z Zhou, S Zhang, DS Elvitigala, W Hu… - 2025 - 3dvar.com
The recent surge in artificial intelligence, particularly in multimodal processing technology,
has advanced human-computer interaction, by altering how intelligent systems perceive …
has advanced human-computer interaction, by altering how intelligent systems perceive …