- Academic Search

Y Hu, J Tang, X Gong, Z Zhou, S Zhang… - arxiv preprint arxiv …, 2025‏ - arxiv.org‏

The recent surge in artificial intelligence, particularly in multimodal processing technology,
has advanced human-computer interaction, by altering how intelligent systems perceive …‏

שמור צטט מאמרים בנושא זה כל 2 הגרסאות פתיחה בתור HTML

Empowering smart glasses with large language models: Towards ubiquitous AGI‏

D Zhang, Y Li, Z He, X Li - Companion of the 2024 on ACM International …, 2024‏ - dl.acm.org‏

Smart glasses, augmented by advances in multimodal Large Language Models (LLMs), are
at the forefront of creating ubiquitous Artificial General Intelligence (AGI). This short literature …‏

שמור צטט צוטט על ידי 2 מאמרים בנושא זה

[免费ChatGPT] [DeepSeek可用网址] [PDF] arxiv.org

NoTeeline: Supporting Real-Time, Personalized Notetaking with LLM-Enhanced Micronotes‏

F Huq, A Samee, DC Lin, XA Tang… - arxiv preprint arxiv …, 2024‏ - arxiv.org‏

Taking notes quickly while effectively capturing key information can be challenging,
especially when watching videos that present simultaneous visual and auditory streams …‏

שמור צטט מאמרים בנושא זה פתיחה בתור HTML

[免费ChatGPT] [DeepSeek可用网址] [PDF] 3dvar.com

[PDF][PDF] Vision-Based Multimodal Interfaces: A Survey and Taxonomy for Enhanced Context-Aware System Design‏

J Tang, X Gong, Z Zhou, S Zhang, DS Elvitigala, W Hu… - 2025‏ - 3dvar.com‏

The recent surge in artificial intelligence, particularly in multimodal processing technology,
has advanced human-computer interaction, by altering how intelligent systems perceive …‏

שמור צטט מאמרים בנושא זה פתיחה בתור HTML

יצירת התראה

צטט

חיפוש מתקדם

נשמר בספרייה שלי

GazeNoter: Co-Piloted AR Note-Taking via Gaze Selection of LLM Suggestions to Match Users'...

Vision-Based Multimodal Interfaces: A Survey and Taxonomy for Enhanced Context-Aware System Design‏

Empowering smart glasses with large language models: Towards ubiquitous AGI‏

NoTeeline: Supporting Real-Time, Personalized Notetaking with LLM-Enhanced Micronotes‏

[PDF][PDF] Vision-Based Multimodal Interfaces: A Survey and Taxonomy for Enhanced Context-Aware System Design‏