Joint sensing adaptation and model placement in 6G fabric computing
Sensing and computing based on intelligent fabrics can meet the ultra-reliable and low-
latency communication (URLLC) needs of sixth-generation wireless (6G) by integrating …
latency communication (URLLC) needs of sixth-generation wireless (6G) by integrating …
Edge intelligence-empowered immersive media
Recent years have witnessed many immersive media services and applications, ranging
from 360° video streaming to augmented and virtual reality (VR) and the recent metaverse …
from 360° video streaming to augmented and virtual reality (VR) and the recent metaverse …
Jellyfish: Timely inference serving for dynamic edge networks
While high accuracy is of paramount importance for deep learning (DL) inference, serving
inference requests on time is equally critical but has not been carefully studied especially …
inference requests on time is equally critical but has not been carefully studied especially …
Rt-lm: Uncertainty-aware resource management for real-time inference of language models
Recent advancements in language models (LMs) have gained substantial attentions on their
capability to generate human-like responses. Though exhibiting a promising future for …
capability to generate human-like responses. Though exhibiting a promising future for …
: On-Device Real-Time Deep Reinforcement Learning for Autonomous Robotics
Autonomous robotic systems, like autonomous vehicles and robotic search and rescue,
require efficient on-device training for continuous adaptation of Deep Reinforcement …
require efficient on-device training for continuous adaptation of Deep Reinforcement …
Collaborative video analytics on distributed edges with multiagent deep reinforcement learning
G Gao, Y Dong, R Wang - arxiv e-prints, 2022 - ui.adsabs.harvard.edu
Abstract Deep Neural Network (DNN) based video analytics empowers many computer
vision-based applications to achieve high recognition accuracy. To reduce inference delay …
vision-based applications to achieve high recognition accuracy. To reduce inference delay …
EdgeVision: Towards Collaborative Video Analytics on Distributed Edges for Performance Maximization
G Gao, Y Dong, R Wang, X Zhou - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep Neural Network (DNN)-based video analytics significantly improves recognition
accuracy in computer vision applications. Deploying DNN models at edge nodes, closer to …
accuracy in computer vision applications. Deploying DNN models at edge nodes, closer to …
Share-Aware Joint Model Deployment and Task Offloading for Multi-Task Inference
In vehicular edge computing, efficient strategies for model deployment and task offloading
offer tremendous potential to reduce response time for machine learning inference …
offer tremendous potential to reduce response time for machine learning inference …
Chasing Common Knowledge: Joint Large Model Selection and Pulling in MEC With Parameter Sharing
Pretrained Foundation Models (PFMs) are regarded as a promising accelerator for the
development of various Artificial Intelligence (AI) applications, and have recently been …
development of various Artificial Intelligence (AI) applications, and have recently been …
MoEI: Mobility-Aware Edge Inference Based on Model Partition and Service Migration
Deep neural networks are the cornerstone of many mobile intelligent systems, and their
inference processes bring about computation-intensive tasks. Device-edge cooperative …
inference processes bring about computation-intensive tasks. Device-edge cooperative …