Joint sensing adaptation and model placement in 6G fabric computing

Y Hao, L Hu, M Chen - IEEE Journal on Selected Areas in …, 2023 - ieeexplore.ieee.org
Sensing and computing based on intelligent fabrics can meet the ultra-reliable and low-
latency communication (URLLC) needs of sixth-generation wireless (6G) by integrating …

Edge intelligence-empowered immersive media

Z Wang, J Liu, W Zhu - IEEE MultiMedia, 2023 - ieeexplore.ieee.org
Recent years have witnessed many immersive media services and applications, ranging
from 360° video streaming to augmented and virtual reality (VR) and the recent metaverse …

Jellyfish: Timely inference serving for dynamic edge networks

V Nigade, P Bauszat, H Bal… - 2022 IEEE Real-Time …, 2022 - ieeexplore.ieee.org
While high accuracy is of paramount importance for deep learning (DL) inference, serving
inference requests on time is equally critical but has not been carefully studied especially …

Rt-lm: Uncertainty-aware resource management for real-time inference of language models

Y Li, Z Li, W Yang, C Liu - arxiv preprint arxiv:2309.06619, 2023 - arxiv.org
Recent advancements in language models (LMs) have gained substantial attentions on their
capability to generate human-like responses. Though exhibiting a promising future for …

: On-Device Real-Time Deep Reinforcement Learning for Autonomous Robotics

Z Li, A Samanta, Y Li, A Soltoggio… - 2023 IEEE Real-Time …, 2023 - ieeexplore.ieee.org
Autonomous robotic systems, like autonomous vehicles and robotic search and rescue,
require efficient on-device training for continuous adaptation of Deep Reinforcement …

Collaborative video analytics on distributed edges with multiagent deep reinforcement learning

G Gao, Y Dong, R Wang - arxiv e-prints, 2022 - ui.adsabs.harvard.edu
Abstract Deep Neural Network (DNN) based video analytics empowers many computer
vision-based applications to achieve high recognition accuracy. To reduce inference delay …

EdgeVision: Towards Collaborative Video Analytics on Distributed Edges for Performance Maximization

G Gao, Y Dong, R Wang, X Zhou - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep Neural Network (DNN)-based video analytics significantly improves recognition
accuracy in computer vision applications. Deploying DNN models at edge nodes, closer to …

Share-Aware Joint Model Deployment and Task Offloading for Multi-Task Inference

Y Wu, J Wu, L Chen, B Liu, M Yao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In vehicular edge computing, efficient strategies for model deployment and task offloading
offer tremendous potential to reduce response time for machine learning inference …

Chasing Common Knowledge: Joint Large Model Selection and Pulling in MEC With Parameter Sharing

L Zhou, Z Xu, Q **a, Z Xu, W Ren, W Qi… - … on Parallel and …, 2025 - ieeexplore.ieee.org
Pretrained Foundation Models (PFMs) are regarded as a promising accelerator for the
development of various Artificial Intelligence (AI) applications, and have recently been …

MoEI: Mobility-Aware Edge Inference Based on Model Partition and Service Migration

Z Liu, M Tian, M Dong, X Wang, C Qiu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep neural networks are the cornerstone of many mobile intelligent systems, and their
inference processes bring about computation-intensive tasks. Device-edge cooperative …