Deep learning-enabled semantic communication systems with task-unaware transmitter and dynamic data
H Zhang, S Shao, M Tao, X Bi… - IEEE Journal on …, 2022 - ieeexplore.ieee.org
Existing deep learning-enabled semantic communication systems often rely on shared
background knowledge between the transmitter and receiver that includes empirical data …
background knowledge between the transmitter and receiver that includes empirical data …
Scalable image coding for humans and machines
At present, and increasingly so in the future, much of the captured visual content will not be
seen by humans. Instead, it will be used for automated machine vision analytics and may …
seen by humans. Instead, it will be used for automated machine vision analytics and may …
Reversible vision transformers
Abstract We present Reversible Vision Transformers, a memory efficient architecture design
for visual recognition. By decoupling the GPU memory footprint from the depth of the model …
for visual recognition. By decoupling the GPU memory footprint from the depth of the model …
Spcgc: scalable point cloud geometry compression for machine vision
With the proliferation of sensor devices, the extensive utilization of three-dimensional data in
multimedia continues to grow. Point clouds are widely adopted within this domain because …
multimedia continues to grow. Point clouds are widely adopted within this domain because …
Re2TAL: Rewiring pretrained video backbones for reversible temporal action localization
Temporal action localization (TAL) requires long-form reasoning to predict actions of various
durations and complex content. Given limited GPU memory, training TAL end to end (ie, from …
durations and complex content. Given limited GPU memory, training TAL end to end (ie, from …
Roi-guided point cloud geometry compression towards human and machine vision
Point cloud data is pivotal in applications like autonomous driving, virtual reality, and
robotics. However, its substantial volume poses significant challenges in storage and …
robotics. However, its substantial volume poses significant challenges in storage and …
Non-semantics suppressed mask learning for unsupervised video semantic compression
Most video compression methods aim to improve the decoded video visual quality, instead
of particularly guaranteeing the semantic-completeness, which deteriorates downstream …
of particularly guaranteeing the semantic-completeness, which deteriorates downstream …
An indirect rate-distortion characterization for semantic sources: General model and the case of gaussian observation
A new source model, which consists of an intrinsic state part and an extrinsic observation
part, is proposed and its information-theoretic characterization, namely its rate-distortion …
part, is proposed and its information-theoretic characterization, namely its rate-distortion …
Joint coding-modulation for digital semantic communications via variational autoencoder
Y Bo, Y Duan, S Shao, M Tao - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Semantic communications have emerged as a new paradigm for improving communication
efficiency by transmitting the semantic information of a source message that is most relevant …
efficiency by transmitting the semantic information of a source message that is most relevant …
SSSIC: semantics-to-signal scalable image coding with learned structural representations
We address the requirement of image coding for joint human-machine vision, ie, the
decoded image serves both human observation and machine analysis/understanding …
decoded image serves both human observation and machine analysis/understanding …