pix2gestalt: Amodal segmentation by synthesizing wholes
We introduce pix2gestalt, a framework for zero-shot amodal segmentation, which learns to
estimate the shape and appearance of whole objects that are only partially visible behind …
Exploration of Attention Mechanism-Enhanced Deep Learning Models in the Mining of Medical Textual Data
The research explores the utilization of a deep learning model employing an attention
mechanism in medical text mining. It targets the challenge of analyzing unstructured text …
Emerging techniques in vision-based human posture detection: Machine learning methods and applications
Human posture detection is a rapidly evolving field with significant implications for various
applications, including healthcare, surveillance, and human-computer interaction. The …
Application of multimodal fusion deep learning model in disease recognition
This paper introduces an innovative multi-modal fusion deep learning approach to
overcome the drawbacks of traditional single-modal recognition techniques. These …
RoHM: Robust Human Motion Reconstruction via Diffusion
We propose RoHM, an approach for robust 3D human motion reconstruction from monocular
RGB(-D) videos in the presence of noise and occlusions. Most previous approaches either …
GTA-Net: An IoT-integrated 3D human pose estimation system for real-time adolescent sports posture correction
S Yuan, L Zhou - Alexandria Engineering Journal, 2025 - Elsevier
With the advancement of artificial intelligence, 3D human pose estimation-based systems for
sports training and posture correction have gained significant attention in adolescent sports …
Enhancing medical imaging with GANs synthesizing realistic images from limited data
In this research, we introduce an innovative method for synthesizing medical images using
generative adversarial networks (GANs). Our proposed GANs method demonstrates the …
PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos
While current methods have shown promising progress on estimating 3D human motion
from monocular videos, their motion estimates are often physically unrealistic because they …
Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning
Human pose forecasting garners attention for its diverse applications. However, challenges
in modeling the multi-modal nature of human motion and intricate interactions among agents …
Neural textured deformable meshes for robust analysis-by-synthesis
Human vision demonstrates higher robustness than current AI algorithms under out-of-
distribution scenarios. It has been conjectured that such robustness benefits from performing …