Beyond the contact: Discovering comprehensive affordance for 3d objects from pre-trained 2d diffusion models

H Kim, S Han, P Kwon, H Joo - European Conference on Computer Vision, 2024 - Springer
Understanding the inherent human knowledge in interacting with a given environment (eg,
affordance) is essential for improving AI to better assist humans. While existing approaches …

Crin: rotation-invariant point cloud analysis and rotation estimation via centrifugal reference frame

Y Lou, Z Ye, Y You, N Jiang, J Lu, W Wang… - Proceedings of the …, 2023 - ojs.aaai.org
Various recent methods attempt to implement rotation-invariant 3D deep learning by
replacing the input coordinates of points with relative distances and angles. Due to the …

Markov Progressive Framework, a Universal Paradigm for Modeling Long Videos

B Pang, G Peng, Y Li, C Lu - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
The computational complexity of video models increases linearly with the square number of
frames. Thus, constrained bycomputational resources, training video models to learn long …

PGT: A progressive method for training models on long videos

B Pang, G Peng, Y Li, C Lu - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Convolutional video models have an order of magnitude larger computational complexity
than their counterpart image-level models. Constrained by computational resources, there is …

Understanding Pixel-Level 2D Image Semantics With 3D Keypoint Knowledge Engine

Y You, C Li, Y Lou, Z Cheng, L Li, L Ma… - … on Pattern Analysis …, 2021 - ieeexplore.ieee.org
Pixel-level 2D object semantic understanding is an important topic in computer vision and
could help machine deeply understand objects (eg, functionality and affordance) in our daily …