A comprehensive survey on segment anything model for vision and beyond

C Zhang, L Liu, Y Cui, G Huang, W Lin, Y Yang… - arxiv preprint arxiv …, 2023 - arxiv.org
Artificial intelligence (AI) is evolving towards artificial general intelligence, which refers to the
ability of an AI system to perform a wide range of tasks and exhibit a level of intelligence …

Transitioning to human interaction with AI systems: New challenges and opportunities for HCI professionals to enable human-centered AI

W Xu, MJ Dainoff, L Ge, Z Gao - International Journal of Human …, 2023 - Taylor & Francis
While AI has benefited humans, it may also harm humans if not appropriately developed.
The priority of current HCI work should focus on transiting from conventional human …

Diffumask: Synthesizing images with pixel-level annotations for semantic segmentation using diffusion models

W Wu, Y Zhao, MZ Shou, H Zhou… - Proceedings of the …, 2023 - openaccess.thecvf.com
Collecting and annotating images with pixel-wise labels is time-consuming and laborious. In
contrast, synthetic data can be freely available using a generative model (eg, DALL-E …

Maptrv2: An end-to-end framework for online vectorized hd map construction

B Liao, S Chen, Y Zhang, B Jiang, Q Zhang… - International Journal of …, 2024 - Springer
High-definition (HD) map provides abundant and precise static environmental information of
the driving scene, serving as a fundamental and indispensable component for planning in …

Kitti-360: A novel dataset and benchmarks for urban scene understanding in 2d and 3d

Y Liao, J **e, A Geiger - IEEE Transactions on Pattern Analysis …, 2022 - ieeexplore.ieee.org
For the last few decades, several major subfields of artificial intelligence including computer
vision, graphics, and robotics have progressed largely independently from each other …

Segdiff: Image segmentation with diffusion probabilistic models

T Amit, T Shaharbany, E Nachmani, L Wolf - arxiv preprint arxiv …, 2021 - arxiv.org
Diffusion Probabilistic Methods are employed for state-of-the-art image generation. In this
work, we present a method for extending such models for performing image segmentation …

Simpleclick: Interactive image segmentation with simple vision transformers

Q Liu, Z Xu, G Bertasius… - Proceedings of the …, 2023 - openaccess.thecvf.com
Click-based interactive image segmentation aims at extracting objects with a limited user
clicking. A hierarchical backbone is the de-facto architecture for current methods. Recently …

Tap-vid: A benchmark for tracking any point in a video

C Doersch, A Gupta, L Markeeva… - Advances in …, 2022 - proceedings.neurips.cc
Generic motion understanding from video involves not only tracking objects, but also
perceiving how their surfaces deform and move. This information is useful to make …

Polyformer: Referring image segmentation as sequential polygon generation

J Liu, H Ding, Z Cai, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this work, instead of directly predicting the pixel-level segmentation masks, the problem of
referring image segmentation is formulated as sequential polygon generation, and the …

Panoptic nerf: 3d-to-2d label transfer for panoptic urban scene segmentation

X Fu, S Zhang, T Chen, Y Lu, L Zhu… - … Conference on 3D …, 2022 - ieeexplore.ieee.org
Large-scale training data with high-quality annotations is critical for training semantic and
instance segmentation models. Unfortunately, pixel-wise annotation is labor-intensive and …