Unified 3d segmenter as prototypical classifiers

Z Qin, C Han, Q Wang, X Nie, Y Yin… - Advances in Neural …, 2023 - proceedings.neurips.cc
The task of point cloud segmentation, comprising semantic, instance, and panoptic
segmentation, has been mainly tackled by designing task-specific network architectures …

How to configure good in-context sequence for visual question answering

L Li, J Peng, H Chen, C Gao… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Inspired by the success of Large Language Models in dealing with new tasks via In-Context
Learning (ICL) in NLP researchers have also developed Large Vision-Language Models …

Robust fuzzy rough approximations with kNN granules for semi-supervised feature selection

S An, M Zhang, C Wang, W Ding - Fuzzy Sets and Systems, 2023 - Elsevier
Fuzzy rough set theory has attracted much attention because of its successful application in
uncertainty measurement. To improve the efficiency and robustness of uncertainty measure …

Noise-aware image captioning with progressively exploring mismatched words

Z Fu, K Song, L Zhou, Y Yang - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Image captioning aims to automatically generate captions for images by learning a cross-
modal generator from vision to language. The large amount of image-text pairs required for …

Video corpus moment retrieval via deformable multigranularity feature fusion and adversarial training

X Zhang, P Zhao, J Ji, X Lu, Y Yin - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
As a new emerging task, video corpus moment retrieval (VCMR) aims to find the video
segments relevant to a given natural language query from a large number of untrimmed …

Deep visual-linguistic fusion network considering cross-modal inconsistency for rumor detection

Y Yang, R Bao, W Guo, DC Zhan, Y Yin… - Science China Information …, 2023 - Springer
With the development of the Internet, users can freely publish posts on various social media
platforms, which offers great convenience for kee** abreast of the world. However, posts …

Robust semi-supervised learning for self-learning open-world classes

W **, X Song, W Guo, Y Yang - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Existing semi-supervised learning (SSL) methods assume that labeled and unlabeled data
share the same class space. However, in real-world applications, unlabeled data always …

DOMFN: A divergence-orientated multi-modal fusion network for resume assessment

Y Yang, J Zhang, F Gao, X Gao, H Zhu - Proceedings of the 30th ACM …, 2022 - dl.acm.org
In talent management, resume assessment aims to analyze the quality of a job seeker's
resume, which can assist recruiters to discover suitable candidates and benefit job seekers …

Jobformer: Skill-aware job recommendation with semantic-enhanced transformer

Z Guan, JQ Yang, Y Yang, H Zhu, W Li… - ACM Transactions on …, 2024 - dl.acm.org
Job recommendation aims to provide potential talents with suitable job descriptions (JDs)
consistent with their career trajectory, which plays an essential role in proactive talent …

Dvsai: Diverse view-shared anchors based incomplete multi-view clustering

S Yu, S Wang, P Zhang, M Wang, Z Wang… - Proceedings of the …, 2024 - ojs.aaai.org
In numerous real-world applications, it is quite common that sample information is partially
available for some views due to machine breakdown or sensor failure, causing the problem …