Mm-llms: Recent advances in multimodal large language models

D Zhang, Y Yu, J Dong, C Li, D Su, C Chu… - arxiv preprint arxiv …, 2024 - arxiv.org
In the past year, MultiModal Large Language Models (MM-LLMs) have undergone
substantial advancements, augmenting off-the-shelf LLMs to support MM inputs or outputs …

Continual named entity recognition without catastrophic forgetting

D Zhang, W Cong, J Dong, Y Yu, X Chen… - arxiv preprint arxiv …, 2023 - arxiv.org
Continual Named Entity Recognition (CNER) is a burgeoning area, which involves updating
an existing model by incorporating new entity types sequentially. Nevertheless, continual …

Gradient-semantic compensation for incremental semantic segmentation

W Cong, Y Cong, J Dong, G Sun… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Incremental semantic segmentation focuses on continually learning the segmentation of
new coming classes without obtaining the training data from previously seen classes …

Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models

J Zheng, S Qiu, Q Ma - arxiv preprint arxiv:2312.07887, 2023 - arxiv.org
Incremental Learning (IL) has been a long-standing problem in both vision and Natural
Language Processing (NLP) communities. In recent years, as Pre-trained Language Models …

Flexible Weight Tuning and Weight Fusion Strategies for Continual Named Entity Recognition

Y Yu, D Zhang, X Chen, C Chu - Findings of the Association for …, 2024 - aclanthology.org
Abstract Continual Named Entity Recognition (CNER) is dedicated to sequentially learning
new entity types while mitigating catastrophic forgetting of old entity types. Traditional CNER …

Deffusion: Deformable multimodal representation fusion for 3d semantic segmentation

R Xu, C Wang, D Zhang, M Zhang, S Xu… - … on Robotics and …, 2024 - ieeexplore.ieee.org
The complementarity between camera and LiDAR data makes fusion methods a promising
approach to improve 3D semantic segmentation performance. Recent transformer-based …

PSTNet: Enhanced polyp segmentation with multi-scale alignment and frequency domain integration

W Xu, R Xu, C Wang, X Li, S Xu… - IEEE Journal of …, 2024 - ieeexplore.ieee.org
Accurate segmentation of colorectal polyps in colonoscopy images is crucial for effective
diagnosis and management of colorectal cancer (CRC). However, current deep learning …

Local feature matching using deep learning: A survey

S Xu, S Chen, R Xu, C Wang, P Lu, L Guo - Information Fusion, 2024 - Elsevier
Local feature matching enjoys wide-ranging applications in the realm of computer vision,
encompassing domains such as image retrieval, 3D reconstruction, and object recognition …

Generalization Boosted Adapter for Open-Vocabulary Segmentation

W Xu, C Wang, X Feng, R Xu, L Huang… - … on Circuits and …, 2024 - ieeexplore.ieee.org
Vision-language models (VLMs) have demonstrated remarkable open-vocabulary object
recognition capabilities, motivating their adaptation for dense prediction tasks like …

Concept-driven knowledge distillation and pseudo label generation for continual named entity recognition

H Liu, X **n, W Peng, J Song, J Sun - Expert Systems with Applications, 2025 - Elsevier
Continual named entity recognition requires models to be continuously updated to
recognize new entity types while retaining learned knowledge. In this task, the inherent …