Mm-llms: Recent advances in multimodal large language models
In the past year, MultiModal Large Language Models (MM-LLMs) have undergone
substantial advancements, augmenting off-the-shelf LLMs to support MM inputs or outputs …
substantial advancements, augmenting off-the-shelf LLMs to support MM inputs or outputs …
Continual named entity recognition without catastrophic forgetting
Continual Named Entity Recognition (CNER) is a burgeoning area, which involves updating
an existing model by incorporating new entity types sequentially. Nevertheless, continual …
an existing model by incorporating new entity types sequentially. Nevertheless, continual …
Gradient-semantic compensation for incremental semantic segmentation
Incremental semantic segmentation focuses on continually learning the segmentation of
new coming classes without obtaining the training data from previously seen classes …
new coming classes without obtaining the training data from previously seen classes …
Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models
Incremental Learning (IL) has been a long-standing problem in both vision and Natural
Language Processing (NLP) communities. In recent years, as Pre-trained Language Models …
Language Processing (NLP) communities. In recent years, as Pre-trained Language Models …
Flexible Weight Tuning and Weight Fusion Strategies for Continual Named Entity Recognition
Abstract Continual Named Entity Recognition (CNER) is dedicated to sequentially learning
new entity types while mitigating catastrophic forgetting of old entity types. Traditional CNER …
new entity types while mitigating catastrophic forgetting of old entity types. Traditional CNER …
Deffusion: Deformable multimodal representation fusion for 3d semantic segmentation
The complementarity between camera and LiDAR data makes fusion methods a promising
approach to improve 3D semantic segmentation performance. Recent transformer-based …
approach to improve 3D semantic segmentation performance. Recent transformer-based …
PSTNet: Enhanced polyp segmentation with multi-scale alignment and frequency domain integration
Accurate segmentation of colorectal polyps in colonoscopy images is crucial for effective
diagnosis and management of colorectal cancer (CRC). However, current deep learning …
diagnosis and management of colorectal cancer (CRC). However, current deep learning …
Local feature matching using deep learning: A survey
Local feature matching enjoys wide-ranging applications in the realm of computer vision,
encompassing domains such as image retrieval, 3D reconstruction, and object recognition …
encompassing domains such as image retrieval, 3D reconstruction, and object recognition …
Generalization Boosted Adapter for Open-Vocabulary Segmentation
Vision-language models (VLMs) have demonstrated remarkable open-vocabulary object
recognition capabilities, motivating their adaptation for dense prediction tasks like …
recognition capabilities, motivating their adaptation for dense prediction tasks like …
Concept-driven knowledge distillation and pseudo label generation for continual named entity recognition
Continual named entity recognition requires models to be continuously updated to
recognize new entity types while retaining learned knowledge. In this task, the inherent …
recognize new entity types while retaining learned knowledge. In this task, the inherent …