Large language models are visual reasoning coordinators

L Chen, B Li, S Shen, J Yang, C Li… - Advances in …, 2023 - proceedings.neurips.cc
Visual reasoning requires multimodal perception and commonsense cognition of the world.
Recently, multiple vision-language models (VLMs) have been proposed with excellent …
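
As a rough illustration of the coordination idea in the title (an LLM aggregating the outputs of several VLMs), here is a minimal Python sketch; the `vlms` and `llm` callables are hypothetical stand-ins, not the paper's actual interface.

```python
def coordinate(question: str, image, vlms, llm) -> str:
    """Hypothetical sketch: several VLMs answer independently,
    and an LLM reasons over the candidates to pick a final answer."""
    candidates = [vlm(image, question) for vlm in vlms]
    prompt = (
        f"Question: {question}\n"
        + "\n".join(f"Model {i} answers: {a}" for i, a in enumerate(candidates))
        + "\nGiven these candidate answers, state the most plausible final answer."
    )
    return llm(prompt)
```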

ALOFT: A lightweight MLP-like architecture with dynamic low-frequency transform for domain generalization

J Guo, N Wang, L Qi, Y Shi - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com
Domain generalization (DG) aims to learn a model that generalizes well to unseen
target domains utilizing multiple source domains without re-training. Most existing DG works …
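
As a reference point for the "low-frequency transform" in the title, a generic frequency-space augmentation can be sketched as below; the exact ALOFT transform (which models and dynamically samples a low-frequency distribution) differs, and `radius`/`noise_std` are illustrative parameters.

```python
import torch

def low_freq_perturb(x: torch.Tensor, radius: int = 4, noise_std: float = 0.1) -> torch.Tensor:
    # x: (B, C, H, W). Perturb only the centered low-frequency amplitudes,
    # leaving phase (which carries semantic structure) untouched.
    freq = torch.fft.fftshift(torch.fft.fft2(x), dim=(-2, -1))
    amp, phase = freq.abs(), freq.angle()
    B, C, H, W = x.shape
    cy, cx = H // 2, W // 2
    noise = 1 + noise_std * torch.randn(B, C, 2 * radius, 2 * radius)
    amp[..., cy - radius:cy + radius, cx - radius:cx + radius] *= noise
    freq = torch.polar(amp, phase)
    return torch.fft.ifft2(torch.fft.ifftshift(freq, dim=(-2, -1))).real
```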

Robust mixture-of-expert training for convolutional neural networks

Y Zhang, R Cai, T Chen, G Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Sparsely-gated Mixture of Expert (MoE), an emerging deep model architecture, has
demonstrated great promise for enabling high-accuracy and ultra-efficient model inference …
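
For context, the sparsely-gated MoE design the snippet refers to routes each token to its top-k experts via a learned gate; a minimal generic layer (without the paper's robustness mechanisms) might look like this.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    def __init__(self, dim: int, num_experts: int = 4, k: int = 1):
        super().__init__()
        self.experts = nn.ModuleList(nn.Linear(dim, dim) for _ in range(num_experts))
        self.gate = nn.Linear(dim, num_experts)
        self.k = k

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (N, dim)
        topv, topi = self.gate(x).topk(self.k, dim=-1)    # pick k experts per token
        weights = F.softmax(topv, dim=-1)                 # renormalize over chosen experts
        out = torch.zeros_like(x)
        for j in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = topi[:, j] == e
                if mask.any():
                    out[mask] += weights[mask, j:j + 1] * expert(x[mask])
        return out
```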

FuseMoE: Mixture-of-experts transformers for fleximodal fusion

X Han, H Nguyen, C Harris, N Ho… - Advances in Neural …, 2025 - proceedings.neurips.cc
As machine learning models in critical fields increasingly grapple with multimodal data, they
face the dual challenges of handling a wide array of modalities, often incomplete due to …
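
A minimal sketch of the fusion pattern the abstract gestures at: MoE-style fusion over whatever modalities happen to be present, with missing modalities simply absent from the input dict. FuseMoE's actual gating and routing are more specialized; the module and modality names here are illustrative.

```python
import torch
import torch.nn as nn

class FlexiModalFusion(nn.Module):
    def __init__(self, dims: dict, hidden: int = 64, num_experts: int = 4):
        super().__init__()
        # One encoder per modality projects into a shared token space.
        self.encoders = nn.ModuleDict({m: nn.Linear(d, hidden) for m, d in dims.items()})
        self.experts = nn.ModuleList(nn.Linear(hidden, hidden) for _ in range(num_experts))
        self.gate = nn.Linear(hidden, num_experts)

    def forward(self, inputs: dict) -> torch.Tensor:
        # Encode only the modalities actually provided, one token each.
        tokens = torch.stack([self.encoders[m](x) for m, x in inputs.items()], dim=1)
        w = self.gate(tokens).softmax(dim=-1)                               # (B, M, E)
        expert_out = torch.stack([e(tokens) for e in self.experts], dim=-1) # (B, M, H, E)
        fused = (expert_out * w.unsqueeze(2)).sum(-1)                       # weight experts per token
        return fused.mean(dim=1)                                            # pool over modalities

# e.g. FlexiModalFusion({"vitals": 32, "notes": 128}) still runs when only
# {"vitals": ...} is supplied at inference time.
```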

Knowledge distillation-based domain-invariant representation learning for domain generalization

Z Niu, J Yuan, X Ma, Y Xu, J Liu… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Domain generalization (DG) aims to generalize the knowledge learned from multiple source
domains to unseen target domains. Existing DG techniques can be subsumed under two …
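
The standard distillation objective such work builds on can be written compactly; this generic version (softened teacher targets plus the supervised loss) is a sketch, not the paper's exact domain-invariance formulation.

```python
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T: float = 4.0, alpha: float = 0.5):
    # Student matches the teacher's temperature-softened distribution ...
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)                                  # T^2 keeps gradient scale comparable
    # ... while still fitting the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```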

DGMamba: Domain generalization via generalized state space model

S Long, Q Zhou, X Li, X Lu, C Ying, Y Luo… - Proceedings of the …, 2024 - dl.acm.org
Domain generalization (DG) aims at solving distribution shift problems in various scenes.
Existing approaches are based on Convolution Neural Networks (CNNs) or Vision …
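
For readers unfamiliar with state space models, the recurrence at their core is simple; below is a minimal diagonal SSM scan. DGMamba's contribution lies in the DG-specific components built around a Mamba-style selective variant, which this sketch omits.

```python
import torch

def ssm_scan(u: torch.Tensor, A: torch.Tensor, B: torch.Tensor, C: torch.Tensor) -> torch.Tensor:
    # u: (batch, length, dim); A, B, C: (dim,) diagonal parameters.
    # Recurrence: x_t = A * x_{t-1} + B * u_t ;  readout: y_t = C * x_t
    batch, length, dim = u.shape
    x = torch.zeros(batch, dim)
    ys = []
    for t in range(length):
        x = A * x + B * u[:, t]
        ys.append(C * x)
    return torch.stack(ys, dim=1)   # (batch, length, dim)
```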

Graph mixture of experts: Learning on large-scale graphs with explicit diversity modeling

H Wang, Z Jiang, Y You, Y Han, G Liu… - Advances in …, 2023 - proceedings.neurips.cc
Graph neural networks (GNNs) have found extensive applications in learning from graph
data. However, real-world graphs often possess diverse structures and comprise nodes and …
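
The core mechanism, routing each node's aggregated neighborhood through a mixture of experts so that structurally different nodes get different transforms, can be sketched generically (dense adjacency, soft routing; GMoE's hop-aware expert design is not reproduced here).

```python
import torch
import torch.nn as nn

class GraphMoELayer(nn.Module):
    def __init__(self, dim: int, num_experts: int = 4):
        super().__init__()
        self.experts = nn.ModuleList(nn.Linear(dim, dim) for _ in range(num_experts))
        self.gate = nn.Linear(dim, num_experts)

    def forward(self, x: torch.Tensor, adj: torch.Tensor) -> torch.Tensor:
        h = adj @ x                                               # neighborhood aggregation
        w = self.gate(h).softmax(dim=-1)                          # (N, E) per-node routing
        out = torch.stack([e(h) for e in self.experts], dim=-1)   # (N, D, E)
        return (out * w.unsqueeze(1)).sum(-1)
```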

CA-MoEiT: Generalizable face anti-spoofing via dual cross-attention and semi-fixed mixture-of-expert

A Liu - International Journal of Computer Vision, 2024 - Springer
Although the generalization of face anti-spoofing (FAS) has drawn increasing attention,
solving it with Vision Transformer (ViT) backbones is still at an early stage. In this paper, we present a …
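
The "dual cross-attention" half of the title can be pictured as two token streams attending to each other; this is a generic bidirectional cross-attention sketch, with the paper's semi-fixed MoE component omitted.

```python
import torch
import torch.nn as nn

class DualCrossAttention(nn.Module):
    def __init__(self, dim: int, heads: int = 4):
        super().__init__()
        self.a2b = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.b2a = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, a: torch.Tensor, b: torch.Tensor):
        a_out, _ = self.a2b(a, b, b)   # stream a queries stream b
        b_out, _ = self.b2a(b, a, a)   # stream b queries stream a
        return a + a_out, b + b_out    # residual updates for both streams
```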

MoE-FFD: Mixture of experts for generalized and parameter-efficient face forgery detection

C Kong, A Luo, P Bao, Y Yu, H Li, Z Zheng… - arXiv preprint arXiv …, 2024 - arxiv.org
Deepfakes have recently raised significant trust issues and security concerns among the
public. Compared to CNN face forgery detectors, ViT-based methods take advantage of the …
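
"Parameter-efficient" here typically means the ViT backbone stays frozen while small expert modules are learned; one common way to realize that is a gated mixture of LoRA-style low-rank adapters, sketched below. This is an assumption-laden illustration, not MoE-FFD's actual expert or gating design.

```python
import torch
import torch.nn as nn

class MoELoRALinear(nn.Module):
    def __init__(self, base: nn.Linear, num_experts: int = 4, rank: int = 8):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False                   # backbone weights stay frozen
        d_in, d_out = base.in_features, base.out_features
        self.down = nn.ModuleList(nn.Linear(d_in, rank, bias=False) for _ in range(num_experts))
        self.up = nn.ModuleList(nn.Linear(rank, d_out, bias=False) for _ in range(num_experts))
        self.gate = nn.Linear(d_in, num_experts)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        w = self.gate(x).softmax(dim=-1)              # (..., E) expert weights
        delta = torch.stack([u(d(x)) for d, u in zip(self.down, self.up)], dim=-1)
        return self.base(x) + (delta * w.unsqueeze(-2)).sum(-1)
```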

On least square estimation in softmax gating mixture of experts

H Nguyen, N Ho, A Rinaldo - arXiv preprint arXiv:2402.02952, 2024 - arxiv.org
The mixture of experts (MoE) model is a statistical machine learning design that aggregates
multiple expert networks through a softmax gating function to form a more intricate and …
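
For reference, the object under study can be written out: a softmax-gated MoE regression function and its least squares estimator over n samples. The expert form h_k and the notation below are generic placeholders, not necessarily the paper's exact parameterization.

```latex
\[
  f_G(x) \;=\; \sum_{k=1}^{K}
    \frac{\exp(\beta_k^{\top} x + b_k)}{\sum_{j=1}^{K} \exp(\beta_j^{\top} x + b_j)}
    \, h_k(x; \eta_k),
  \qquad
  \widehat{G}_n \;=\; \arg\min_{G} \sum_{i=1}^{n} \bigl( y_i - f_G(x_i) \bigr)^2 .
\]
```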