Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts

H Nguyen, P Akbarian, T Pham, T Nguyen… - arXiv preprint arXiv …, 2024 - arxiv.org
The cosine router in sparse Mixture of Experts (MoE) has recently emerged as an attractive
alternative to the conventional linear router. Indeed, the cosine router demonstrates …

Understanding expert structures on minimax parameter estimation in contaminated mixture of experts

F Yan, H Nguyen, D Le, P Akbarian, N Ho - arXiv preprint arXiv …, 2024 - arxiv.org
We conduct a convergence analysis of parameter estimation in the contaminated mixture
of experts. This model is motivated by the prompt learning problem, where one utilizes …

A general theory for softmax gating multinomial logistic mixture of experts

H Nguyen, P Akbarian, TT Nguyen, N Ho - arXiv preprint arXiv:2310.14188, 2023 - arxiv.org
The mixture-of-experts (MoE) model incorporates the power of multiple submodels via gating
functions to achieve greater performance in numerous regression and classification …

DGPO: discovering multiple strategies with diversity-guided policy optimization

W Chen, S Huang, Y Chiang, T Pearce… - Proceedings of the …, 2024 - ojs.aaai.org
Most reinforcement learning algorithms seek a single optimal strategy that solves a given
task. However, it can often be valuable to learn a diverse set of solutions, for instance, to …