How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites

Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui… - Science China …, 2024 - Springer
In this paper, we introduce InternVL 1.5, an open-source multimodal large language model
(MLLM) to bridge the capability gap between open-source and proprietary commercial …

DeepSeek-VL2: Mixture-of-experts vision-language models for advanced multimodal understanding

Z Wu, X Chen, Z Pan, X Liu, W Liu, D Dai… - arXiv preprint arXiv …, 2024 - arxiv.org
We present DeepSeek-VL2, an advanced series of large Mixture-of-Experts (MoE) Vision-
Language Models that significantly improves upon its predecessor, DeepSeek-VL, through …

When LLMs meet cunning texts: A fallacy understanding benchmark for large language models

Y Li, Q Zhou, Y Luo, S Ma, Y Li, HT Zheng, X Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, Large Language Models (LLMs) make remarkable evolutions in language
understanding and generation. Following this, various benchmarks for measuring all kinds …

FreStega: A Plug-and-Play Method for Boosting Imperceptibility and Capacity in Generative Linguistic Steganography for Real-World Scenarios

K Pang - arXiv preprint arXiv:2412.19652, 2024 - arxiv.org
Linguistic steganography embeds secret information in seemingly innocent texts,
safeguarding privacy in surveillance environments. Generative linguistic steganography …

Benchmarking Chinese Knowledge Rectification in Large Language Models

T Lu, J Fang, Y Yao, X Xu, N Zhang, H Chen - arXiv preprint arXiv …, 2024 - arxiv.org
While Large Language Models (LLMs) exhibit remarkable generative capabilities, they are
not without flaws, particularly in the form of hallucinations. This issue is even more …

Educational-Psychological Dialogue Robot Based on Multi-agent Collaboration

S Ni, M Yang - International Conference on Social Robotics, 2024 - Springer
Intelligent dialogue systems are increasingly used in modern education and psychological
counseling fields, but most existing systems are limited to a single domain, cannot deal with …

Research on Tibetan Tourism Viewpoints information generation system based on LLM

J Qi, S Yan, W Zhang, Y Zhang, Z Liu… - … and Wireless Optical …, 2024 - ieeexplore.ieee.org
Tibet, ensconced within China's territorial expanse, is distinguished by its labyrinthine and
heterogeneous topography, a testament to its profound historical heritage, and the cradle of …

SDD-LawLLM: Advancing Intelligent Legal Systems Through Synthetic Data-Driven Fine-Tuning of Large Language Models

H Ma, Y Lu, Z Xiao, J Feng, H Zhang, J Yu - Electronics, 2025 - mdpi.com
The extensive use of large language models (LLMs) across various natural language
processing tasks has markedly elevated the intelligence of legal systems. Despite their …

I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm

Y Liang, G Zhang, X Qu, T Zheng, J Guo, X Du… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) have achieved significant advancements, however, the
common learning paradigm treats LLMs as passive information repositories, neglecting their …

Pantip Multi-turn Datasets Generating from Thai Large Social Platform Forum Using Sentence Similarity Techniques

A Sae-Oueng, K Kerdthaisong… - … Joint Symposium on …, 2024 - ieeexplore.ieee.org
Fine-tuning Large Language Models (LLMs) for specific domains is crucial. However, lack of
Thai open dialogues presents a major challenge. For the major challenge, this study …