A review of multimodal explainable artificial intelligence: Past, present and future
Artificial intelligence (AI) has rapidly developed through advancements in computational
power and the growth of massive datasets. However, this progress has also heightened …
power and the growth of massive datasets. However, this progress has also heightened …
Survey of different large language model architectures: Trends, benchmarks, and challenges
Large Language Models (LLMs) represent a class of deep learning models adept at
understanding natural language and generating coherent responses to various prompts or …
understanding natural language and generating coherent responses to various prompts or …
[HTML][HTML] Automating Systematic Literature Reviews with Retrieval-Augmented Generation: A Comprehensive Overview
This study examines Retrieval-Augmented Generation (RAG) in large language models
(LLMs) and their significant application for undertaking systematic literature reviews (SLRs) …
(LLMs) and their significant application for undertaking systematic literature reviews (SLRs) …
Mathscape: Evaluating mllms in multimodal math scenarios through a hierarchical benchmark
With the development of Multimodal Large Language Models (MLLMs), the evaluation of
multimodal models in the context of mathematical problems has become a valuable …
multimodal models in the context of mathematical problems has become a valuable …
The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective
The rapid development of large language models (LLMs) has been witnessed in recent
years. Based on the powerful LLMs, multi-modal LLMs (MLLMs) extend the modality from …
years. Based on the powerful LLMs, multi-modal LLMs (MLLMs) extend the modality from …
Mammoth-vl: Eliciting multimodal reasoning with instruction tuning at scale
Open-source multimodal large language models (MLLMs) have shown significant potential
in a broad range of multimodal tasks. However, their reasoning capabilities remain …
in a broad range of multimodal tasks. However, their reasoning capabilities remain …
Data-juicer sandbox: A comprehensive suite for multimodal data-model co-development
The emergence of large-scale multi-modal generative models has drastically advanced
artificial intelligence, introducing unprecedented levels of performance and functionality …
artificial intelligence, introducing unprecedented levels of performance and functionality …
Synthvlm: High-efficiency and high-quality synthetic data for vision language models
Recently, with the rise of web images, managing and understanding large-scale image
datasets has become increasingly important. Vision Large Language Models (VLLMs) have …
datasets has become increasingly important. Vision Large Language Models (VLLMs) have …
Keyvideollm: Towards large-scale video keyframe selection
Recently, with the rise of web videos, managing and understanding large-scale video
datasets has become increasingly important. Video Large Language Models (VideoLLMs) …
datasets has become increasingly important. Video Large Language Models (VideoLLMs) …
Synth-empathy: Towards high-quality synthetic empathy data
In recent years, with the rapid advancements in large language models (LLMs), achieving
excellent empathetic response capabilities has become a crucial prerequisite …
excellent empathetic response capabilities has become a crucial prerequisite …