Math-llava: Bootstrap** mathematical reasoning for multimodal large language models
Large language models (LLMs) have demonstrated impressive reasoning capabilities,
particularly in textual mathematical problem-solving. However, existing open-source image …
particularly in textual mathematical problem-solving. However, existing open-source image …
ChartAdapter: Large Vision-Language Model for Chart Summarization
Chart summarization, which focuses on extracting key information from charts and
interpreting it in natural language, is crucial for generating and delivering insights through …
interpreting it in natural language, is crucial for generating and delivering insights through …
MPT: Multi-grained Prompt Tuning for Text-Video Retrieval
Recently, significant advancements have been made in supporting text-video retrieval by
transferring large-scale image-text pre-training models through model adaptation, ie, full fine …
transferring large-scale image-text pre-training models through model adaptation, ie, full fine …
MagicVFX: Visual Effects Synthesis in Just Minutes
Visual effects synthesis is crucial in the film and television industry, which aims at enhancing
raw footage with virtual elements for greater expressiveness. As the demand for detailed …
raw footage with virtual elements for greater expressiveness. As the demand for detailed …
Contextual Interaction via Primitive-based Adversarial Training For Compositional Zero-shot Learning
Compositional Zero-shot Learning (CZSL) aims to identify novel compositions via known
attribute-object pairs. The primary challenge in CZSL tasks lies in the significant …
attribute-object pairs. The primary challenge in CZSL tasks lies in the significant …
CognArtive: Large Language Models for Automating Art Analysis and Decoding Aesthetic Elements
A Khadangi, A Sartipi, I Tchappi, G Fridgen - arxiv preprint arxiv …, 2025 - arxiv.org
Art, as a universal language, can be interpreted in diverse ways, with artworks embodying
profound meanings and nuances. The advent of Large Language Models (LLMs) and the …
profound meanings and nuances. The advent of Large Language Models (LLMs) and the …