Mm-soc: Benchmarking multimodal large language models in social media platforms

Y **, M Choi, G Verma, J Wang, S Kumar - arxiv preprint arxiv …, 2024 - arxiv.org
Social media platforms are hubs for multimodal information exchange, encompassing text,
images, and videos, making it challenging for machines to comprehend the information or …

Agentreview: Exploring peer review dynamics with llm agents

Y **, Q Zhao, Y Wang, H Chen, K Zhu, Y **ao… - arxiv preprint arxiv …, 2024 - arxiv.org
Peer review is fundamental to the integrity and advancement of scientific publication.
Traditional methods of peer review analyses often rely on exploration and statistics of …

PrivacyMind: large language models can be contextual privacy protection learners

Y **ao, Y **, Y Bai, Y Wu, X Yang, X Luo, W Yu… - arxiv preprint arxiv …, 2023 - arxiv.org
The proliferation of Large Language Models (LLMs) has driven considerable interest in fine-
tuning them with domain-specific data to create specialized language models. Nevertheless …

Deliberate reasoning for llms as structure-aware planning with accurate world model

S **ong, A Payani, Y Yang, F Fekri - arxiv preprint arxiv:2410.03136, 2024 - arxiv.org
Enhancing the reasoning capabilities of large language models (LLMs) remains a key
challenge, especially for tasks that require complex, multi-step decision-making. Humans …

Shortcut Learning in In-Context Learning: A Survey

R Song, Y Li, L Shi, F Giunchiglia, H Xu - arxiv preprint arxiv:2411.02018, 2024 - arxiv.org
Shortcut learning refers to the phenomenon where models employ simple, non-robust
decision rules in practical tasks, which hinders their generalization and robustness. With the …

CMQCIC-Bench: A Chinese Benchmark for Evaluating Large Language Models in Medical Quality Control Indicator Calculation

G Yu, Y Li, Z Jiang, Y **, L Dai, Y Lin, R Hou… - arxiv preprint arxiv …, 2025 - arxiv.org
Medical quality control indicators are essential to assess the qualifications of healthcare
institutions for medical services. With the impressive performance of large language models …

TableMaster: A Recipe to Advance Table Understanding with Language Models

L Cao - arxiv preprint arxiv:2501.19378, 2025 - arxiv.org
Tables serve as a fundamental format for representing structured relational data. While
current language models (LMs) excel at many text-based tasks, they still face challenges in …

[TRÍCH DẪN][C] Unraveling and Overcoming Challenges in Machine Learning: Generalizability, Adaptability, and Multifacetedness

H Cho