Are we there yet? revealing the risks of utilizing large language models in scholarly peer review
Scholarly peer review is a cornerstone of scientific advancement, but the system is under
strain due to increasing manuscript submissions and the labor-intensive nature of the …
strain due to increasing manuscript submissions and the labor-intensive nature of the …
Multi-modal and multi-agent systems meet rationality: A survey
Rationality is characterized by logical thinking and decision-making that align with evidence
and logical rules. This quality is essential for effective problem-solving, as it ensures that …
and logical rules. This quality is essential for effective problem-solving, as it ensures that …
Social Science Meets LLMs: How Reliable Are Large Language Models in Social Simulations?
Large Language Models (LLMs) are increasingly employed for simulations, enabling
applications in role-playing agents and Computational Social Science (CSS). However, the …
applications in role-playing agents and Computational Social Science (CSS). However, the …
Proteingpt: Multimodal llm for protein property prediction and structure understanding
Understanding biological processes, drug development, and biotechnological
advancements requires detailed analysis of protein structures and sequences, a task in …
advancements requires detailed analysis of protein structures and sequences, a task in …
Piecing It All Together: Verifying Multi-Hop Multimodal Claims
Existing claim verification datasets often do not require systems to perform complex
reasoning or effectively interpret multimodal evidence. To address this, we introduce a new …
reasoning or effectively interpret multimodal evidence. To address this, we introduce a new …
From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents
Traditional sociological research often relies on human participation, which, though
effective, is expensive, challenging to scale, and with ethical concerns. Recent …
effective, is expensive, challenging to scale, and with ethical concerns. Recent …
Scito2M: A 2 Million, 30-Year Cross-disciplinary Dataset for Temporal Scientometric Analysis
Understanding the creation, evolution, and dissemination of scientific knowledge is crucial
for bridging diverse subject areas and addressing complex global challenges such as …
for bridging diverse subject areas and addressing complex global challenges such as …
Generative Pre-trained Ranking Model with Over-parameterization at Web-Scale
Learning to rank (LTR) is widely employed in web searches to prioritize pertinent webpages
from retrieved content based on input queries. However, traditional LTR models encounter …
from retrieved content based on input queries. However, traditional LTR models encounter …
SEAGraph: Unveiling the Whole Story of Paper Review Comments
Peer review, as a cornerstone of scientific research, ensures the integrity and quality of
scholarly work by providing authors with objective feedback for refinement. However, in the …
scholarly work by providing authors with objective feedback for refinement. However, in the …
What Limits LLM-based Human Simulation: LLMs or Our Design?
We argue that advancing LLM-based human simulation requires addressing both LLM's
inherent limitations and simulation framework design challenges. Recent studies have …
inherent limitations and simulation framework design challenges. Recent studies have …