- Academic Search

Articles

Scholar

2 résultats (0,01 s)

Mon profil Ma bibliothèque

Scholarchemqa: Unveiling the power of language models in chemical research question answering

Rechercher parmi les articles qui s'y rapportent

[Free GPT-4]

[PDF] arxiv.org

Justice or prejudice? quantifying biases in llm-as-a-judge

J Ye, Y Wang, Y Huang, D Chen, Q Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org

LLM-as-a-Judge has been widely utilized as an evaluation method in various benchmarks
and served as supervised rewards in model training. However, despite their excellence in …

Enregistrer Citer Cité 12 fois Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Agent Laboratory: Using LLM Agents as Research Assistants

S Schmidgall, Y Su, Z Wang, X Sun, J Wu, X Yu… - arxiv preprint arxiv …, 2025 - arxiv.org

Historically, scientific discovery has been a lengthy and costly process, demanding
substantial time and resources from initial conception to final results. To accelerate scientific …

Enregistrer Citer Cité 1 fois Autres articles Les 2 versions Free GPT-4 Version HTML

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

Scholarchemqa: Unveiling the power of language models in chemical research question answering

Justice or prejudice? quantifying biases in llm-as-a-judge

Agent Laboratory: Using LLM Agents as Research Assistants