A survey on evaluation of large language models

Y Chang, X Wang, J Wang, Y Wu, L Yang… - ACM transactions on …, 2024 - dl.acm.org
Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …

Fostering and measuring skills: Improving cognitive and non-cognitive skills to promote lifetime success

This paper reviews the recent literature on measuring and boosting cognitive and
noncognitive skills. The literature establishes that achievement tests do not adequately …

[PDF][PDF] Can large language models transform computational social science?

C Ziems, W Held, O Shaikh, J Chen, Z Zhang… - Computational …, 2024 - direct.mit.edu
Large language models (LLMs) are capable of successfully performing many language
processing tasks zero-shot (without training data). If zero-shot LLMs can also reliably classify …

Awe, the diminished self, and collective engagement: Universals and cultural variations in the small self.

Y Bai, LA Maruskin, S Chen, AM Gordon… - Journal of personality …, 2017 - psycnet.apa.org
Awe has been theorized as a collective emotion, one that enables individuals to integrate
into social collectives. In kee** with this theorizing, we propose that awe diminishes the …

Neither Eastern nor Western: Patterns of independence and interdependence in Mediterranean societies.

AK Uskul, A Kirchner-Häusler, VL Vignoles… - Journal of Personality …, 2023 - psycnet.apa.org
Social science research has highlighted “honor” as a central value driving social behavior in
Mediterranean societies, which requires individuals to develop and protect a sense of their …

Large-scale psychological differences within China explained by rice versus wheat agriculture

T Talhelm, X Zhang, S Oishi, C Shimin, D Duan, X Lan… - Science, 2014 - science.org
Cross-cultural psychologists have mostly contrasted East Asia with the West. However, this
study shows that there are major psychological differences within China. We propose that a …

Historically rice-farming societies have tighter social norms in China and worldwide

T Talhelm, AS English - Proceedings of the National Academy of Sciences, 2020 - pnas.org
Data recently published in PNAS mapped out regional differences in the tightness of social
norms across China [RYJ Chua, KG Huang, M. **, Proc. Natl. Acad. Sci. USA 116, 6720 …

Fostering and measuring skills: Interventions that improve character and cognition

JJ Heckman, T Kautz - 2013 - nber.org
This paper reviews the recent literature on measuring and boosting cognitive and
noncognitive skills. The literature establishes that achievement tests do not adequately …

Promise and paradox: Measuring students' non-cognitive skills and the impact of schooling

MR West, MA Kraft, AS Finn, RE Martin… - … and policy analysis, 2016 - journals.sagepub.com
We used self-report surveys to gather information on a broad set of non-cognitive skills from
1,368 eighth graders. At the student level, scales measuring conscientiousness, self-control …

The weirdest people in the world?

J Henrich, SJ Heine, A Norenzayan - Behavioral and brain sciences, 2010 - cambridge.org
Behavioral scientists routinely publish broad claims about human psychology and behavior
in the world's top journals based on samples drawn entirely from Western, Educated …