Commit 4c82bb8

Update index.html
1 parent f9a297f commit 4c82bb8

1 file changed (+1, -1 lines changed)

index.html

Lines changed: 1 addition & 1 deletion
@@ -463,7 +463,7 @@ <h1 class="title is-1 publication-title">Large Language Model Psychometrics: A S
 <h2 class="title is-3 has-text-centered">Abstract</h2>
 <div class="abstract content">
 <p>
-The survey will be released soon. Stay tuned!
+The rapid advancement of large language models (LLMs) has outpaced traditional evaluation methodologies. It presents novel challenges, such as measuring human-like psychological constructs, navigating beyond static and task-specific benchmarks, and establishing human-centered evaluation. These challenges intersect with Psychometrics, the science of quantifying the intangible aspects of human psychology, such as personality, values, and intelligence. This survey introduces and synthesizes an emerging interdisciplinary field of LLM Psychometrics, which leverages psychometric instruments, theories, and principles to evaluate, understand, and enhance LLMs. We systematically explore the role of Psychometrics in shaping benchmarking principles, broadening evaluation scopes, refining methodologies, validating results, and advancing LLM capabilities. This paper integrates diverse perspectives to provide a structured framework for researchers across disciplines, enabling a more comprehensive understanding of this nascent field. Ultimately, we aim to provide actionable insights for developing future evaluation paradigms that align with human-level AI and promote the advancement of human-centered AI systems for societal benefit. A curated repository of LLM psychometric resources is available at \href{https://github.com/valuebyte-ai/Awesome-LLM-Psychometrics}{https://github.com/valuebyte-ai/Awesome-LLM-Psychometrics}.
 </p>
 </div>
 </div>