diff --git a/README.md b/README.md
index 96f8f59..21e3523 100644
--- a/README.md
+++ b/README.md
@@ -1,6 +1,6 @@
-Xiezhi is a comprehensive evaluation suite for Language Models (LMs). It consists of 249587 multi-choice questions spanning 516 diverse disciplines and four difficulty levels, as shown below. Please check our [paper](https://arxiv.org/abs/2305.08322) for more details, and our **website** will be open later on.
+Xiezhi is a comprehensive evaluation suite for Language Models (LMs). It consists of 249587 multi-choice questions spanning 516 diverse disciplines and four difficulty levels, as shown below. Please check our [paper](https://arxiv.org/abs/2306.05783) for more details, and our **website** will be open later on.
We hope Xiezhi could help developers track the progress and analyze the important strengths/shortcomings of their LMs.