Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
gmftbyGMFTBY authored Feb 22, 2024
1 parent e11c331 commit c98a167
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -54,7 +54,10 @@ mkdir data
cd data
git clone https://huggingface.co/datasets/opencompass/CriticBench
```
which get into the `data` folder and clone the CriticBench dataset
which get into the `data` folder and clone the CriticBench dataset.
Note that the human-annotated Likert scores, preference labels, and critiques in `test` set are excluded.
You can submit your inference results on the `test` set (via run codes under `inference` folder) to this [email]([email protected]). We will run your predictions and update the results in our leaderboard. Please also provide the scale of your tested model.
The structure of your submission should be similar to that in `example_data`.

#### 1.2 Prepare Code and Env

Expand Down Expand Up @@ -197,9 +200,6 @@ The Objective evaluation results of some representation LLMs are shown:

Refer to our [Project Page](https://open-compass.github.io/CriticBench/) for the complete evaluation results on <b>CriticBench</b>.

## Submit Your Results

You can submit your inference results (via run codes under `inference` folder) to this [email]([email protected]). We will run your predictions and update the results in our leaderboard. Please also provide the scale of your tested model. A sample structure of your submission should be similar to that in `example_data`.

## Acknowledgements

Expand Down

0 comments on commit c98a167

Please sign in to comment.