model card update

2025-01-21 10:01:05 +00:00 · 2022-05-29 18:43:37 +02:00 · 2022-05-29 18:43:37 +02:00 · 0680e0197a
parent e77a4cd147
commit 0680e0197a
1 changed files with 5 additions and 0 deletions
--- a/README.md
+++ b/README.md
@ -101,6 +101,11 @@ Do not deploy without appropriate measures.

 ## Evaluation results

+### Language Model Evaluation Harness
+
+The following table compares GPT-J 6B to GPT-4chan on a subset of the [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness).
+Differences exceeding standard errors are marked in the "Significant" column with a minus sign (-) indicating an advantage for GPT-J 6B and a plus sign (+) indicating an advantage for GPT-4chan.
+
 <figure>

 | Task                                                      | Metric          |     GPT-J-6B |       stderr |    GPT-4chan |       stderr | Significant   |