From 0680e0197abbdd79dd133e905a2c78faee87c953 Mon Sep 17 00:00:00 2001 From: Yannic Kilcher Date: Sun, 29 May 2022 18:43:37 +0200 Subject: [PATCH] model card update --- README.md | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/README.md b/README.md index 14ce4e8..8943f2a 100644 --- a/README.md +++ b/README.md @@ -101,6 +101,11 @@ Do not deploy without appropriate measures. ## Evaluation results +### Language Model Evaluation Harness + +The following table compares GPT-J 6B to GPT-4chan on a subset of the [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness). +Differences exceeding standard errors are marked in the "Significant" column with a minus sign (-) indicating an advantage for GPT-J 6B and a plus sign (+) indicating an advantage for GPT-4chan. +
| Task | Metric | GPT-J-6B | stderr | GPT-4chan | stderr | Significant |