From cb656301d3f72d9d5b285fa3323057e61392c329 Mon Sep 17 00:00:00 2001
From: MING_X <119648793+MING-ZCH@users.noreply.github.com>
Date: Sun, 3 Mar 2024 23:34:14 +0800
Subject: [PATCH 1/2] Update README.md

---
 evaluate/README.md | 14 ++++++--------
 1 file changed, 6 insertions(+), 8 deletions(-)

diff --git a/evaluate/README.md b/evaluate/README.md
index 260f3f0..a58d903 100644
--- a/evaluate/README.md
+++ b/evaluate/README.md
@@ -2,7 +2,7 @@
 
 ## General Metrics Evaluation
 
-* For the specific metrics and methods, see see [General_evaluation.md](./General_evaluation.md)
+* For the specific evaluation metrics and evaluation methods, see [General_evaluation.md](./General_evaluation.md)
 
 | Model | ROUGE-1 | ROUGE-2 | ROUGE-L | BLEU-1 | BLEU-2 | BLEU-3 | BLEU-4 |
 |----------|---------|---------|---------|---------|---------|---------|---------|
@@ -11,11 +11,9 @@
 | InternLM2_7B_chat_full | 32.45% | 10.82% | 20.17% | 30.48% | 15.67% | 8.84% | 5.02% |
 
 ## Professional Metrics Evaluation
-* For the specific metrics and methods, see [Professional_evaluation.md](./Professional_evaluation.md)
+* For the specific evaluation metrics and evaluation methods, see [Professional_evaluation.md](./Professional_evaluation.md)
+
+| Model | Comprehensiveness | Professionalism | Authenticity | Safety |
+|-------------------|-----------------------|-------------------|-----------------|---------|
+| InternLM2_7B_chat_qlora | 1.32 | 2.20 | 2.10 | 1.00 |
 
-| Metric | Value |
-|-------------------|------------|
-| Comprehensiveness | 1.32 |
-| Professionalism | 2.20 |
-| Authenticity | 2.10 |
-| Safety | 1.00 |

From b1a42493509339861455fb3d93b3ea205670d5d2 Mon Sep 17 00:00:00 2001
From: MING_X <119648793+MING-ZCH@users.noreply.github.com>
Date: Sun, 3 Mar 2024 23:35:31 +0800
Subject: [PATCH 2/2] Update Professional_evaluation.md

---
 evaluate/Professional_evaluation.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/evaluate/Professional_evaluation.md b/evaluate/Professional_evaluation.md
index 2cc3fd0..9562f0c 100644
--- a/evaluate/Professional_evaluation.md
+++ b/evaluate/Professional_evaluation.md
@@ -14,7 +14,7 @@
 
 ## Evaluation Results
 
-Evaluated model: [EmoLLM](https://openxlab.org.cn/models/detail/jujimeizuo/EmoLLM_Model) (InternLM2-7B-chat + qlora), scores:
+Evaluated model: [EmoLLM](https://openxlab.org.cn/models/detail/jujimeizuo/EmoLLM_Model) (InternLM2_7B_chat_qlora), scores:
 | Metric | Value |
 |-------------------|------------|
 | Comprehensiveness | 1.32 |