EmoLLM Evaluation

General Metrics Evaluation

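  • For the specific metrics and methods, see General evaluation.md

| Model | ROUGE-1 | ROUGE-2 | ROUGE-L | BLEU-1 | BLEU-2 | BLEU-3 | BLEU-4 |
|---|---|---|---|---|---|---|---|
| Qwen1_5-0_5B-Chat | 27.23% | 8.55% | 17.05% | 26.65% | 13.11% | 7.19% | 4.05% |
| InternLM2_7B_chat | 37.86% | 15.23% | 24.34% | 39.71% | 22.66% | 14.26% | 9.21% |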
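For readers unfamiliar with the columns above, the following is a minimal sketch of how ROUGE-N, ROUGE-L, and cumulative BLEU-n scores are commonly computed for a single reference/candidate pair. It is not taken from this repository's metric.py, which may use different libraries, tokenization, smoothing, or averaging. The function names (`ngrams`, `rouge_n_f1`, `rouge_l_f1`, `bleu_n`) are hypothetical, and the inputs are assumed to be already tokenized lists; Chinese text would first need word segmentation or a character-level split.

```python
from collections import Counter
import math


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_f1(reference, candidate, n=1):
    """ROUGE-N as the F1 of n-gram recall and precision (single reference)."""
    ref, cand = ngrams(reference, n), ngrams(candidate, n)
    overlap = sum((ref & cand).values())  # clipped n-gram matches
    if overlap == 0:
        return 0.0
    recall = overlap / sum(ref.values())
    precision = overlap / sum(cand.values())
    return 2 * precision * recall / (precision + recall)


def lcs_length(a, b):
    """Length of the longest common subsequence, by dynamic programming."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(reference, candidate):
    """ROUGE-L as the F1 of LCS-based recall and precision."""
    lcs = lcs_length(reference, candidate)
    if lcs == 0:
        return 0.0
    recall = lcs / len(reference)
    precision = lcs / len(candidate)
    return 2 * precision * recall / (precision + recall)


def bleu_n(reference, candidate, max_n=1):
    """Cumulative BLEU up to max_n: uniform weights, one reference, no smoothing."""
    log_precisions = []
    for n in range(1, max_n + 1):
        ref, cand = ngrams(reference, n), ngrams(candidate, n)
        total = sum(cand.values())
        clipped = sum((ref & cand).values())
        if total == 0 or clipped == 0:
            return 0.0  # without smoothing, one empty precision zeroes the score
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty: penalize candidates shorter than the reference.
    c, r = len(candidate), len(reference)
    bp = 1.0 if c > r else math.exp(1 - r / c)
    return bp * math.exp(sum(log_precisions) / max_n)


# Illustrative sentence pair (not from the evaluation data).
reference = "the cat sat on the mat".split()
candidate = "the cat is on the mat".split()
print(f"ROUGE-1 {rouge_n_f1(reference, candidate, 1):.2%}")
print(f"ROUGE-2 {rouge_n_f1(reference, candidate, 2):.2%}")
print(f"ROUGE-L {rouge_l_f1(reference, candidate):.2%}")
print(f"BLEU-1  {bleu_n(reference, candidate, 1):.2%}")
print(f"BLEU-2  {bleu_n(reference, candidate, 2):.2%}")
```

A corpus-level figure such as those in the table would typically be obtained by aggregating per-sample scores over the whole test set; how this repository aggregates them is described in General evaluation.md, not here.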

Professional Metrics Evaluation

  • For the specific metrics and methods, see Professional evaluation.md

| Metric | Value |
|---|---|
| Comprehensiveness | 1.32 |
| Professionalism | 2.20 |
| Authenticity | 2.10 |
| Safety | 1.00 |