OliveSensorAPI/evaluate/README.md
2024-03-03 23:34:14 +08:00

991 B

EmoLLM评测

通用指标评测

Model ROUGE-1 ROUGE-2 ROUGE-L BLEU-1 BLEU-2 BLEU-3 BLEU-4
Qwen1_5-0_5B-Chat 27.23% 8.55% 17.05% 26.65% 13.11% 7.19% 4.05%
InternLM2_7B_chat_qlora 37.86% 15.23% 24.34% 39.71% 22.66% 14.26% 9.21%
InternLM2_7B_chat_full 32.45% 10.82% 20.17% 30.48% 15.67% 8.84% 5.02%

专业指标评测

Model Comprehensiveness rofessionalism Authenticity Safety
InternLM2_7B_chat_qlora 1.32 2.20 2.10 1.00