bakaEC/OliveSensorAPI

ZeyuBa b977dc1644 add InternLM2_7B_chat_full eval

2024-03-03 22:50:54 +08:00

950 B

Raw Blame History

EmoLLM Evaluation

General Metrics Evaluation

For specific metrics and methods, see General_evaluation.md

Model	ROUGE-1	ROUGE-2	ROUGE-L	BLEU-1	BLEU-2	BLEU-3	BLEU-4
Qwen1_5-0_5B-Chat	27.23%	8.55%	17.05%	26.65%	13.11%	7.19%	4.05%
InternLM2_7B_chat	37.86%	15.23%	24.34%	39.71%	22.66%	14.26%	9.21%
InternLM2_7B_chat_full	32.45%	10.82%	20.17%	30.48%	15.67%	8.84%	5.02%

Professional Metrics Evaluation

For specific metrics and methods, see Professional_evaluation_EN.md

Metric	Value
Comprehensiveness	1.32
Professionalism	2.20
Authenticity	2.10
Safety	1.00