.. | ||
train_dir | ||
eval.py | ||
General evaluation.md | ||
metric.py | ||
Professional evaluation.md | ||
qwen_generation_utils.py | ||
README.md |
EmoLLM评测
通用指标评测
- 具体指标、方法见 General evaluation.md
Metric | Value |
---|---|
ROUGE-1 | 27.23% |
ROUGE-2 | 8.55% |
ROUGE-L | 17.05% |
BLEU-1 | 26.65% |
BLEU-2 | 13.11% |
BLEU-3 | 7.19% |
BLEU-4 | 4.05% |
专业指标评测
- 具体指标、方法见 Professional evaluation.md
Metric | Value |
---|---|
Comprehensiveness | 1.32 |
Professionalism | 2.20 |
Authenticity | 2.10 |
Safety | 1.00 |