| .. | ||
| train_dir | ||
| eval.py | ||
| General evaluation.md | ||
| metric.py | ||
| Professional evaluation.md | ||
| qwen_generation_utils.py | ||
| README.md | ||
EmoLLM评测
通用指标评测
- 具体指标、方法见 General evaluation.md
| Metric | Value |
|---|---|
| ROUGE-1 | 27.23% |
| ROUGE-2 | 8.55% |
| ROUGE-L | 17.05% |
| BLEU-1 | 26.65% |
| BLEU-2 | 13.11% |
| BLEU-3 | 7.19% |
| BLEU-4 | 4.05% |
专业指标评测
- 具体指标、方法见 Professional evaluation.md
| Metric | Value |
|---|---|
| Comprehensiveness | 1.32 |
| Professionalism | 2.20 |
| Authenticity | 2.10 |
| Safety | 1.00 |