# EmoLLM Evaluation

## General Metrics Evaluation

* For specific metrics and methods, see [General_evaluation_EN.md](./General_evaluation_EN.md)

| Model                  | ROUGE-1 | ROUGE-2 | ROUGE-L | BLEU-1 | BLEU-2 | BLEU-3 | BLEU-4 |
|------------------------|---------|---------|---------|--------|--------|--------|--------|
| Qwen1_5-0_5B-Chat      | 27.23%  | 8.55%   | 17.05%  | 26.65% | 13.11% | 7.19%  | 4.05%  |
| InternLM2_7B_chat      | 37.86%  | 15.23%  | 24.34%  | 39.71% | 22.66% | 14.26% | 9.21%  |
| InternLM2_7B_chat_full | 32.45%  | 10.82%  | 20.17%  | 30.48% | 15.67% | 8.84%  | 5.02%  |
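
For readers unfamiliar with these scores, the sketch below illustrates the n-gram overlap idea behind ROUGE-N and BLEU-N on pre-tokenized text. It is a minimal illustration and not necessarily the procedure used to produce the table: the function names `rouge_n_f1` and `clipped_precision` are hypothetical, reporting ROUGE as an F1 score is an assumption, and the BLEU part omits the brevity penalty and the averaging across n-gram orders. The linked document defines the actual tokenization and scoring.

```python
from collections import Counter


def ngram_counts(tokens, n):
    """Count the n-grams in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_f1(candidate, reference, n=1):
    """ROUGE-N as the F1 of clipped n-gram overlap (assumed F1 variant)."""
    cand = ngram_counts(candidate, n)
    ref = ngram_counts(reference, n)
    overlap = sum((cand & ref).values())  # per-n-gram minimum count
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def clipped_precision(candidate, reference, n=1):
    """Modified n-gram precision, the core term of BLEU-N (no brevity penalty)."""
    cand = ngram_counts(candidate, n)
    ref = ngram_counts(reference, n)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    return sum((cand & ref).values()) / total


# Hypothetical example sentences, split on whitespace for simplicity.
candidate = "the cat sat on the mat".split()
reference = "the cat is on the mat".split()

print(f"ROUGE-1 F1: {rouge_n_f1(candidate, reference, 1):.2%}")
print(f"ROUGE-2 F1: {rouge_n_f1(candidate, reference, 2):.2%}")
print(f"BLEU-1 precision: {clipped_precision(candidate, reference, 1):.2%}")
```

Higher values mean the generated reply shares more n-grams with the reference reply; they measure surface overlap only, not the quality of the counseling itself.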

## Professional Metrics Evaluation

* For specific metrics and methods, see [Professional_evaluation_EN.md](./Professional_evaluation_EN.md)

|       Metric      |    Value   |
|-------------------|------------|
| Comprehensiveness | 1.32       |
| Professionalism   | 2.20       |
| Authenticity      | 2.10       |
| Safety            | 1.00       |