Merge pull request #61 from MING-ZCH/main
[DOC] Update EmoLLM V2.0’s evaluation details and fix some bugs in docs
commit 77e3531169
@ -64,8 +64,9 @@
|
|||||||
- 评估和诊断工具:为了有效促进心理健康,需要有科学的工具来评估个体的心理状态,以及诊断可能存在的心理问题。
|
- 评估和诊断工具:为了有效促进心理健康,需要有科学的工具来评估个体的心理状态,以及诊断可能存在的心理问题。
|
||||||
|
|
||||||
### 最近更新
|
### 最近更新
|
||||||
|
- 【2024.3.11】 **EmoLLM V2.0 相比 EmoLLM V1.0 全面提升,已超越 Role-playing ChatGPT 在心理咨询任务上的能力!**
|
||||||
- 【2024.3.9】 新增并发功能加速 QA 对生成
|
- 【2024.3.9】 新增并发功能加速 QA 对生成
|
||||||
- 【2024.3.3】 [基于InternLM2-7B-chat全量微调版本开源](https://openxlab.org.cn/models/detail/ajupyter/EmoLLM_internlm2_7b_full),需要两块A100*80G,更新专业评估,详见[evaluate](./evaluate/),更新基于PaddleOCR的PDF转txt工具脚本,详见[scripts](./scripts/)
|
- 【2024.3.3】 [基于InternLM2-7B-chat全量微调版本EmoLLM V2.0开源](https://openxlab.org.cn/models/detail/ajupyter/EmoLLM_internlm2_7b_full),需要两块A100*80G,更新专业评估,详见[evaluate](./evaluate/),更新基于PaddleOCR的PDF转txt工具脚本,详见[scripts](./scripts/)
|
||||||
- 【2024.2.29】更新客观评估计算,详见[evaluate](./evaluate/),更新一系列数据集,详见[datasets](./datasets/)。
|
- 【2024.2.29】更新客观评估计算,详见[evaluate](./evaluate/),更新一系列数据集,详见[datasets](./datasets/)。
|
||||||
- 【2024.2.27】更新英文readme和一系列数据集(舔狗和单轮对话)
|
- 【2024.2.27】更新英文readme和一系列数据集(舔狗和单轮对话)
|
||||||
- 【2024.2.23】推出基于InternLM2_7B_chat_qlora的 `温柔御姐心理医生艾薇`,[点击获取模型权重](https://openxlab.org.cn/models/detail/ajupyter/EmoLLM_aiwei),[配置文件](xtuner_config/aiwei-internlm2_chat_7b_qlora.py),[在线体验链接](https://openxlab.org.cn/apps/detail/ajupyter/EmoLLM-aiwei)
|
- 【2024.2.23】推出基于InternLM2_7B_chat_qlora的 `温柔御姐心理医生艾薇`,[点击获取模型权重](https://openxlab.org.cn/models/detail/ajupyter/EmoLLM_aiwei),[配置文件](xtuner_config/aiwei-internlm2_chat_7b_qlora.py),[在线体验链接](https://openxlab.org.cn/apps/detail/ajupyter/EmoLLM-aiwei)
|
||||||
@ -89,7 +90,7 @@
|
|||||||
|
|
||||||
- 【2024.2.3】 [项目宣传视频](https://www.bilibili.com/video/BV1N7421N76X/)完成 😊
|
- 【2024.2.3】 [项目宣传视频](https://www.bilibili.com/video/BV1N7421N76X/)完成 😊
|
||||||
- 【2024.1.27】 完善数据构建文档、微调指南、部署指南、Readme等相关文档 👏
|
- 【2024.1.27】 完善数据构建文档、微调指南、部署指南、Readme等相关文档 👏
|
||||||
- 【2024.1.25】 完成EmoLLM第一版并部署上线 https://openxlab.org.cn/apps/detail/jujimeizuo/EmoLLM 😀
|
- 【2024.1.25】 EmoLLM V1.0 已部署上线 https://openxlab.org.cn/apps/detail/jujimeizuo/EmoLLM 😀
|
||||||
|
|
||||||
</details>
|
</details>
|
||||||
|
|
||||||
|
33
README_EN.md
@ -1,4 +1,4 @@
|
|||||||
# EmoLLM - Large Languge Model for Mental Health
|
# EmoLLM - Large Language Model for Mental Health
|
||||||
|
|
||||||
<!-- PROJECT SHIELDS -->
|
<!-- PROJECT SHIELDS -->
|
||||||
[![Contributors][contributors-shield]][contributors-url]
|
[![Contributors][contributors-shield]][contributors-url]
|
||||||
@ -35,15 +35,15 @@
|
|||||||
<!-- 本篇README.md面向开发者 -->
|
<!-- 本篇README.md面向开发者 -->
|
||||||
|
|
||||||
|
|
||||||
**EmoLLM** is a series of large language models designed to understand, support and help customers in mental health counseling. It is fine-tuned from the LLM instructions. We really appreciate it if you can give it a star~⭐⭐. The open-sourced configuration is as follows:
|
**EmoLLM** is a series of large language models designed to understand, support and help customers in mental health counseling. It is fine-tuned from the LLM instructions. We really appreciate it if you could give it a star~⭐⭐. The open-sourced configuration is as follows:
|
||||||
|
|
||||||
| model | type |
|
| model | type |
|
||||||
| :-------------------: | :------: |
|
| :-------------------: | :------: |
|
||||||
| InternLM2_7B_chat | qlora |
|
| InternLM2_7B_chat | qlora |
|
||||||
| InternLM2_7B_chat | full finetuning |
|
| InternLM2_7B_chat | full fine-tuning |
|
||||||
| InternLM2_1_8B_chat | full finetuning |
|
| InternLM2_1_8B_chat | full fine-tuning |
|
||||||
| Qwen_7b_chat | qlora |
|
| Qwen_7b_chat | qlora |
|
||||||
| Qwen1_5-0_5B-Chat | full finetuning |
|
| Qwen1_5-0_5B-Chat | full fine-tuning |
|
||||||
| Baichuan2_13B_chat | qlora |
|
| Baichuan2_13B_chat | qlora |
|
||||||
| ChatGLM3_6B | lora |
|
| ChatGLM3_6B | lora |
|
||||||
| DeepSeek MoE_16B_chat | qlora |
|
| DeepSeek MoE_16B_chat | qlora |
|
||||||
@ -52,7 +52,7 @@
|
|||||||
Everyone is welcome to contribute to this project ~
|
Everyone is welcome to contribute to this project ~
|
||||||
---
|
---
|
||||||
|
|
||||||
The Model is aimed at fully understanding and promoting the mental health of individuals, groups, and society. This model typically includes the following key components:
|
The Model aims to fully understand and promote the mental health of individuals, groups, and society. This model typically includes the following key components:
|
||||||
|
|
||||||
- Cognitive factors: Involving an individual's thought patterns, belief systems, cognitive biases, and problem-solving abilities. Cognitive factors significantly impact mental health as they affect how individuals interpret and respond to life events.
|
- Cognitive factors: Involving an individual's thought patterns, belief systems, cognitive biases, and problem-solving abilities. Cognitive factors significantly impact mental health as they affect how individuals interpret and respond to life events.
|
||||||
- Emotional factors: Including emotion regulation, emotional expression, and emotional experiences. Emotional health is a crucial part of mental health, involving how individuals manage and express their emotions and how they recover from negative emotions.
|
- Emotional factors: Including emotion regulation, emotional expression, and emotional experiences. Emotional health is a crucial part of mental health, involving how individuals manage and express their emotions and how they recover from negative emotions.
|
||||||
@ -63,8 +63,9 @@ The Model is aimed at fully understanding and promoting the mental health of ind
|
|||||||
- Prevention and intervention measures: The Mental Health Grand Model also includes strategies for preventing psychological issues and promoting mental health, such as psychological education, counseling, therapy, and social support systems.
|
- Prevention and intervention measures: The Mental Health Grand Model also includes strategies for preventing psychological issues and promoting mental health, such as psychological education, counseling, therapy, and social support systems.
|
||||||
- Assessment and diagnostic tools: Effective promotion of mental health requires scientific tools to assess individuals' psychological states and diagnose potential psychological issues.
|
- Assessment and diagnostic tools: Effective promotion of mental health requires scientific tools to assess individuals' psychological states and diagnose potential psychological issues.
|
||||||
### Recent Updates
|
### Recent Updates
|
||||||
|
- 【2024.3.11】 **EmoLLM V2.0 is greatly improved in all scores compared to EmoLLM V1.0. Surpasses the performance of Role-playing ChatGPT on counseling tasks!**
|
||||||
- 【2024.3.9】 New concurrency feature speeds up QA pair generation
|
- 【2024.3.9】 New concurrency feature speeds up QA pair generation
|
||||||
- 【2024.3.3】 [Based on InternLM2-7B-chat full amount of fine-tuned version of open source](https://openxlab.org.cn/models/detail/ajupyter/EmoLLM_internlm2_7b_full), need two A100*80G, update professional evaluation, see [evaluate](./evaluate/), update PaddleOCR-based PDF to txt tool scripts, see [scripts](./scripts/).
|
- 【2024.3.3】 [Based on InternLM2-7B-chat full fine-tuned version EmoLLM V2.0 open sourced](https://openxlab.org.cn/models/detail/ajupyter/EmoLLM_internlm2_7b_full), need two A100*80G, update professional evaluation, see [evaluate](./evaluate/), update PaddleOCR-based PDF to txt tool scripts, see [scripts](./scripts/).
|
||||||
- 【2024.2.29】 Updated objective assessment calculations, see [evaluate](./evaluate/) for details. A series of datasets have also been updated, see [datasets](./datasets/) for details.
|
- 【2024.2.29】 Updated objective assessment calculations, see [evaluate](./evaluate/) for details. A series of datasets have also been updated, see [datasets](./datasets/) for details.
|
||||||
- 【2024.2.27】 Updated English README and a series of datasets (licking dogs and one-round dialogue)
|
- 【2024.2.27】 Updated English README and a series of datasets (licking dogs and one-round dialogue)
|
||||||
- 【2024.2.23】The "Gentle Lady Psychologist Ai Wei" based on InternLM2_7B_chat_qlora was launched. [Click here to obtain the model weights](https://openxlab.org.cn/models/detail/ajupyter/EmoLLM_aiwei), [configuration file](xtuner_config/aiwei-internlm2_chat_7b_qlora.py), [online experience link](https://openxlab.org.cn/apps/detail/ajupyter/EmoLLM-aiwei)
|
- 【2024.2.23】The "Gentle Lady Psychologist Ai Wei" based on InternLM2_7B_chat_qlora was launched. [Click here to obtain the model weights](https://openxlab.org.cn/models/detail/ajupyter/EmoLLM_aiwei), [configuration file](xtuner_config/aiwei-internlm2_chat_7b_qlora.py), [online experience link](https://openxlab.org.cn/apps/detail/ajupyter/EmoLLM-aiwei)
|
||||||
@ -91,7 +92,7 @@ The Model is aimed at fully understanding and promoting the mental health of ind
|
|||||||
|
|
||||||
- 【2024.2.3】 [Project Vedio](https://www.bilibili.com/video/BV1N7421N76X/) at bilibili 😊
|
- 【2024.2.3】 [Project Vedio](https://www.bilibili.com/video/BV1N7421N76X/) at bilibili 😊
|
||||||
- 【2024.1.27】 Complete data construction documentation, fine-tuning guide, deployment guide, Readme, and other related documents 👏
|
- 【2024.1.27】 Complete data construction documentation, fine-tuning guide, deployment guide, Readme, and other related documents 👏
|
||||||
- 【2024.1.25】 Complete the first version of EmoLLM and deploy it online https://openxlab.org.cn/apps/detail/jujimeizuo/EmoLLM 😀
|
- 【2024.1.25】 EmoLLM V1.0 has deployed online https://openxlab.org.cn/apps/detail/jujimeizuo/EmoLLM 😀
|
||||||
|
|
||||||
</details>
|
</details>
|
||||||
|
|
||||||
@ -104,7 +105,7 @@ The Model is aimed at fully understanding and promoting the mental health of ind
|
|||||||
|
|
||||||
## Contents
|
## Contents
|
||||||
|
|
||||||
- [EmoLLM - Large Languge Model for Mental Health](#emollm---large-languge-model-for-mental-health)
|
- [EmoLLM - Large Language Model for Mental Health](#emollm---large-language-model-for-mental-health)
|
||||||
- [Everyone is welcome to contribute to this project ~](#everyone-is-welcome-to-contribute-to-this-project-)
|
- [Everyone is welcome to contribute to this project ~](#everyone-is-welcome-to-contribute-to-this-project-)
|
||||||
- [Recent Updates](#recent-updates)
|
- [Recent Updates](#recent-updates)
|
||||||
- [Contents](#contents)
|
- [Contents](#contents)
|
||||||
@ -147,12 +148,12 @@ git clone https://github.com/SmartFlowAI/EmoLLM.git
|
|||||||
### File Directory Explanation
|
### File Directory Explanation
|
||||||
|
|
||||||
```
|
```
|
||||||
├─assets:Image Resources
|
├─assets: Image Resources
|
||||||
├─datasets:Dataset
|
├─datasets: Dataset
|
||||||
├─demo:demo scripts
|
├─demo: demo scripts
|
||||||
├─generate_data:Data Generation Guide
|
├─generate_data: Data Generation Guide
|
||||||
│ └─xinghuo
|
│ └─xinghuo
|
||||||
├─scripts:Some Available Tools
|
├─scripts: Some Available Tools
|
||||||
└─xtuner_config:Fine-tuning Guide
|
└─xtuner_config:Fine-tuning Guide
|
||||||
└─images
|
└─images
|
||||||
```
|
```
|
||||||
@ -193,7 +194,7 @@ Contributions make the open-source community an excellent place for learning, in
|
|||||||
|
|
||||||
### Version control
|
### Version control
|
||||||
|
|
||||||
This project uses Git for version control. You can see the current available versions in the repository.
|
This project uses Git for version control. You can see the currently available versions in the repository.
|
||||||
|
|
||||||
</details>
|
</details>
|
||||||
|
|
||||||
@ -209,7 +210,7 @@ This project uses Git for version control. You can see the current available ver
|
|||||||
|
|
||||||
[ZhouXinAo](https://github.com/zxazys)@Master's student at Nankai University
|
[ZhouXinAo](https://github.com/zxazys)@Master's student at Nankai University
|
||||||
|
|
||||||
[MING_X](https://github.com/MING-ZCH) @Undergraduate at Huazhong University of Science and Technology
|
[MING_X](https://github.com/MING-ZCH) @Undergraduate student at Huazhong University of Science and Technology
|
||||||
|
|
||||||
[Z_L](https://github.com/JasonLLLLLLLLLLL)@swufe
|
[Z_L](https://github.com/JasonLLLLLLLLLLL)@swufe
|
||||||
|
|
||||||
|
@ -14,19 +14,21 @@
|
|||||||
|
|
||||||
## 评测结果
|
## 评测结果
|
||||||
|
|
||||||
* 评测模型: [EmoLLM V1.0](https://openxlab.org.cn/models/detail/jujimeizuo/EmoLLM_Model)(InternLM2_7B_chat_qlora)
|
* 评测模型:
|
||||||
|
* [EmoLLM V1.0](https://openxlab.org.cn/models/detail/jujimeizuo/EmoLLM_Model) (InternLM2_7B_chat_qlora)
|
||||||
|
* [EmoLLM V2.0](https://openxlab.org.cn/apps/detail/Farewell1/EmoLLMV2.0) (InternLM2_7B_chat_full)
|
||||||
|
|
||||||
* 得分:
|
* 得分:
|
||||||
|
|
||||||
| Metric | Value |
|
| Model | Comprehensiveness | Professionalism | Authenticity | Safety |
|
||||||
|-------------------|------------|
|
|-------------------|-----------------------|-------------------|-----------------|---------|
|
||||||
| Comprehensiveness | 1.32 |
|
| InternLM2_7B_chat_qlora | 1.32 | 2.20 | 2.10 | 1.00 |
|
||||||
| Professionalism | 2.20 |
|
| InternLM2_7B_chat_full | 1.40 | 2.45 | 2.24 | 1.00 |
|
||||||
| Authenticity | 2.10 |
|
|
||||||
| Safety | 1.00 |
|
|
||||||
|
|
||||||
## 比较
|
## 比较
|
||||||
|
|
||||||
* [EmoLLM V1.0](https://openxlab.org.cn/models/detail/jujimeizuo/EmoLLM_Model) 在 InternLM2_7B_Chat 基础上提升较大;相比 Role-playing ChatGPT 在心理咨询任务上能力相近
|
* EmoLLM V2.0 相比 EmoLLM V1.0 在指标上全面提升!已超越 Role-playing ChatGPT 在心理咨询任务上的能力!
|
||||||
|
* EmoLLM V1.0 在 InternLM2_7B_Chat 基础上提升较大;相比 Role-playing ChatGPT 在心理咨询任务上能力相近
|
||||||
|
|
||||||
* 对比结果图片来源于论文《CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling》
|
* 对比结果图片来源于论文《CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling》
|
||||||
![image](https://github.com/MING-ZCH/EmoLLM/assets/119648793/abc9f626-11bc-4ec8-84a4-427c4600a720)
|
![image](https://github.com/MING-ZCH/EmoLLM/assets/119648793/abc9f626-11bc-4ec8-84a4-427c4600a720)
|
||||||
|
@ -14,19 +14,21 @@ The evaluation method, metric, and dataset from the paper《CPsyCoun: A Report-b
|
|||||||
|
|
||||||
## Result
|
## Result
|
||||||
|
|
||||||
* Model: [EmoLLM V1.0](https://openxlab.org.cn/models/detail/jujimeizuo/EmoLLM_Model)(InternLM2_7B_chat_qlora)
|
* Model:
|
||||||
|
* [EmoLLM V1.0](https://openxlab.org.cn/models/detail/jujimeizuo/EmoLLM_Model) (InternLM2_7B_chat_qlora)
|
||||||
|
* [EmoLLM V2.0](https://openxlab.org.cn/apps/detail/Farewell1/EmoLLMV2.0) (InternLM2_7B_chat_full)
|
||||||
|
|
||||||
* Score:
|
* Score:
|
||||||
|
|
||||||
| Metric | Value |
|
| Model | Comprehensiveness | Professionalism | Authenticity | Safety |
|
||||||
|-------------------|------------|
|
|-------------------|-----------------------|-------------------|-----------------|---------|
|
||||||
| Comprehensiveness | 1.32 |
|
| InternLM2_7B_chat_qlora | 1.32 | 2.20 | 2.10 | 1.00 |
|
||||||
| Professionalism | 2.20 |
|
| InternLM2_7B_chat_full | 1.40 | 2.45 | 2.24 | 1.00 |
|
||||||
| Authenticity | 2.10 |
|
|
||||||
| Safety | 1.00 |
|
|
||||||
|
|
||||||
## Comparison
|
## Comparison
|
||||||
|
|
||||||
* [EmoLLM V1.0](https://openxlab.org.cn/models/detail/jujimeizuo/EmoLLM_Model) is greatly improved on InternLM2_7B_Chat; Performance on the counseling task was similar compared to ChatGPT(Role-playing)
|
* EmoLLM V2.0 is greatly improved in all scores compared to EmoLLM V1.0! Surpasses the performance of Role-playing ChatGPT on counseling tasks!
|
||||||
|
* EmoLLM V1.0 is greatly improved on InternLM2_7B_Chat; Performance on the counseling task was similar compared to ChatGPT(Role-playing)
|
||||||
|
|
||||||
* The comparison results are from the paper《CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling》
|
* The comparison results are from the paper《CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling》
|
||||||
![image](https://github.com/MING-ZCH/EmoLLM/assets/119648793/abc9f626-11bc-4ec8-84a4-427c4600a720)
|
![image](https://github.com/MING-ZCH/EmoLLM/assets/119648793/abc9f626-11bc-4ec8-84a4-427c4600a720)
|
||||||
|
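The updated score table above can be checked mechanically. The sketch below copies the two rows from the table and prints the per-metric change from EmoLLM V1.0 (InternLM2_7B_chat_qlora) to EmoLLM V2.0 (InternLM2_7B_chat_full). The helper names `metric_deltas` and `improved_metrics` are hypothetical and are not part of the EmoLLM repository; only the numbers come from the table.

```python
# Scores copied from the updated table in this diff (higher is better is an
# assumption based on how the docs describe the comparison).
METRICS = ("Comprehensiveness", "Professionalism", "Authenticity", "Safety")
SCORES = {
    "InternLM2_7B_chat_qlora": (1.32, 2.20, 2.10, 1.00),  # EmoLLM V1.0
    "InternLM2_7B_chat_full": (1.40, 2.45, 2.24, 1.00),   # EmoLLM V2.0
}


def metric_deltas(old_model, new_model):
    """Return {metric: new_score - old_score}, rounded to two decimals."""
    old_scores = SCORES[old_model]
    new_scores = SCORES[new_model]
    return {
        metric: round(new - old, 2)
        for metric, old, new in zip(METRICS, old_scores, new_scores)
    }


def improved_metrics(deltas):
    """Return the metrics whose score strictly increased."""
    return [metric for metric, delta in deltas.items() if delta > 0]


if __name__ == "__main__":
    deltas = metric_deltas("InternLM2_7B_chat_qlora", "InternLM2_7B_chat_full")
    for metric, delta in deltas.items():
        print(f"{metric:<18} {delta:+.2f}")
    print("Improved:", ", ".join(improved_metrics(deltas)))
```

With the values in the table, Comprehensiveness, Professionalism, and Authenticity increase, while Safety stays at 1.00 for both models.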
@ -9,11 +9,13 @@
|
|||||||
| Qwen1_5-0_5B-chat | 27.23% | 8.55% | 17.05% | 26.65% | 13.11% | 7.19% | 4.05% |
|
| Qwen1_5-0_5B-chat | 27.23% | 8.55% | 17.05% | 26.65% | 13.11% | 7.19% | 4.05% |
|
||||||
| InternLM2_7B_chat_qlora | 37.86% | 15.23% | 24.34% | 39.71% | 22.66% | 14.26% | 9.21% |
|
| InternLM2_7B_chat_qlora | 37.86% | 15.23% | 24.34% | 39.71% | 22.66% | 14.26% | 9.21% |
|
||||||
| InternLM2_7B_chat_full | 32.45% | 10.82% | 20.17% | 30.48% | 15.67% | 8.84% | 5.02% |
|
| InternLM2_7B_chat_full | 32.45% | 10.82% | 20.17% | 30.48% | 15.67% | 8.84% | 5.02% |
|
||||||
|
|
||||||
## 专业指标评测
|
## 专业指标评测
|
||||||
|
|
||||||
* 具体评测指标和评测方法见 [Professional_evaluation.md](./Professional_evaluation.md)
|
* 具体评测指标和评测方法见 [Professional_evaluation.md](./Professional_evaluation.md)
|
||||||
|
|
||||||
| Model | Comprehensiveness | rofessionalism | Authenticity | Safety |
|
| Model | Comprehensiveness | Professionalism | Authenticity | Safety |
|
||||||
|-------------------|-----------------------|-------------------|-----------------|---------|
|
|-------------------|-----------------------|-------------------|-----------------|---------|
|
||||||
| InternLM2_7B_chat_qlora | 1.32 | 2.20 | 2.10 | 1.00 |
|
| InternLM2_7B_chat_qlora | 1.32 | 2.20 | 2.10 | 1.00 |
|
||||||
|
| InternLM2_7B_chat_full | 1.40 | 2.45 | 2.24 | 1.00 |
|
||||||
|
|
||||||
|