* update rag/src/data_processing.py

* Add files via upload

allow user to load embedding & rerank models from cache

* Add files via upload

embedding_path = os.path.join(model_dir, 'embedding_model')  
rerank_path = os.path.join(model_dir, 'rerank_model')

* 测试push dev

测试push dev

* Add files via upload

两个母亲多轮对话数据集合并、清理和去重之后，得到 2439 条多轮对话数据（每条有6-8轮对话）。

* optimize deduplicate.py

Add time print information
save duplicate dataset as well
remove print(content)

* add base model qlora fintuning config file: internlm2_7b_base_qlora_e10_M_1e4_32_64.py

* add full finetune code from internlm2

* other 2 configs for base model

* update cli_internlm2.py

 three methods to load model

1. download model in openxlab
2. download model in modelscope
3. offline model

* create upload_modelscope.py

* add base model and update personal contributions

* add README.md for Emollm_Scientist

* Create README_internlm2_7b_base_qlora.md

InternLM2 7B Base QLoRA 微调指南

* [DOC]EmoLLM_Scientist微调指南

* [DOC]EmoLLM_Scientist微调指南

* [DOC]EmoLLM_Scientist微调指南

* [DOC]EmoLLM_Scientist微调指南

* [DOC]EmoLLM_Scientist微调指南

* [DOC]EmoLLM_Scientist微调指南

* update

* [DOC]README_scientist.md

* delete config

* format update

* upload xlab

* add README_Model_Uploading.md and images

* modelscope model upload

* Modify Recent Updates

* update daddy-like Boy-Friend EmoLLM

* update model uploading with openxlab

* update model uploading with openxlab

---------

Co-authored-by: zealot52099 <songyan5209@163.com>
Co-authored-by: xzw <62385492+aJupyter@users.noreply.github.com>
Co-authored-by: zealot52099 <67356208+zealot52099@users.noreply.github.com>
Co-authored-by: Bryce Wang <90940753+brycewang2018@users.noreply.github.com>
Co-authored-by: HongCheng <kwchenghong@gmail.com>

2024-03-24 11:51:19 +08:00

4.5 KiB

Raw Blame History

EmoLLM_Scientist微调指南

[EmoLLM_Scientist模型地址]https://modelscope.cn/models/zealot5209/EmoLLM_Scientist/summary

数据

借助LLM自动生成符合科学家人设的对话内容，再对生成数据进行人工校验。最终微调数据包含约3800段对话，每段对话包含5~10轮。

数据路径：

datasets\scientist.json

数据样例：

{"conversation":[{"system":"现在你是一个心理专家, 我有一些心理问题, 请你用专业的知识帮我解决。",
"input":"我最近对学习特别兴奋，每天可以熬夜到凌晨三点，我觉得这样效率很高，但室友说我有点过度了，我不这么认为。\n\n",
"output":"很高兴看到你对学业充满热情，你现在的学习状态让我想到了心理学上的“过度激活”现象。能否详细描述一下你为何选择深夜学习，以及这样做的实际效果如何呢？\n\n"},
{"input":"我觉得晚上思路更清晰，而且能集中精力完成很多任务。确实白天上课时有点困倦，但我总觉得自己在高效利用时间。\n\n",
"output":"理解你的想法，不过科学研究显示，长期熬夜可能影响生物钟，进而导致认知功能下降。作为你的科学家心理咨询师，我想知道，你是否注意到这种高强度学习方式对日常生活其他方面的潜在影响，比如社交活动或身体健康？\n\n"},
{"input":"确实，我发现自己减少了锻炼和跟朋友相处的时间，但我觉得这些都是暂时牺牲，为了更好的成绩。\n\n",
"output":"你这种短期目标优先的策略值得肯定，但从长远看，身心健康与社会交往同样重要。有没有想过制定一个兼顾学习与生活的合理计划？我们可以一起探讨如何优化时间管理，确保高效学习的同时，也不忽视生活质量。\n\n"},
{"input":"我也想平衡，只是不知道怎么做才好。\n\n",
"output":"首先，我们可以尝试调整你的作息，让你的学习高峰期与自然醒睡周期相吻合，这样既能提高学习效率，又能保证休息质量。其次，我们可以设定每日学习目标和休闲时间，科学分配任务。最后，记得适时奖励自己，庆祝每个小成就，这样可以激励持续进步。你觉得这样的方案是否可行？\n\n"},
{"input":"听起来不错，我会试试看的，但怕自己坚持不下来。\n\n",
"output":"理解你的担忧，改变习惯需要时间和耐心。我们可以通过行为科学的方法逐步调整，如采用“番茄工作法”，结合正向强化策略，帮助你建立新的学习习惯。记住，每一次微小的进步都值得庆祝，我会陪伴并支持你的改变过程。接下来我们将一起细化这个计划并开始实施，你觉得如何？"}]}

基座

基座模型采用InternLM2-Chat-7B，更多模型介绍请见InternLM

训练方式

基于xtuner进行微调。使用xtuner的train命令行工具，流程如下：

安装依赖

cd xtuner_config/
pip3 install -r requirements.txt

运行微调脚本

cd xtuner_config/
xtuner train internlm2_7b_chat_qlora_e3_scienctist.py --deepspeed deepspeed_zero2

模型转换

将得到的 PTH 模型转换为 HuggingFace 模型，生成 Adapter 文件夹

cd xtuner_config/
mkdir hf
export MKL_SERVICE_FORCE_INTEL=1
#这里假设训练了3个epoch
xtuner convert pth_to_hf internlm2_7b_chat_qlora_e3_scienctist.py ./work_dirs/internlm2_7b_chat_qlora_e3_scienctist/epoch_3.pth ./hf

模型合并

将 HuggingFace adapter 合并到大语言模型

xtuner convert merge ./internlm2-chat-7b ./hf ./merged --max-shard-size 2GB
# xtuner convert merge \
#     ${NAME_OR_PATH_TO_LLM} \
#     ${NAME_OR_PATH_TO_ADAPTER} \
#     ${SAVE_PATH} \
#     --max-shard-size 2GB

测试

cd demo/
python cli_internlm2_scientist.py

模型上传

完成测试后可将模型上传到ModelScope和Openxlab平台(不建议在Windows下操作)

ModelScope

Openxlab模型上传脚本：scripts/upload_modelscope.py

Openxlab

ModelScope模型上传

其他

欢迎大家给xtuner和EmoLLM点点star~

🎉🎉🎉🎉🎉

4.5 KiB Raw Blame History Unescape Escape