Commit Graph

32 Commits

Author SHA1 Message Date
Anooyman
14890fad56
Update code (#8)
* feat: add agents/actions/write_markdown

* [ADD] add evaluation result of base model on 5/10 epochs

* Rename mother.json to mother_v1_2439.json

* Add files via upload

* [DOC] update README

* Update requirements.txt

update mpi4py installation

* Update README_EN.md

update English comma

* Update README.md

Fine-tuning of the mother-role multi-turn dialogue model is complete. It has been uploaded to Hugging Face.

* Fine-tuning script for the mother-role multi-turn dialogue model

* Update README.md

Added author information for 王几行XING and 思在

* Update README_EN.md

* Update README.md

* Update README_EN.md

* Update README_EN.md

* Changes to be committed:
	modified:   .gitignore
	modified:   README.md
	modified:   README_EN.md
	new file:   assets/EmoLLM_transparent.png
	deleted:    assets/Shusheng.jpg
	new file:   assets/Shusheng.png
	new file:   assets/aiwei_demo1.gif
	new file:   assets/aiwei_demo2.gif
	new file:   assets/aiwei_demo3.gif
	new file:   assets/aiwei_demo4.gif

* Update README.md

rectify aiwei_demo.gif

* Update README.md

rectify aiwei_demo style

* Changes to be committed:
	modified:   README.md
	modified:   README_EN.md

* Changes to be committed:
	modified:   README.md
	modified:   README_EN.md

* [Doc] update readme

* [Doc] update readme

* Update README.md

* Update README_EN.md

* Update README.md

* Update README_EN.md

* Delete datasets/mother_v1_2439.json

* Rename mother_v2_3838.json to mother_v2.json

* Delete datasets/mother_v2.json

* Add files via upload

* Update README.md

* Update README_EN.md

* [Doc] Update README_EN.md

minor fix

* Update links and evaluation results for the InternLM2-Base-7B QLoRA fine-tuned model

* add download_model.py script, automatic download of model libraries

* Remove black borders from images and update author information
	modified:   README.md
	new file:   assets/aiwei_demo.gif
	deleted:    assets/aiwei_demo1.gif
	modified:   assets/aiwei_demo2.gif
	modified:   assets/aiwei_demo3.gif
	modified:   assets/aiwei_demo4.gif

* rectify aiwei_demo transparent

* transparent

* modify: aiwei_demo table--->div

* modified:   aiwei_demo

* modify: div ---> table

* modified:   README.md

* modified:   README_EN.md

* update model config file links

* Create internlm2_20b_chat_lora_alpaca_e3.py

Config file for the 20B model

* update model config file links

update model config file links

* Revert "update model config file links"

---------

Co-authored-by: jujimeizuo <fengzetao.zed@foxmail.com>
Co-authored-by: xzw <62385492+aJupyter@users.noreply.github.com>
Co-authored-by: Zeyu Ba <72795264+ZeyuBa@users.noreply.github.com>
Co-authored-by: Bryce Wang <90940753+brycewang2018@users.noreply.github.com>
Co-authored-by: zealot52099 <songyan5209@163.com>
Co-authored-by: HongCheng <kwchenghong@gmail.com>
Co-authored-by: Yicong <yicooong@qq.com>
Co-authored-by: Yicooong <54353406+Yicooong@users.noreply.github.com>
Co-authored-by: aJupyter <ajupyter@163.com>
Co-authored-by: MING_X <119648793+MING-ZCH@users.noreply.github.com>
Co-authored-by: Ikko Eltociear Ashimine <eltociear@gmail.com>
Co-authored-by: HatBoy <null2none@163.com>
Co-authored-by: ZhouXinAo <142309012+zxazys@users.noreply.github.com>
2024-04-14 10:09:17 +08:00
HongCheng
c902356919 update remove a space for bold 2024-03-27 20:23:36 +08:00
Anooyman
de0674ccf7
Update main code (#2)
* update rag/src/data_processing.py

* Add files via upload

allow user to load embedding & rerank models from cache

* Add files via upload

embedding_path = os.path.join(model_dir, 'embedding_model')  
rerank_path = os.path.join(model_dir, 'rerank_model')

* Test push to dev

Test push to dev

* Add files via upload

After merging, cleaning, and deduplicating the two mother multi-turn dialogue datasets, 2439 multi-turn dialogues remain (each with 6-8 turns).

* optimize deduplicate.py

Add time print information
save duplicate dataset as well
remove print(content)

* add base model qlora finetuning config file: internlm2_7b_base_qlora_e10_M_1e4_32_64.py

* add full finetune code from internlm2

* other 2 configs for base model

* update cli_internlm2.py

three methods to load the model:

1. download the model from openxlab
2. download the model from modelscope
3. load an offline model

* create upload_modelscope.py

* add base model and update personal contributions

* add README.md for Emollm_Scientist

* Create README_internlm2_7b_base_qlora.md

InternLM2 7B Base QLoRA fine-tuning guide

* [DOC] EmoLLM_Scientist fine-tuning guide

* [DOC] EmoLLM_Scientist fine-tuning guide

* [DOC] EmoLLM_Scientist fine-tuning guide

* [DOC] EmoLLM_Scientist fine-tuning guide

* [DOC] EmoLLM_Scientist fine-tuning guide

* [DOC] EmoLLM_Scientist fine-tuning guide

* update

* [DOC]README_scientist.md

* delete config

* format update

* upload xlab

* add README_Model_Uploading.md and images

* modelscope model upload

* Modify Recent Updates

* update daddy-like Boy-Friend EmoLLM

* update model uploading with openxlab

* update model uploading with openxlab

---------

Co-authored-by: zealot52099 <songyan5209@163.com>
Co-authored-by: xzw <62385492+aJupyter@users.noreply.github.com>
Co-authored-by: zealot52099 <67356208+zealot52099@users.noreply.github.com>
Co-authored-by: Bryce Wang <90940753+brycewang2018@users.noreply.github.com>
Co-authored-by: HongCheng <kwchenghong@gmail.com>
2024-03-24 11:51:19 +08:00
santiagoTOP
88218bfd4b Update RAG README 2024-03-17 20:37:26 +08:00
王友昉
da6286c151 clean qa 2024-03-16 20:45:30 +08:00
王友昉
9bcd4e060b QA clean 2024-03-16 13:12:15 +08:00
santiagoTOP
12e05e37d4 Update RAG README 2024-03-14 16:31:00 +08:00
xzw
543923dd47
Merge pull request #54 from Anooyman/main
Update multi QA generation process
2024-03-10 21:51:46 +08:00
Vicky
27458ef8ac Add EN md docs 2024-03-10 15:52:18 +08:00
edward_ke
c01c67c33f Update multi QA generation process
Each thread stores the generated content independently, and finally integrates the generated content into a file
2024-03-10 13:10:24 +08:00
Anooyman
d60f1dc8e1
Add concurrent functions (#1)
Add concurrent functions for QA generation

Co-authored-by: edward_ke <edward_ke@trendmicro.com>
2024-03-09 16:44:59 +08:00
Mxode
07527619fd Calling parameters from config.py 2024-03-08 19:00:45 +08:00
Mxode
b6eca08fb8 Add exception handling 2024-03-08 19:00:12 +08:00
aJupyter
1262b9f22a Added parameters for control 2024-03-08 18:40:07 +08:00
Max
bfe9902852 QA Generation - Modify the default interval to 10 2024-03-08 00:02:21 +08:00
Max
bb99153255 QA Generation - Update config.py 2024-03-07 23:59:10 +08:00
Max
52a0cc3b44 QA Generation - Update requirements.txt 2024-03-07 23:58:05 +08:00
MING_X
f90745e386
Update README.md 2024-03-07 23:05:30 +08:00
MING_X
2424d2fcf3
Update README.md 2024-03-07 22:54:32 +08:00
MING_X
86f8aaf82e
Create system_prompt_v2.md 2024-03-07 22:52:32 +08:00
MING_X
f5bdc120e8
Rename system_prompt.md to system_prompt_v1.md 2024-03-07 22:40:28 +08:00
Mxode
18997ec79c QA Generation - Update README 2024-03-07 18:01:42 +08:00
Mxode
57a9db4c5b Upload QA generation pipeline 2024-03-07 17:56:07 +08:00
8baby8
6e74a597ba MODIFY PDF2TXT 2024-02-29 12:36:38 +08:00
8baby8
d415e89b72 Add PDF2TXT 2024-02-29 11:56:58 +08:00
chaoke
7f526e2af3
Add files via upload 2024-02-29 11:16:45 +08:00
jupyter
1a6b8eac20 feat:Add new finetune configurations and datasets 2024-02-23 11:36:58 +08:00
jujimeizuo
c136dfc47b feat: add web_internlm2 and upload s.t. scripts 2024-01-25 19:02:24 +08:00
jujimeizuo
c7d35c2cc9 update: scripts 2024-01-24 14:47:55 +08:00
এ許我辞忧࿐♡
cc3bd3722a
Update 说明.txt 2024-01-22 16:37:58 +08:00
এ許我辞忧࿐♡
5651fc181e
Add files via upload 2024-01-22 16:33:51 +08:00
jujimeizuo
45b143b6ef feat: finetune Qwen and demo 2024-01-21 19:12:03 +08:00