HongCheng
0e946ca29b
Add files via upload
2024-04-20 21:09:34 +09:00
HongCheng
8cfdc44a39
Add files via upload
2024-04-20 21:08:48 +09:00
HongCheng
1d9ee54555
Self-cognition dataset and processing code
2024-04-20 21:04:00 +09:00
HongCheng
89dea4826a
Fix example dataset multi_turn_dataset_2, add more description, move processing files
2024-04-20 13:44:46 +09:00
santiagoTOP
186511ce9c
Baby EmoLLM
2024-04-14 20:20:11 +08:00
MING_X
4c60db8afe
Update README_EN.md
2024-04-09 23:08:55 +08:00
MING_X
4827b3932f
Update README.md
2024-04-09 23:08:12 +08:00
MING_X
4068146fdf
Add files via upload
2024-04-09 23:05:22 +08:00
MING_X
310ecfb18f
Delete datasets/mother_v2.json
2024-04-09 23:03:09 +08:00
MING_X
907e7145f7
Rename mother_v2_3838.json to mother_v2.json
2024-04-09 23:02:59 +08:00
MING_X
d1bf15c93a
Delete datasets/mother_v1_2439.json
2024-04-09 22:57:12 +08:00
MING_X
700edfb9e8
Update README_EN.md
2024-04-09 20:53:17 +08:00
MING_X
360dc212a5
Update README.md
2024-04-09 19:31:14 +08:00
Bryce Wang
a6241bcc49
Add files via upload
2024-03-29 15:06:49 -07:00
Bryce Wang
86332780c2
Rename mother.json to mother_v1_2439.json
2024-03-29 15:06:09 -07:00
HongCheng
d0b70677f6
Merge pull request #2 from SmartFlowAI/main
...
Sync
2024-03-24 01:05:02 +09:00
HongCheng
f7c35fef14
format update
2024-03-23 22:36:44 +09:00
xzw
aaacfb149b
Add fine-tuning README.md and related files ( #130 )
2024-03-23 19:59:26 +08:00
xzw
a12a7ef107
add base model qlora finetuning config file and optimize deduplicate.py ( #128 )
2024-03-23 19:20:17 +08:00
HongCheng
950cab0262
optimize deduplicate.py
...
Add time print information
save duplicate dataset as well
remove print(content)
2024-03-23 15:24:45 +09:00
Bryce Wang
dd7b6c4cc1
Add files via upload
...
After merging, cleaning, and deduplicating the two mother multi-turn dialogue datasets, 2439 multi-turn conversations remain (each with 6-8 turns).
2024-03-22 15:13:30 -07:00
zealot52099
66b7617f04
Test push to dev
...
Test push to dev
2024-03-22 20:45:13 +08:00
xzw
8a1e0df9d3
[DOC] update datasets/README.md ( #115 )
2024-03-21 15:50:20 +08:00
HongCheng
4ff7910368
Update process_merge.py
2024-03-21 16:07:18 +09:00
HongCheng
d25a304c4d
Update process_single_turn_conversation_construction.py
2024-03-21 16:06:41 +09:00
HongCheng
085a01eafa
add dataset processing codes
...
1. update process.py for multi_turn_dataset(1 and 2) and data.json, data_pro.json
2. add datasets\processed\process_single_turn_conversation_construction.py for single-turn dataset (1 and 2)
3. add datasets\processed\process_merge.py for these 6 updated dataset in datasets\processed\
2024-03-21 16:01:54 +09:00
HongCheng
ce2cb5156c
update data.json (delete 4 empty data)
...
4 empty lines in data.json: 425, 483, 742, 1120
2024-03-21 15:56:54 +09:00
zealot52099
e2025cc8ea
[DOC] update datasets/README.md
2024-03-21 08:24:15 +08:00
zealot52099
3b21f79c3c
Merge branch 'dev' of https://github.com/SmartFlowAI/EmoLLM into dev
2024-03-21 07:59:16 +08:00
zealot52099
c354ffd7e0
[DOC] update datasets/README.md
2024-03-21 07:58:13 +08:00
xzw
f5eb0ddc93
Merge pull request #113 from lll997150986/main
...
scientist.json
2024-03-20 23:44:46 +08:00
jeky
dbdd731565
1111
2024-03-20 23:25:07 +08:00
zealot52099
77ff2d079c
update deduplicate.py
2024-03-20 23:08:36 +08:00
zealot52099
41744ed604
[DOC] update datasets/README_EN.md
2024-03-20 17:52:23 +08:00
zealot52099
9b4e58f732
[DOC] update datasets/README.md
2024-03-20 17:40:31 +08:00
zealot52099
b542929c1d
add deduplicate.py
2024-03-19 20:09:44 +08:00
zealot52099
861f12d47a
add deduplicate.py
2024-03-19 16:41:09 +08:00
MING_X
b499aec9da
Update README_EN.md
2024-03-10 16:09:17 +08:00
MING_X
49998436b9
Update README.md
2024-03-10 16:04:31 +08:00
MING_X
3a49c22983
Create README_EN.md
2024-03-06 17:58:17 +08:00
MING_X
b8bd726849
Create README.md
2024-03-05 23:24:33 +08:00
aJupyter
4d8ae7d428
feat: add internlm2-chat-7b-config
2024-03-03 21:08:52 +08:00
Nobody-ML
a71de6ce24
add SoulStar_data
2024-03-03 17:28:26 +08:00
MING_X
4a1ef9c083
Add files via upload
2024-02-28 21:18:02 +08:00
MING_X
97f0cc068a
upload smile.dataset
2024-02-28 17:44:48 +08:00
MING_X
96b0cf76dd
Delete datasets/qa_dataset.json
2024-02-27 22:03:56 +08:00
MING_X
7ebb05c236
Upload datasets
...
two cleaned single_turn datasets from qa_dataset.json
2024-02-27 22:01:53 +08:00
MrCatAI
6739f2ed4c
new_dataset
2024-02-26 17:25:06 +00:00
MrCatAI
6e70c62771
qa_dataset
2024-02-26 17:22:11 +00:00
ZhouXinAo
52c7d63d49
feat:Add new finetune configurations and datasets
2024-02-24 22:39:10 +08:00