Bryce Wang
dd7b6c4cc1
Add files via upload
...
两个母亲多轮对话数据集合并、清理和去重之后,得到 2439 条多轮对话数据(每条有6-8轮对话)。
2024-03-22 15:13:30 -07:00
xzw
8a1e0df9d3
[DOC]update datesets/README.md ( #115 )
2024-03-21 15:50:20 +08:00
HongCheng
4ff7910368
Update process_merge.py
2024-03-21 16:07:18 +09:00
HongCheng
d25a304c4d
Update process_single_turn_conversation_construction.py
2024-03-21 16:06:41 +09:00
HongCheng
085a01eafa
add dataset processing codes
...
1. update process.py for multi_turn_dataset(1 and 2) and data.json, data_pro.json
2. add datasets\processed\process_single_turn_conversation_construction.py for single-turn dataset (1 and 2)
3. add datasets\processed\process_merge.py for these 6 updated dataset in datasets\processed\
2024-03-21 16:01:54 +09:00
HongCheng
ce2cb5156c
update data.json (delete 4 empty data)
...
4 empty lines in data.json 425 483 742 1120
2024-03-21 15:56:54 +09:00
zealot52099
e2025cc8ea
[DOC]update datesets/README.md
2024-03-21 08:24:15 +08:00
zealot52099
3b21f79c3c
Merge branch 'dev' of https://github.com/SmartFlowAI/EmoLLM into dev
2024-03-21 07:59:16 +08:00
zealot52099
c354ffd7e0
[DOC]update datesets/README.md
2024-03-21 07:58:13 +08:00
xzw
f5eb0ddc93
Merge pull request #113 from lll997150986/main
...
scientist.json
2024-03-20 23:44:46 +08:00
jeky
dbdd731565
1111
2024-03-20 23:25:07 +08:00
zealot52099
77ff2d079c
update deduplicate.py
2024-03-20 23:08:36 +08:00
zealot52099
41744ed604
[DOC] update datasets/README_EN.md
2024-03-20 17:52:23 +08:00
zealot52099
9b4e58f732
[DOC]update datasets/README.md
2024-03-20 17:40:31 +08:00
zealot52099
b542929c1d
add deduplicate.py
2024-03-19 20:09:44 +08:00
zealot52099
861f12d47a
add deduplicate.py
2024-03-19 16:41:09 +08:00
MING_X
b499aec9da
Update README_EN.md
2024-03-10 16:09:17 +08:00
MING_X
49998436b9
Update README.md
2024-03-10 16:04:31 +08:00
MING_X
3a49c22983
Create README_EN.md
2024-03-06 17:58:17 +08:00
MING_X
b8bd726849
Create README.md
2024-03-05 23:24:33 +08:00
aJupyter
4d8ae7d428
feat: add internlm2-chat-7b-config
2024-03-03 21:08:52 +08:00
Nobody-ML
a71de6ce24
add SoulStar_data
2024-03-03 17:28:26 +08:00
MING_X
4a1ef9c083
Add files via upload
2024-02-28 21:18:02 +08:00
MING_X
97f0cc068a
upload smile.dataset
2024-02-28 17:44:48 +08:00
MING_X
96b0cf76dd
Delete datasets/qa_dataset.json
2024-02-27 22:03:56 +08:00
MING_X
7ebb05c236
Upload datasets
...
two cleaned single_turn datasets from qa_dataset.json
2024-02-27 22:01:53 +08:00
MrCatAI
6739f2ed4c
new_dataset
2024-02-26 17:25:06 +00:00
MrCatAI
6e70c62771
qa_dataset
2024-02-26 17:22:11 +00:00
ZhouXinAo
52c7d63d49
feat:Add new finetune configurations and datasets
2024-02-24 22:39:10 +08:00
ZhouXinAo
a691e78307
Delete datasets/tiangou.json
2024-02-24 22:38:52 +08:00
ZhouXinAo
f505efb0c4
Add files via upload
...
Add new finetune configurations and datasets
2024-02-24 22:37:08 +08:00
jupyter
1a6b8eac20
feat:Add new finetune configurations and datasets
2024-02-23 11:36:58 +08:00
jupyter
294d5d1d60
feat: add datasets and update readme
2024-01-26 22:43:38 +08:00