
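EmoLLM Datasets

  • By purpose, the datasets fall into two types: General and Role-play
  • By format, the data falls into two types: QA and Conversation
  • In total: 6 General datasets and 3 Role-play datasets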

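Dataset Types

  • General: general-purpose datasets covering psychology knowledge, counseling techniques, and similar general content
  • Role-play: role-play datasets containing dialogue written in the speaking style of a specific character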

Data Types

  • QA: single question-answer pairs
  • Conversation: multi-turn dialogues
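
The distinction between the two data types can be sketched in a few lines of Python. This is only an illustration: the record layout used here (a `conversation` list of `input`/`output` turns) and the helper name `classify_record` are assumptions, not a schema documented in this README, so check them against the actual JSON files before relying on them.

```python
from typing import Any


def classify_record(record: dict[str, Any]) -> str:
    """Return "QA" for a single exchange and "Conversation" for several turns.

    Assumed (hypothetical) layout:
    {"conversation": [{"input": "...", "output": "..."}, ...]}
    """
    turns = record.get("conversation", [])
    if not turns:
        raise ValueError("record contains no turns")
    return "QA" if len(turns) == 1 else "Conversation"


# Hypothetical records, written only to exercise the helper.
qa_record = {"conversation": [{"input": "What is CBT?", "output": "..."}]}
chat_record = {
    "conversation": [
        {"input": "I have been sleeping badly.", "output": "..."},
        {"input": "It started last month.", "output": "..."},
    ]
}

print(classify_record(qa_record))
print(classify_record(chat_record))
```

Under this assumption a record with exactly one turn counts as QA and anything longer counts as Conversation, which mirrors the single-turn and multi-turn naming of the dataset files.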

Dataset Summary

| Category  | Dataset               | Type         | Total   |
|-----------|-----------------------|--------------|---------|
| General   | data                  | Conversation | 5,600+  |
| General   | data_pro              | Conversation | 36,500+ |
| General   | multi_turn_dataset_1  | Conversation | 36,000+ |
| General   | multi_turn_dataset_2  | Conversation | 27,000+ |
| General   | single_turn_dataset_1 | QA           | 14,000+ |
| General   | single_turn_dataset_2 | QA           | 18,300+ |
| Role-play | aiwei                 | Conversation | 4,000+  |
| Role-play | SoulStar              | QA           | 11,200+ |
| Role-play | tiangou               | Conversation | 3,900+  |
| ……        | ……                    | ……           | ……      |
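
As a quick consistency check, the rows of the summary can be tallied per category, and a file's record count can be compared with the Total column. The sketch below copies the category, dataset, and type values from the summary; `count_records` is a hypothetical helper that assumes each dataset file is a top-level JSON array with one element per record, which this README does not confirm.

```python
import json
from collections import Counter
from pathlib import Path

# (category, dataset, type) rows copied from the dataset summary.
SUMMARY = [
    ("General", "data", "Conversation"),
    ("General", "data_pro", "Conversation"),
    ("General", "multi_turn_dataset_1", "Conversation"),
    ("General", "multi_turn_dataset_2", "Conversation"),
    ("General", "single_turn_dataset_1", "QA"),
    ("General", "single_turn_dataset_2", "QA"),
    ("Role-play", "aiwei", "Conversation"),
    ("Role-play", "SoulStar", "QA"),
    ("Role-play", "tiangou", "Conversation"),
]


def datasets_per_category(rows):
    """Count how many datasets are listed under each category."""
    return Counter(category for category, _name, _type in rows)


def count_records(path):
    """Return the number of records in one dataset file.

    Assumption: the file holds a top-level JSON array.
    """
    with Path(path).open(encoding="utf-8") as f:
        data = json.load(f)
    if not isinstance(data, list):
        raise ValueError(f"{path}: expected a top-level JSON array")
    return len(data)


print(dict(datasets_per_category(SUMMARY)))
```

The tally gives 6 General and 3 Role-play datasets, matching the overview at the top. Calling `count_records("data.json")` from inside the datasets directory would give a number to compare against the 5,600+ listed for `data`, provided the array assumption holds.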