From 3a49c22983c1ced512a6ab51f04fe65a373292d4 Mon Sep 17 00:00:00 2001 From: MING_X <119648793+MING-ZCH@users.noreply.github.com> Date: Wed, 6 Mar 2024 17:58:17 +0800 Subject: [PATCH] Create README_EN.md --- datasets/README_EN.md | 28 ++++++++++++++++++++++++++++ 1 file changed, 28 insertions(+) create mode 100644 datasets/README_EN.md diff --git a/datasets/README_EN.md b/datasets/README_EN.md new file mode 100644 index 0000000..4831953 --- /dev/null +++ b/datasets/README_EN.md @@ -0,0 +1,28 @@ +# EmoLLM's datasets + +* Category of dataset: **General** and **Role-play** +* Type of data: **QA** and **Conversation** +* Summary: General(**6 datasets**), Role-play(**3 datasets**) + + ## Category +* **General**: generic dataset, including psychological Knowledge, counseling technology, etc. +* **Role-play**: role-playing dataset, including character-specific conversation style data, etc. + +## Type +* **QA**: question-and-answer pair +* **Conversation**: multi-turn consultation dialogue + +## Summary + +| Category | Dataset | Type | Total | +| :---------: | :-------------------: | :----------: | :-----: | +| *General* | data | Conversation | 5600+ | +| *General* | data_pro | Conversation | 36500+ | +| *General* | multi_turn_dataset_1 | Conversation | 36,000+ | +| *General* | multi_turn_dataset_2 | Conversation | 27,000+ | +| *General* | single_turn_dataset_1 | QA | 14000+ | +| *General* | single_turn_dataset_2 | QA | 18300+ | +| *Role-play* | aiwei | Conversation | 4000+ | +| *Role-play* | SoulStar | QA | 11200+ | +| *Role-play* | tiangou | Conversation | 3900+ | +| …… | …… | …… | …… |