OliveSensorAPI/datasets/README.md
Anooyman 000491f1be
Merge Main code (#2)
* Add files via upload

* 新增ENmd文档

* Update README.md

* Update README_EN.md

* Update LICENSE

* [docs] update lmdeploy file

* add ocr.md

* Update tutorial.md

* Update tutorial_EN.md

* Update General_evaluation_EN.md

* Update General_evaluation_EN.md

* Update README.md

Add InternLM2_7B_chat_full's professional evaluation results

* Update Professional_evaluation.md

* Update Professional_evaluation.md

* Update Professional_evaluation.md

* Update Professional_evaluation.md

* Update Professional_evaluation_EN.md

* Update README.md

* Update README.md

* Update README_EN.md

* Update README_EN.md

* Update README_EN.md

* [DOC] update readme

* Update LICENSE

* Update LICENSE

* update personal info and small format optimizations

* update personal info and translations for contents in a table

* Update RAG README

* Update demo link in README.md

* Update xlab app link

* Update xlab link

* add xlab model

* Update web_demo-aiwei.py

* add bitex

---------

Co-authored-by: xzw <62385492+aJupyter@users.noreply.github.com>
Co-authored-by: এ許我辞忧࿐♡ <127636623+Smiling-Weeping-zhr@users.noreply.github.com>
Co-authored-by: Vicky <vicky_3021@163.com>
Co-authored-by: MING_X <119648793+MING-ZCH@users.noreply.github.com>
Co-authored-by: Nobody-ML <1755309985@qq.com>
Co-authored-by: 8baby8 <3345710651@qq.com>
Co-authored-by: chaoke <101492509+8baby8@users.noreply.github.com>
Co-authored-by: aJupyter <ajupyter@163.com>
Co-authored-by: HongCheng <kwchenghong@gmail.com>
Co-authored-by: santiagoTOP <“1537211712top@gmail.com”>
2024-03-15 19:51:04 +08:00

1.8 KiB
Raw Blame History

EmoLLM数据集

  • 数据集按用处分为两种类型:GeneralRole-play
  • 数据按格式分为两种类型:QAConversation
  • 数据汇总General6个数据集Role-play3个数据集

数据集类型

  • General:通用数据集,包含心理学知识、心理咨询技术等通用内容
  • Role-play:角色扮演数据集,包含特定角色对话风格数据等内容

数据类型

  • QA:问答对
  • Conversation:多轮对话

数据集汇总

Category Dataset Type Total
General data Conversation 5600+
General data_pro Conversation 36500+
General multi_turn_dataset_1 Conversation 36,000+
General multi_turn_dataset_2 Conversation 27,000+
General single_turn_dataset_1 QA 14000+
General single_turn_dataset_2 QA 18300+
Role-play aiwei Conversation 4000+
Role-play SoulStar QA 11200+
Role-play tiangou Conversation 3900+
…… …… …… ……

数据集来源

General

  • 数据集 data 来自本项目
  • 数据集 data_pro 来自本项目
  • 数据集 multi_turn_dataset_1 来源 Smile
  • 数据集 multi_turn_dataset_2 来源 CPsyCounD
  • 数据集 single_turn_dataset_1 来自本项目
  • 数据集 single_turn_dataset_2 来自本项目

Role-play

  • 数据集 aiwei 来自本项目
  • 数据集 tiangou 来自本项目
  • 数据集 SoulStar 来源 SoulStar