[Merge] Merge datasets from Dev bench (#175)
This commit is contained in:
		
						commit
						43646e7cf0
					
				| @ -19,15 +19,15 @@ | |||||||
| |   Category  |        Dataset        |     Type     |  Total  | | |   Category  |        Dataset        |     Type     |  Total  | | ||||||
| | :---------: | :-------------------: | :----------: | :-----: | | | :---------: | :-------------------: | :----------: | :-----: | | ||||||
| |  *General*  |         data          | Conversation |  5600+  | | |  *General*  |         data          | Conversation |  5600+  | | ||||||
| |  *General*  |       data_pro        | Conversation | 36500+  | | |  *General*  |       data_pro        | Conversation | 36,500+ | | ||||||
| |  *General*  | multi_turn_dataset_1  | Conversation | 36,000+ | | |  *General*  | multi_turn_dataset_1  | Conversation | 36,000+ | | ||||||
| |  *General*  | multi_turn_dataset_2  | Conversation | 27,000+ | | |  *General*  | multi_turn_dataset_2  | Conversation | 27,000+ | | ||||||
| |  *General*  | single_turn_dataset_1 |      QA      | 14000+  | | |  *General*  | single_turn_dataset_1 |      QA      | 14,000+ | | ||||||
| |  *General*  | single_turn_dataset_2 |      QA      | 18300+  | | |  *General*  | single_turn_dataset_2 |      QA      | 18,300+ | | ||||||
| | *Role-play* |         aiwei         | Conversation |  4000+  | | | *Role-play* |         aiwei         | Conversation |  4000+  | | ||||||
| | *Role-play* |       SoulStar        |      QA      | 11200+  | | | *Role-play* |       SoulStar        |      QA      | 11,200+ | | ||||||
| | *Role-play* |        tiangou        | Conversation |  3900+  | | | *Role-play* |        tiangou        | Conversation |  3900+  | | ||||||
| | *Role-play* |        mother         | Conversation | 24,500+ | | | *Role-play* |        mother         | Conversation | 40,300+ | | ||||||
| | *Role-play* |       scientist       | Conversation | 28,400+ | | | *Role-play* |       scientist       | Conversation | 28,400+ | | ||||||
| |     ……      |          ……           |      ……      |   ……    | | |     ……      |          ……           |      ……      |   ……    | | ||||||
| 
 | 
 | ||||||
| @ -35,20 +35,20 @@ | |||||||
| 
 | 
 | ||||||
| ### **General** | ### **General** | ||||||
| 
 | 
 | ||||||
| * 数据集 data 来自本项目 | * 数据集 `data` 来自本项目 | ||||||
| * 数据集 data_pro 来自本项目 | * 数据集 `data_pro` 来自本项目 | ||||||
| * 数据集 multi_turn_dataset_1 来源 [Smile](https://github.com/qiuhuachuan/smile) | * 数据集 `multi_turn_dataset_1` 来源 [Smile](https://github.com/qiuhuachuan/smile) | ||||||
| * 数据集 multi_turn_dataset_2 来源 [CPsyCounD](https://github.com/CAS-SIAT-XinHai/CPsyCoun) | * 数据集 `multi_turn_dataset_2` 来源 [CPsyCounD](https://github.com/CAS-SIAT-XinHai/CPsyCoun) | ||||||
| * 数据集 single_turn_dataset_1 来自本项目 | * 数据集 `single_turn_dataset_1` 来自本项目 | ||||||
| * 数据集 single_turn_dataset_2 来自本项目 | * 数据集 `single_turn_dataset_2` 来自本项目 | ||||||
| 
 | 
 | ||||||
| ### **Role-play** | ### **Role-play** | ||||||
| 
 | 
 | ||||||
| * 数据集 aiwei 来自本项目 | * 数据集 `aiwei` 来自本项目 | ||||||
| * 数据集 tiangou 来自本项目 | * 数据集 `tiangou` 来自本项目 | ||||||
| * 数据集 SoulStar 来源 [SoulStar](https://github.com/Nobody-ML/SoulStar) | * 数据集 `SoulStar` 来源 [SoulStar](https://github.com/Nobody-ML/SoulStar) | ||||||
| * 数据集 mother 来自本项目 | * 数据集 `mother` 来自本项目 | ||||||
| * 数据集 scientist 来自本项目 | * 数据集 `scientist` 来自本项目 | ||||||
| 
 | 
 | ||||||
| ## 数据集去重 | ## 数据集去重 | ||||||
| 
 | 
 | ||||||
|  | |||||||
| @ -17,15 +17,15 @@ | |||||||
| |   Category  |        Dataset        |     Type     |  Total  | | |   Category  |        Dataset        |     Type     |  Total  | | ||||||
| | :---------: | :-------------------: | :----------: | :-----: | | | :---------: | :-------------------: | :----------: | :-----: | | ||||||
| |  *General*  |         data          | Conversation |  5600+  | | |  *General*  |         data          | Conversation |  5600+  | | ||||||
| |  *General*  |       data_pro        | Conversation | 36500+  | | |  *General*  |       data_pro        | Conversation | 36,500+ | | ||||||
| |  *General*  | multi_turn_dataset_1  | Conversation | 36,000+ | | |  *General*  | multi_turn_dataset_1  | Conversation | 36,000+ | | ||||||
| |  *General*  | multi_turn_dataset_2  | Conversation | 27,000+ | | |  *General*  | multi_turn_dataset_2  | Conversation | 27,000+ | | ||||||
| |  *General*  | single_turn_dataset_1 |      QA      | 14000+  | | |  *General*  | single_turn_dataset_1 |      QA      | 14,000+ | | ||||||
| |  *General*  | single_turn_dataset_2 |      QA      | 18300+  | | |  *General*  | single_turn_dataset_2 |      QA      | 18,300+ | | ||||||
| | *Role-play* |         aiwei         | Conversation |  4000+  | | | *Role-play* |         aiwei         | Conversation |  4000+  | | ||||||
| | *Role-play* |       SoulStar        |      QA      | 11200+  | | | *Role-play* |       SoulStar        |      QA      | 11,200+ | | ||||||
| | *Role-play* |        tiangou        | Conversation |  3900+  | | | *Role-play* |        tiangou        | Conversation |  3900+  | | ||||||
| | *Role-play* |        mother         | Conversation | 24,500+ | | | *Role-play* |        mother         | Conversation | 40,300+ | | ||||||
| | *Role-play* |       scientist       | Conversation | 28,400+ | | | *Role-play* |       scientist       | Conversation | 28,400+ | | ||||||
| |     ……      |          ……           |      ……      |   ……    | | |     ……      |          ……           |      ……      |   ……    | | ||||||
| 
 | 
 | ||||||
|  | |||||||
							
								
								
									
										75451
									
								
								datasets/mother_v1.json
									
									
									
									
									
										Normal file
									
								
							
							
						
						
									
										75451
									
								
								datasets/mother_v1.json
									
									
									
									
									
										Normal file
									
								
							
										
											
												File diff suppressed because it is too large
												Load Diff
											
										
									
								
							| @ -1 +0,0 @@ | |||||||
| 
 |  | ||||||
		Loading…
	
		Reference in New Issue
	
	Block a user
	 MING_X
						MING_X