Commit Graph

26 Commits

Author SHA1 Message Date
HongCheng
042146af56 Revert "modified merge_jsonl and merge_jsonl_r"
This reverts commit a38ef60058.
2024-03-18 22:13:35 +09:00
HongCheng
a833832908
Delete generate_data/upload_openxlab.py 2024-03-18 20:45:48 +09:00
HongCheng
19edf026eb
Delete generate_data/trans_process.py 2024-03-18 20:45:18 +09:00
HongCheng
b894b87a9d Revert "moving files in scripts related to generate data to generate_data folder"
This reverts commit 05ff5e8407.
2024-03-18 20:43:39 +09:00
HongCheng
a38ef60058 modified merge_jsonl and merge_jsonl_r
merge_jsonl merges the jsonl files in a folder
merge_jsonl_r merges the jsonl files in each of a folder's subfolders

usage:
python merge_jsonl_r.py > qwen2.txt
python merge_jsonl_r.py > zhipuai.txt

python merge_jsonl.py > curr.txt

│   学业_merge.json
│   家人_merge.json
│   就业_merge.json
│   工作_merge.json
│   恋爱_merge.json
│   朋友_merge.json
│   环境_merge.json
│   生活_merge.json
│   社交_merge.json
│   责任_merge.json
│   身体_merge.json
│   隐私_merge.json
│
├───学业
│       兴奋.jsonl
│       冷静.jsonl
│       厌倦.jsonl
│       厌恶.jsonl
│       同情.jsonl
│       困惑.jsonl
│       娱乐.jsonl
│       嫉妒.jsonl
│       尴尬.jsonl
│       崇拜.jsonl
│       快乐.jsonl
│       怀旧.jsonl
│       性欲.jsonl
│       恐惧.jsonl
│       悲伤.jsonl
│       敬畏.jsonl
│       有趣.jsonl
│       欣赏.jsonl
│       浪漫.jsonl
│       渴望.jsonl
│       满意.jsonl
│       满足.jsonl
│       焦虑.jsonl
│       痛恨.jsonl
│       痛苦.jsonl
│       着迷.jsonl
│       钦佩.jsonl
│
├───家人
│       兴奋.jsonl
│       冷静.jsonl
│       厌倦.jsonl
│       厌恶.jsonl
│       同情.jsonl
│       困惑.jsonl
│       娱乐.jsonl
│       嫉妒.jsonl
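
The layout above suggests one `<subfolder>_merge.json` per topic folder, built from that folder's `.jsonl` files. A minimal sketch of that idea follows; it is not the repository's actual script. The function names, the JSON-array output format, and the `<subfolder>_merge.json` naming rule are assumptions inferred from the tree.

```python
import json
from pathlib import Path


def merge_jsonl_folder(folder: Path) -> list:
    """Collect the records of every *.jsonl file directly inside `folder`."""
    records = []
    for path in sorted(folder.glob("*.jsonl")):
        with path.open(encoding="utf-8") as f:
            for line in f:
                line = line.strip()
                if line:  # skip blank lines between records
                    records.append(json.loads(line))
    return records


def merge_jsonl_recursive(root: Path) -> dict:
    """Write `<subfolder>_merge.json` into `root` for each subfolder of `root`.

    Assumption: the merged output is a single JSON array per subfolder.
    Returns a mapping of subfolder name to the number of merged records.
    """
    counts = {}
    # Materialize the subfolder list before writing any output files.
    for sub in sorted(p for p in root.iterdir() if p.is_dir()):
        records = merge_jsonl_folder(sub)
        out = root / f"{sub.name}_merge.json"
        out.write_text(
            json.dumps(records, ensure_ascii=False, indent=2),
            encoding="utf-8",
        )
        counts[sub.name] = len(records)
    return counts
```

`ensure_ascii=False` keeps Chinese text readable in the merged files instead of escaping it as `\uXXXX` sequences.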
2024-03-18 20:16:39 +09:00
HongCheng
4bcacd8b67
Delete generate_data/curr_merge.json
This is a merged file
2024-03-17 10:19:14 +09:00
HongCheng
baba0d611d move scripts/upload_openxlab.py and scripts/trans_process.py 2024-03-17 10:13:18 +09:00
HongCheng
05ff5e8407 moving files in scripts related to generate data to generate_data folder
also optimize merge_json.py
2024-03-17 10:05:50 +09:00
HongCheng
b47a102994
Update zhipuai_gen_data.py: add exception handling for fetching the glm-4 response
# Solution
``` python
    try:
        response = client.chat.completions.create(
            model='glm-4',
            messages=messages,
            top_p=top_p,
        )
    except Exception:  # retry the same request once; a second failure propagates to the caller
        response = client.chat.completions.create(
            model='glm-4',
            messages=messages,
            top_p=top_p,
        )

``` 
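# Error
``` python
    # Error code: 400, with error text {"error":{"code":"1301","message":"系统检测到输入或生成内容可能包含不安全或敏感内容,请您避免输入易产生敏感内容的提示语,感谢您的配合。"}}
    # Message in English: "The system detected that the input or generated content may contain unsafe or sensitive content. Please avoid prompts that are likely to produce sensitive content. Thank you for your cooperation."
```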
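
The fix above retries the request exactly once. A bounded retry helper is one way to generalize it. The sketch below is an illustration, not code from this commit: `call_with_retry`, `max_attempts`, and `delay_seconds` are hypothetical names, and it assumes any exception is worth retrying.

``` python
import time


def call_with_retry(request, max_attempts=3, delay_seconds=1.0):
    """Call `request()` until it succeeds or `max_attempts` calls have failed."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    last_error = None
    for attempt in range(1, max_attempts + 1):
        try:
            return request()
        except Exception as error:  # assumption: every exception is retryable
            last_error = error
            if attempt < max_attempts:
                time.sleep(delay_seconds)  # brief pause before the next attempt
    # All attempts failed: re-raise the most recent error.
    raise last_error


# Usage with the call from the commit message (client, messages, top_p as defined there):
# response = call_with_retry(
#     lambda: client.chat.completions.create(
#         model='glm-4', messages=messages, top_p=top_p,
#     )
# )
```

Whether a retry helps with error 1301 likely depends on whether the flagged content came from the prompt or from the generated text. If the prompt itself triggers the filter, repeating the same request may fail every time.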
2024-03-17 08:45:33 +09:00
HongCheng
a9721ea582
Update config.yml 2024-03-16 22:02:11 +09:00
HongCheng
33e11caf97
Update requirements.txt 2024-03-16 22:01:35 +09:00
HongCheng
93067cfd64
Update config.yml 2024-03-16 22:00:51 +09:00
HongCheng
a82ae114ea
Update config.yml in generate_data 2024-03-16 21:57:29 +09:00
HongCheng
d51c894d3f
Update requirements.txt 2024-03-16 21:56:19 +09:00
HongCheng
eb12c3aa2b update qwen, zhipuai gen_data and readme 2024-03-16 21:46:02 +09:00
xzw
0fabdb8827
Merge pull request #62 from SmartFlowAI/dev
[DOC] add ocr.md
2024-03-13 00:15:42 +08:00
MING_X
53e68a804e
Update tutorial_EN.md 2024-03-11 15:28:00 +08:00
MING_X
af8c4eb85a
Update tutorial.md 2024-03-11 15:22:39 +08:00
8baby8
c644e3c696 Merge branch 'main' of github.com:8baby8/EmoLLM 2024-03-11 13:15:50 +08:00
8baby8
2fc3caeeba add ocr.md 2024-03-11 13:15:22 +08:00
Vicky
27458ef8ac Add EN md documentation 2024-03-10 15:52:18 +08:00
MING_X
05ef864dfe
Rename Introduction.markdown to Readme.md 2024-02-29 22:52:41 +08:00
MING_X
43f58c7e56
Update Introduction.markdown 2024-02-29 22:50:48 +08:00
MING_X
e933af7e23
Update 说明.markdown 2024-02-29 22:40:34 +08:00
jupyter
294d5d1d60 feat: add datasets and update readme 2024-01-26 22:43:38 +08:00
8baby8
c5254ace61 add data_process 2024-01-23 21:25:12 +08:00