小橄榄

Go to file

xszyou f8ff5a2187 周末愉快 + 增加运行时自动清理ui缓存； + 增加gpt代理设置可为空； + 提高灵聚对接的稳定性。		2023-07-28 18:22:44 +08:00
ai_module	周末愉快	2023-07-28 18:22:44 +08:00
android_connector_demo	2023.02.10	2023-02-10 19:50:47 +08:00
bin	提供funasr集成	2023-05-17 18:38:47 +08:00
core	今天星期三	2023-07-26 18:11:52 +08:00
gui	周末愉快	2023-07-28 18:22:44 +08:00
images	今天星期三	2023-07-26 18:11:52 +08:00
python_connector_demo	修复多个bug	2023-05-23 00:49:31 +08:00
scheduler	Update	2022-06-20 11:05:10 +08:00
shell	2022.10.27	2022-10-27 14:58:48 +08:00
test	今天星期三	2023-07-26 18:11:52 +08:00
utils	今天星期三	2023-07-26 18:11:52 +08:00
.gitignore	Update	2022-06-20 11:05:10 +08:00
[Start] PowerShell.bat	2023.01	2023-01-31 12:40:36 +08:00
[Start].bat	2023.01	2023-01-31 12:40:36 +08:00
config.json	助理版多处优化	2023-06-14 20:34:36 +08:00
favicon.ico	助理版多处优化	2023-06-14 20:34:36 +08:00
fay_booter.py	20230719	2023-07-19 22:45:03 +08:00
icon.png	Update	2022-06-20 11:05:10 +08:00
LICENSE	Update	2022-06-20 11:05:10 +08:00
main.py	20230719	2023-07-19 22:45:03 +08:00
qa_demo.xlsx	2022.10.17	2022-10-17 11:32:51 +08:00
README_EN.md	周末愉快	2023-07-28 18:22:44 +08:00
README.md	周末愉快	2023-07-28 18:22:44 +08:00
requirements.txt	重构nlp模块逻辑	2023-06-28 19:44:34 +08:00
system.conf	周末愉快	2023-07-28 18:22:44 +08:00
WebSocket.md	今天星期三	2023-07-26 18:11:52 +08:00
yolov8n-pose.pt	多模态的支持	2023-05-27 17:03:43 +08:00

README_EN.md

中文

FAY

Fay Digital Human Assistant

Fay Digital Human Assistant Edition is an important branch of the Fay open-source project, focusing on building open-source solutions for intelligent digital assistants. It offers a flexible and modular design that allows developers to customize and combine various functional modules, including emotion analysis, NLP processing, speech synthesis, and speech output, among others. Fay Digital Assistant Edition provides developers with powerful tools and resources for building intelligent, personalized, and multifunctional digital assistant applications. With this edition, developers can easily create digital assistants applicable to various scenarios and domains, providing users with intelligent voice interactions and personalized services.

Fay Digital Assistant Edition

ProTip:The shopping edition has been moved to a separate branch.fay-sales-edition

*Assistant Fay controller use: voice communication, voice and text reply;**Text communication, text reply;*To connect UE, live2d, and xuniren, you need to close the panel for playback.

Assistant Fay controller

Remote Android　　　　　　Local PC　　　　　Remote PC

　　　　　└─────────────┼─────────────┘

　　　　　　Aliyun API ─┐　　　│

　　　　　　　　　　　├── ASR　　　

　　　　　　 FunASR ─┘　　　│　　　 ┌─ Yuan 1.0

　　　　　　　　　　　　　　　 │　　　 ├─ LingJu

　　　　　　　　　　　　　　NLP ────┼─ GPT/ChatGPT

　　　　　　　　　　　　　　　│　　　 ├─ Rasa+ChatGLM-6B

　　　　　　　　 Azure ─┐　　 │　　　 ├─ VisualGLM

　　　　　　 Edge TTS ─┼── TTS 　　 └─ RWKV

　　　Open source TTS ─┘　 │　　　

　　　　　　　　　　　　　　 │　　　

　　　 ┌──────────┬────┼───────┬─────────┐

Remote Android　　Live2D　　 UE　　　 xuniren　　　Remote PC

Important: Communication interface between Fay (server) and digital human (client): 'ws://127.0.0.1:10002' (connected)

Message format: View WebSocket.md

代码结构

.

├── main.py	            # Program main entry
├── fay_booter.py	    # Core boot module
├── config.json		    # Controller configuration file
├── system.conf		    # System configuration file
├── ai_module
│   ├── ali_nls.py	        # Aliyun Real-time Voice
│   ├── ms_tts_sdk.py       # Microsoft Text-to-Speech
│   ├── nlp_lingju.py       # Lingju Human-Machine Interaction - Natural Language Processing
│   ├── xf_aiui.py          # Xunfei Human-Machine Interaction - Natural Language Processing
│   ├── nlp_gpt.py          # GPT API integration
│   ├── nlp_chatgpt.py      # Reverse integration with chat.openai.com
│   ├── nlp_yuan.py         # Langchao. Yuan model integration
│   ├── nlp_rasa.py         # Preceding Rasa conversation management based on ChatGLM-6B (highly recommended)
│   ├── nlp_VisualGLM.py    # Integration with multimodal large language model VisualGLM-6B
│   ├── nlp_rwkv.py         # Offline integration with rwkv
│   ├── nlp_rwkv_api.py     # rwkv server API
│   ├── yolov8.py           # YOLOv8 object detection
│   └── xf_ltp.py           # Xunfei Sentiment Analysis
├── bin                     # Executable file directory
├── core                    # Digital Human Core
│   ├── fay_core.py         # Digital Human Core module
│   ├── recorder.py         # Recorder
│   ├── tts_voice.py        # Speech synthesis enumeration
│   ├── authorize_tb.py     # fay.db authentication table management
│   ├── content_db.py       # fay.db content table management
│   ├── interact.py         # Interaction (message) object
│   ├── song_player.py      # Music player (currently unavailable)
│   └── wsa_server.py       # WebSocket server
├── gui                     # Graphical interface
│   ├── flask_server.py     # Flask server
│   ├── static
│   ├── templates
│   └── window.py           # Window module
├── scheduler
│   └── thread_manager.py   # Scheduler manager
├── utils                   # Utility modules
│   ├── config_util.py
│   ├── storer.py
│   └── util.py
└── test                    # All surprises

Upgrade Log

2023.07.28：

Add runtime automatic cleaning of UI cache;
Add GPT proxy setting can be null;
Improve the stability of Lingju docking.

2023.07.21：

Fixed the problem of generating a large amount of WS information before connecting digital humans;
Add digital human (UE, Live2D, Xuniren) communication interface: real-time logs;
Update digital human (UE, Live2D, Xuniren) communication interface: audio push.

2023.07.21：

Multiple updates for the merchandise version.

2023.07.19：

Fixed the issue of remote voice recognition.
Fixed the issue of occasional unresponsiveness during ASR (Automatic Speech Recognition).
Removed the singing command.

2023.07.14：

Fixed Linux and macOS runtime errors.
Fixed the issue of being unable to continue execution due to lip-sync errors.
Provided an integration solution for RWKV.

2023.07.12：

Fixed an issue in Assistant Edition where text input does not read persona responses.
Fixed an issue in Assistant Edition where text input does not read QA responses.
Enhanced microphone stability.

2023.07.05：

Fixed a sound playback issue caused by the inability to run the lip-sync algorithm.

2023.06：

Refactored NLP module management logic for easier extension.
Split GPT into ChatGPT and GPT, replaced with a new GPT interface, and added the ability to configure proxy servers separately.
Specified the version of the YOLOv8 package to resolve YOLO compatibility issues.
Fixed self-talk bug and receiving multiple messages to be processed bug.
Integrated Lingju NLP API (supporting GPT3.5 and multiple applications).
UI corrections.
Integrated local lip-sync algorithm.
Resolved compatibility issues with multi-channel microphones.
Refactored fay_core.py and fay_booter.py code.
UI layout adjustments.
Restored sound selection.
Fixed logic for displaying "Thinking..."

Installation Instructions

Environment

Python 3.9、3.10
Windows、macos、linux

Installing Dependencies

pip install -r requirements.txt

Configuring Application Key

View API Modules
Browse the link, register, and create an application. Fill in the application key in ./system.conf

Starting

Starting Fay Controller

python main.py

API Modules

Application Key needs to be filled in before starting

File	Description	Link
./ai_module/ali_nls.py	Real-time Speech Recognition (Optional)	https://ai.aliyun.com/nls/trans
./ai_module/ms_tts_sdk.py	Microsoft Text-to-Speech with Emotion (Optional)	https://azure.microsoft.com/zh-cn/services/cognitive-services/text-to-speech/
./ai_module/xf_ltp.py	Xunfei Sentiment Analysis(Optional)	https://www.xfyun.cn/service/emotion-analysis
./utils/ngrok_util.py	ngrok.cc External Network Penetration (optional)	http://ngrok.cc
./ai_module/nlp_lingju.py	Lingju NLP API (supports GPT3.5 and multiple applications)(Optional)	https://open.lingju.ai Contact customer service to enable GPT3.5 access
./ai_module/yuan_1_0.py	Langchao Yuan Model (Optional)	https://air.inspur.com/

Instructions for Use

Voice Assistant: Fay Controller (with microphone input source enabled and panel playback enabled).
Remote Voice Assistant: Fay Controller (with panel playback disabled) + Remote device integration.
Digital Human Interaction: Fay Controller (with microphone input source enabled, panel playback disabled, and personality Q&A filled) + Digital Human.
Jarvis, Her: Join us to complete the experience together.

Voice Commands

Shut down	Mute	Unmute
Shut down, Goodbye, Go away	Mute, Be quiet, I want silence	Unmute, Where are you, You can speak now

For business inquiries

**business QQ **: 467665317

README_EN.md Unescape Escape

FAY

Fay Digital Human Assistant

Fay Digital Assistant Edition

Assistant Fay controller

Upgrade Log

Installation Instructions

Environment

Installing Dependencies

Configuring Application Key

Starting

API Modules

Instructions for Use

Instructions for Use

Voice Commands

For business inquiries

README_EN.md