• Stars
    2,419
  • Rank 19,018 (Top 0.4 %)
  • Language
    Python
  • License
    MIT License
  • Created over 1 year ago
  • Updated 12 months ago

Repository Details

Yet another voice assistant, but alive.

Digital Life Server

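This is the server-side code of the "Digital Life" project. It covers communication with the front end, speech recognition, ChatGPT integration, and speech synthesis.
For the other parts of the project, please refer to:
Launcher: a graphical interface for starting this server.
UE Client: the front end, which renders the character animation, records audio, and plays sound.
For the detailed setup procedure, see readme_detail.md.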

Getting things ready to roll:

Clone this repo

git clone https://github.com/zixiiu/Digital_Life_Server.git --recursive

Install prerequisites

  1. Install PyTorch

    pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
  2. Install the other requirements

    pip install -r requirements.txt
  3. Build monotonic_align
    These commands may not work as-is on every setup, but they show what is supposed to happen.

    cd "TTS/vits/monotonic_align"
    mkdir monotonic_align
    python setup.py build_ext --inplace
    cp monotonic_align/*.pyd .
  4. Download models
    Baidu Netdisk
    ASR Model: to /ASR/resources/models
    Sentiment Model: to /SentimentEngine/models
    TTS Model: to /TTS/models

  5. (For computers without an Nvidia GPU that run on the CPU) one extra step is required:

    In Digital_Life_Server\TTS\TTService.py, on line 36, change

    self.net_g = SynthesizerTrn(...).cuda()
    to
    self.net_g = SynthesizerTrn(...).cpu()
    

    At this point, the project setup is complete 🥰
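
Step 5 hard-codes the device by hand. As an alternative, the choice could be made at runtime. The following is a minimal sketch, not the repository's own code: `pick_device` and `detect_device` are hypothetical helper names, and it assumes PyTorch may or may not be installed on the machine.

```python
import importlib.util


def pick_device(cuda_available: bool) -> str:
    """Map the CUDA availability flag to a PyTorch device string."""
    return "cuda" if cuda_available else "cpu"


def detect_device() -> str:
    """Return "cuda" if PyTorch is installed and sees a CUDA GPU, else "cpu"."""
    # Fall back to CPU when PyTorch itself is missing, so this helper
    # never raises ImportError.
    if importlib.util.find_spec("torch") is None:
        return "cpu"
    import torch

    return pick_device(torch.cuda.is_available())


if __name__ == "__main__":
    print("Selected device:", detect_device())
```

With such a helper, line 36 could read `self.net_g = SynthesizerTrn(...).to(detect_device())` instead of calling `.cuda()` or `.cpu()` directly. Any other hard-coded `.cuda()` calls in the project, if present, would need the same treatment.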

Start the server

run-gpt3.5-api.bat
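
Before launching, it can help to confirm that the model folders from step 4 are populated. The sketch below is a hypothetical preflight check, not part of the repository. It assumes the three paths listed in step 4 are relative to the repository root and that any non-empty folder counts as "downloaded".

```python
from pathlib import Path

# Model folders named in step 4, assumed relative to the repository root.
MODEL_DIRS = ("ASR/resources/models", "SentimentEngine/models", "TTS/models")


def missing_model_dirs(repo_root):
    """Return the model folders that are absent or empty under repo_root."""
    root = Path(repo_root)
    missing = []
    for rel in MODEL_DIRS:
        folder = root / rel
        # A folder counts as missing if it does not exist or has no entries.
        if not folder.is_dir() or not any(folder.iterdir()):
            missing.append(rel)
    return missing


if __name__ == "__main__":
    # Run from the repository root.
    problems = missing_model_dirs(".")
    if problems:
        print("Missing or empty model folders:", ", ".join(problems))
    else:
        print("All model folders are populated.")
```

The check only looks at folder contents. It does not verify that the files are the right models or that the downloads are complete.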