smoke-trees/Voice-synthesis

Stars: 159
Rank: 235,916 (Top 5 %)
Language: Python
Created over 4 years ago, updated about 4 years ago

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real time. SV2TTS is a three-stage deep learning framework that creates a numerical representation of a voice from a few seconds of audio and uses it to condition a text-to-speech model trained to generalize to new voices.
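
The three-stage flow described above (speaker encoder, synthesizer, vocoder) can be pictured with a minimal sketch. Everything here is hypothetical and for illustration only: `SV2TTSPipeline`, `clone`, and the `dummy_*` stages are assumed names, not this repository's API, and the dummy stages are toy arithmetic standing in for the trained networks.

```python
from dataclasses import dataclass
from typing import Callable, List

Vector = List[float]


@dataclass
class SV2TTSPipeline:
    """Hypothetical wiring of the three SV2TTS stages; each stage is a plain callable."""
    encode_speaker: Callable[[Vector], Vector]         # reference audio -> speaker embedding
    synthesize: Callable[[str, Vector], List[Vector]]  # text + embedding -> spectrogram frames
    vocode: Callable[[List[Vector]], Vector]           # spectrogram frames -> waveform samples

    def clone(self, reference_audio: Vector, text: str) -> Vector:
        embedding = self.encode_speaker(reference_audio)  # stage 1
        frames = self.synthesize(text, embedding)         # stage 2, conditioned on the voice
        return self.vocode(frames)                        # stage 3


# Toy stand-ins (assumptions) so the sketch runs without any trained model.
def dummy_encoder(audio: Vector) -> Vector:
    return [sum(audio) / len(audio), max(audio) - min(audio)]


def dummy_synthesizer(text: str, embedding: Vector) -> List[Vector]:
    return [list(embedding) for _ in text]  # one frame per character


def dummy_vocoder(frames: List[Vector]) -> Vector:
    return [sum(frame) for frame in frames]  # one sample per frame


pipeline = SV2TTSPipeline(dummy_encoder, dummy_synthesizer, dummy_vocoder)
waveform = pipeline.clone([0.0, 0.5, -0.5, 1.0], "hi")
```

The point of the sketch is the data flow: the speaker embedding is computed once from the reference audio and then passed to the synthesizer alongside the text, so the same text can be rendered in different voices by swapping the reference audio.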

Voice Cloning and Text to Speech Synthesis

A standalone service for cloning your own voice and synthesizing any English text in that voice.

Read more about the procedure we followed and the findings here

Functionalities

Clone a voice from audio samples
Synthesize speech in the cloned voice from custom text
Speech-to-text input through a microphone
REST API with a UI for testing the model

Instructions to run the trained models

To try out the trained samples and see how the model performs on custom text, follow these instructions.

Prerequisites:
- For Windows
  - Python (3.6 or 3.7 works best)
  - virtualenv
    
    If you don't have virtualenv, follow the installation guide here
  - Trained embeddings from here
- For Linux
  - Bash shell for executing the scripts

Directions to install:
- For Windows
  - Clone the repo
  - Set up the virtualenv
```
 virtualenv env
 env\Scripts\activate
```
  - Install the required packages
```
 pip install -r requirements.txt
```
- For Linux
  - Run the run.sh file to install the project
```
 ./run.sh
```
After installing the dependencies and setting up the environment, run the script below to check that everything is ready:
```
 ./test.sh
```
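
What `test.sh` actually checks is not shown here. As a separate, hypothetical preflight, the two prerequisites stated above (Python 3.6 or 3.7, and an active virtualenv) could be verified with a short script. The names `python_version_ok` and `in_virtualenv` are assumptions for illustration, not part of this repository.

```python
import sys

# Versions listed above as working best.
RECOMMENDED = ((3, 6), (3, 7))


def python_version_ok(version_info=sys.version_info):
    """Return True if the (major, minor) version is one of the recommended ones."""
    return tuple(version_info[:2]) in RECOMMENDED


def in_virtualenv():
    """Return True if the interpreter appears to run inside a virtual environment."""
    # Older virtualenv releases set sys.real_prefix; newer virtualenv and venv
    # make sys.prefix differ from sys.base_prefix instead.
    base_prefix = getattr(sys, "base_prefix", sys.prefix)
    return hasattr(sys, "real_prefix") or sys.prefix != base_prefix


if __name__ == "__main__":
    print("Python version recommended:", python_version_ok())
    print("Inside a virtualenv:", in_virtualenv())
```

Run it with the same interpreter that will run the project; a `False` on either line does not necessarily mean the project will fail, only that the setup differs from the one described above.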
Directions to execute
- Test through interface
  - Start the Python Flask server
```
 python app.py
```
  - Open http://localhost:5000 in a browser to test the model
- Test through the Synthesize function
  - Follow the instructions given here
- Test using Docker
```
  docker build -t smoketrees/voice:latest .
  docker run -p 5000:5000 smoketrees/voice:latest
```
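
Whichever way the server is started (`python app.py` or Docker), a quick way to confirm that something is answering on port 5000 before opening the UI is a small probe like the one below. This is a sketch under stated assumptions: `server_is_up` is a hypothetical helper, and it only requests the root URL because the API's routes are not documented here.

```python
import urllib.error
import urllib.request


def server_is_up(url="http://localhost:5000", timeout=3.0):
    """Return True if an HTTP server answers at `url`, False if nothing responds."""
    # An empty ProxyHandler bypasses any system proxy settings, which would
    # otherwise interfere with requests to localhost.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server replied with an error status, so it is running.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or similar: nothing is listening.
        return False


if __name__ == "__main__":
    print("Server reachable:", server_is_up())
```

If this prints `False` when using Docker, a likely cause is that the container port was not published to the host.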

Instructions to train your own models

If you want to work with the source code and train your own models on a different dataset or in a different language, see the instructions mentioned here

For more information about the samples tested and their results, see here
