• Stars
    star
    159
  • Rank 235,916 (Top 5 %)
  • Language
    Python
  • Created over 4 years ago
  • Updated about 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.

Voice Cloning and Text to Speech Synthesis

A Standalone service for cloning your own voice and synthesize any text in English in your own voice.


Read more about the procedure we followed and the findings here


Functionalities

  • Clone voices after feeding samples to it
  • Synthesized voice on custom texts
  • Speech-to-text facility for input using microphones
  • RestAPI with a testing UI for testing the model

Instructions to run the trained models

If you want to try and test out the samples trained and how the model is performing on custom text you can follow these instructions.

  • Pre-requisites:

    • For Windows

      • python (3.6 or 3.7 works best)
      • virtualenv

        If you dont have virtualenv check it out here to install

      • Trained embeddings from here
    • For linux

      • Bash shell for executing the scripts
  • Directions to install:

    • For windows
      • Clone the repo
      • Setup Virtualenv
         virtualenv env
         cd env/scripts
         activate
      • Install all requirements packages
         pip install -r requirements.txt
    • For Linux
      • Run the run.sh file to install the project
         ./run.sh

    After installing all the dependencies and environment prerequisites run the below file to check you are ready and good to go!
     ./test.sh
  • Directions to execute

    • Test through interface

      • Start the python flask server
         python app.py
      • Log on to localhost:5000 to test the model
    • Test through the Synthesize function

      • Follow the instructions given here
    • Test using Docker

        docker build -t smoketrees/voice:latest .
        docker run smoketrees/voice:latest -p 5000:5000

Instructions to train your own models

If you want to work with the source code and want to train your own models on different dataset and different language medium you can check out the instructions mentioned here

For more information about the samples tested and there results you can get all the information from here

Contributors

More Repositories

1

ssr-react

JavaScript
13
star
2

website

Our website.
JavaScript
9
star
3

node-template-ts

TypeScript version of our Node.JS template
TypeScript
8
star
4

fastapi-template

A template to quickly bootstrap FastAPI projects, to create a backend in Python.
Python
7
star
5

sparc

Smart Grid solution which compiles home networks and grids in an effecient manner, controlled by recurrent networks which predict distribution and consumption and also supported by an energy credit system. All running as microservices supporting each other.
Jupyter Notebook
7
star
6

nodejs-backend-template

Standardized template for SmokeTrees node.js projects.
JavaScript
6
star
7

fast-style-transfer

Making people look like anime characters
Jupyter Notebook
6
star
8

ATLAS-predictors

Total Surveillance for Infiltrators, a defense security solutions suite, sort documents on the fly for malcontent, configure drones for maximum area coverage, send communications via commo hubs, protected by swarm and blockchain. TLDR : Automated Solutions for Counter Insurgency. This repo contains the AI/ML models.
Python
5
star
9

forest-utils

Pypi Package for Smoketrees model zoo
Python
4
star
10

gesture-drive

An autonomous vehicle solution for people who are physically challenged, calibrates limb angle to drive steering wheel and operate pedals. Made by team Smoketrees for SIH 2020 and Mercedes Benz Hack 2020.
Jupyter Notebook
4
star
11

golash

Utility library inspired by lodash
Go
4
star
12

uproar

An application for accumulating all disaster updates in one forum, parsing them and manipulating the data so that only accurate information reaches the people. Features include SOS, mapping, video surveillance and many more features to come.
HTML
4
star
13

model-zoo

The contributions/pull repo for smoketrees model zoo
HTML
3
star
14

rust-backend-template

A rust backend template initialized with clean architecture.
Rust
3
star
15

CESS

Comprehensive Electoral Solutions Suite - made for DEVSOC 2019, an elections suite software which merges RFID and fingerprint recognition with extra layers of security which checks emotion when you vote while securing your transaction via a smart ethereum contract. Passively it also scours the internet for toxic comments which might influence the voters.
JavaScript
3
star
16

uproar-flutter

Flutter App for UpRoar
Dart
2
star
17

blockchain-contract-api

sample blockchain app on the Ethereum network using Solidity
JavaScript
2
star
18

aes-passwd

NPM package to encrypt data using 2 passwords. It uses aes-cbc-256 to encrypt the data and generate key and iv from the passwords.
JavaScript
2
star
19

ATLAS-app

Total Surveillance for Infiltrators, a defense security solutions suite, sort documents on the fly for malcontent, configure drones for maximum area coverage, send communications via commo hubs, protected by swarm and blockchain. TLDR : Automated Solutions for Counter Insurgency. This repo contains the app for the idea.
Java
2
star
20

django-template

An opinionated template for quickly getting started with a django webapp
Python
1
star
21

go-template

An opinionated Go template for building web applications
Go
1
star
22

Generato

Winner of HackerTech, 2019, hosted by E-Cell, VIT. Also selected for Smart India Hackathon (SIH), 2020.
Jupyter Notebook
1
star
23

website-template

JavaScript
1
star
24

Deep_Facial_Recognition

A fast and accurate facial recognition system.
Jupyter Notebook
1
star
25

path-prediction

An intuitive path prediction app
Jupyter Notebook
1
star
26

Toxic-Comment---ML

DL model which classifies a given input sentence into 6 different labels. It is RNN which consists of various layers like LSTM, DropOut, Embeddings etc. Tokenizer and padding is used for performing the text preprocessing.
Python
1
star