• Stars
    star
    374
  • Rank 114,346 (Top 3 %)
  • Language
    Python
  • License
    Apache License 2.0
  • Created over 6 years ago
  • Updated almost 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

πŸ—£οΈ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).

Voicebook

Tweet

This is an assortment of all the scripts associated with the Intro to Voice Computing Book. These scripts should give you a jumpstart in learning how to write Python code for voice-related applications.

^^ Note: If you find this code useful, please leave a star :) ^^

Voicebook video

Getting started: setting up environment (mac)

Watch the overview video for this book here.

Note that these are the instructions for Mac computers; you may need some custom setup for FFmpeg and/or SoX if you're using a Windows and/or Linux computer.

First, clone the repository and submodules:

git clone --recurse-submodules -j8 https://github.com/jim-schwoebel/voicebook

Now you need to run the setup.py script in a virtual environment to make sure you have all the required dependencies for all the chapters of the book. To do this, run:

cd voicebook
pip3 install virtualenv
virtualenv env 
source env/bin/activate
python3 setup.py

Now you have all the dependencies necessary to follow along with the chapters in the book. You don’t need to worry about any other installations.

License

This repository is licensed under the Apache 2.0 License.

Feedback

Any feedback on the book or this repository is greatly appreciated.

  • If you find something that is missing or doesn't work, please consider opening a GitHub issue.
  • If you'd like to be mentored by someone on our team, check out the Innovation Fellows Program.
  • If you are looking for a framework to start building machine learning models in voice computing, check out Allie.
  • If you want to talk to me directly, please send me an email @ [email protected].

Citation

Please use the following citation when citing this book in your research work:

References

Check out the wiki or documentation below to follow along with each chapter in the book. In this way, you can get quickly up to speed with the 200+ scripts included in this repository.

More Repositories

1

voice_datasets

πŸ”Š A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
1,658
star
2

allie

πŸ€– An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.
Python
140
star
3

download_audioset

πŸ“ This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Python
97
star
4

voice_gender_detection

♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
Python
77
star
5

sound_event_detection

🎡 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
Python
39
star
6

nala

🦁 Nala is an agile open-source voice assistant framework (20+ actions).
Python
35
star
7

pauses

🎀 quick library to extract pause lengths from audio files.
Python
31
star
8

audioset_models

πŸ“Š Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
Python
29
star
9

voiceome

πŸ₯ 🎀 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances, 80+ health labels). Preprint: https://www.medrxiv.org/content/10.1101/2021.08.16.21262125v1
C++
28
star
10

nala_assistant

πŸ”ŠπŸ˜Š A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.
JavaScript
24
star
11

dopg_cli

🐍🐳🐘 A python command line interface for DigitalOcean postgres clusters (5+ integrations).
Python
12
star
12

sample_voice_data

Sample voice data, 52 males and 52 females as classes - useful for testing.
Python
9
star
13

youtube_scrape

πŸ“Ή Library for making playlists and scraping youtube videos - alternative to pafy, pytube, and youtube-dl.
Python
7
star
14

video_record

πŸ“Ή record videos (w/ audio and screenshare) on mac computers.
Python
6
star
15

pylicense

πŸ—’οΈ Quickly make license docs for your code base.
Python
3
star