  • Stars: 374
  • Rank: 113,730 (Top 3%)
  • Language: Python
  • License: Apache License 2.0
  • Created over 6 years ago
  • Updated almost 2 years ago

Repository Details

๐Ÿ—ฃ๏ธ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).

Voicebook

This is an assortment of all the scripts associated with the Intro to Voice Computing Book. These scripts should give you a jumpstart in learning how to write Python code for voice-related applications.

^^ Note: If you find this code useful, please leave a star :) ^^

Getting started: setting up environment (mac)

Watch the overview video for this book here.

Note that these instructions are for Mac computers; on Windows or Linux you may need some custom setup for FFmpeg and/or SoX.
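
Before running the setup, it can help to confirm that FFmpeg and SoX are actually reachable from your shell. The snippet below is a minimal sketch of such a check and is not part of the book's scripts. It assumes the tools are installed under their usual executable names, ffmpeg and sox, and the helper name find_missing_tools is hypothetical.

```python
import shutil

# Executable names are an assumption: the usual command names for FFmpeg and SoX.
REQUIRED_TOOLS = ("ffmpeg", "sox")


def find_missing_tools(tools=REQUIRED_TOOLS):
    """Return the names of the given tools that cannot be found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = find_missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("FFmpeg and SoX were both found on PATH.")
```

If a tool is reported as missing, install it with your platform's package manager before continuing.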

First, clone the repository and submodules:

git clone --recurse-submodules -j8 https://github.com/jim-schwoebel/voicebook

Now you need to run the setup.py script in a virtual environment to make sure you have all the required dependencies for all the chapters of the book. To do this, run:

cd voicebook
pip3 install virtualenv
virtualenv env 
source env/bin/activate
python3 setup.py

Now you have all the dependencies necessary to follow along with the chapters in the book. You don't need to worry about any other installations.
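
If a script later fails on a missing import, a common cause is running it outside the activated environment. The following is a small sketch, not taken from the repository, for checking from Python whether the interpreter is running inside a virtual environment; the function name in_virtual_environment is hypothetical.

```python
import sys


def in_virtual_environment():
    """Return True if this interpreter appears to run inside a virtual environment."""
    # Older virtualenv releases set sys.real_prefix.
    if hasattr(sys, "real_prefix"):
        return True
    # venv and newer virtualenv releases make sys.prefix differ from sys.base_prefix.
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


if __name__ == "__main__":
    if in_virtual_environment():
        print("Running inside a virtual environment:", sys.prefix)
    else:
        print("No virtual environment detected; try: source env/bin/activate")
```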

License

This repository is licensed under the Apache 2.0 License.

Feedback

Any feedback on the book or this repository is greatly appreciated.

  • If you find something that is missing or doesn't work, please consider opening a GitHub issue.
  • If you'd like to be mentored by someone on our team, check out the Innovation Fellows Program.
  • If you are looking for a framework to start building machine learning models in voice computing, check out Allie.
  • If you want to talk to me directly, please send me an email @ [email protected].

Citation

Please use the following citation when citing this book in your research work:

References

Check out the wiki or documentation below to follow along with each chapter in the book. This way, you can quickly get up to speed with the 200+ scripts included in this repository.

More Repositories

  1. voice_datasets (1,658 stars): A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
  2. allie (Python, 140 stars): An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.
  3. download_audioset (Python, 97 stars): This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
  4. voice_gender_detection (Python, 77 stars): Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
  5. sound_event_detection (Python, 39 stars): A repository for manually annotating files to create labeled acoustic datasets for machine learning.
  6. nala (Python, 35 stars): Nala is an agile open-source voice assistant framework (20+ actions).
  7. pauses (Python, 31 stars): Quick library to extract pause lengths from audio files.
  8. audioset_models (Python, 29 stars): Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
  9. voiceome (C++, 27 stars): The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances, 80+ health labels). Preprint: https://www.medrxiv.org/content/10.1101/2021.08.16.21262125v1
  10. nala_assistant (JavaScript, 24 stars): A FastAPI voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.
  11. dopg_cli (Python, 12 stars): A Python command line interface for DigitalOcean Postgres clusters (5+ integrations).
  12. sample_voice_data (Python, 9 stars): Sample voice data, 52 males and 52 females as classes, useful for testing.
  13. youtube_scrape (Python, 7 stars): Library for making playlists and scraping YouTube videos; an alternative to pafy, pytube, and youtube-dl.
  14. video_record (Python, 6 stars): Record videos (w/ audio and screenshare) on Mac computers.
  15. pylicense (Python, 3 stars): Quickly make license docs for your code base.