Top Rating
- Top Contributors
  Discover the Top Open Source contributors by country or by language
- Interviews
  Discover real stories from Open Source developers
Discover

Discover your Favorite Language
Discover the top trending repositories and projects on Github. Explore the latest trends in your preferred languages.

C#

Crystal

Haskell

F#

TypeScript

Nix

Python

Java

More Languages
Awesome

Awesome repositories
Discover the most awesome repositories and projects of your favorite languages. Inspired by the Awesome-* lists trend in GitHub.

Zig

Julia

C#

Rust

Ada

Clojure

Go

Nix

More Languages
By Country

Rankings by Country
Discover the community of talented open source contributors in each country.

🇨🇦 Canada

🇯🇴 Jordan

🇮🇶 Iraq

🇨🇾 Cyprus

🇱🇮 Liechtenstein

🇸🇯 Svalbard and Jan Mayen

🇳🇦 Namibia

🇪🇭 Western Sahara

All Countries Compare Countries

wiseman/py-webrtcvad

Stars
2,053
Rank 22,507 (Top 0.5 %)
Language
C
License
Other
Created over 8 years ago
Updated 5 months ago

wiseman/py-webrtcvad

wiseman

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Python interface to the WebRTC Voice Activity Detector

https://travis-ci.org/wiseman/py-webrtcvad.svg?branch=master

py-webrtcvad

This is a python interface to the WebRTC Voice Activity Detector (VAD). It is compatible with Python 2 and Python 3.

A VAD classifies a piece of audio data as being voiced or unvoiced. It can be useful for telephony and speech recognition.

The VAD that Google developed for the WebRTC project is reportedly one of the best available, being fast, modern and free.

How to use it

Install the webrtcvad module:
```
pip install webrtcvad
```
Create a Vad object:
```
import webrtcvad
vad = webrtcvad.Vad()
```
Optionally, set its aggressiveness mode, which is an integer between 0 and 3. 0 is the least aggressive about filtering out non-speech, 3 is the most aggressive. (You can also set the mode when you create the VAD, e.g. vad = webrtcvad.Vad(3)):
```
vad.set_mode(1)
```

Give it a short segment ("frame") of audio. The WebRTC VAD only accepts 16-bit mono PCM audio, sampled at 8000, 16000, 32000 or 48000 Hz. A frame must be either 10, 20, or 30 ms in duration:

# Run the VAD on 10 ms of silence. The result should be False.
sample_rate = 16000
frame_duration = 10  # ms
frame = b'\x00\x00' * int(sample_rate * frame_duration / 1000)
print 'Contains speech: %s' % (vad.is_speech(frame, sample_rate)

See example.py for a more detailed example that will process a .wav file, find the voiced segments, and write each one as a separate .wav.

How to run unit tests

To run unit tests:

pip install -e ".[dev]"
python setup.py test

History

2.0.10

Fixed memory leak. Thank you, bond005!

2.0.9

Improved example code. Added WebRTC license.

2.0.8

Fixed Windows compilation errors. Thank you, xiongyihui!

mavelous

multi-platform ground station for drones that speak the MAVLink protocol

coole-radar

A very cool semi-real terminal radar app. Written in Clojurescript targeting Node using shadow-cljs.

node-sbs1

Node.js parsing code for SBS-1 ADS-B messages

arduino-serial

Python port of Tod E. Kurt's arduino-serial.c for communicating with an Arduino over a serial port..

foursquare-python

Python module to interface with the foursquare API.

turboshrimp

Clojure API for the Parrot AR.Drone.

leaflet-gorilla

Leaflet renderer for gorilla-repl

sift

Library for doing SIFT-based image matching. Based on Rob Hess' SIFT code.

droneklv

Clojure code for handling metadata embedded in drone video with KLV

orbital-detector

Detect and map orbiting helicopters.

clj-pronouncing

A simple interface to the CMU Pronouncing Dictionary.

webflight-traffic

Air traffic overlay plugin for ardrone-webflight that uses ADS-B data.

initialisms

Guess sentences from initial letters of each word

ardrone-browser-3d

energid_nlp

Natural language parsers and conceptual memory

gpsjam.org

cl-zeroconf

A Common Lisp interface to Apple's open source implementation of the Zeroconf service discovery protocol (Bonjour).

webflight-gamepad

A plugin for ardrone-webflight that lets you control a drone with a gamepad in the browser.

virtual-radar-server

Virtual radar server, an aircraft tracker that uses SBS-compatible data (ADS-B, mode S, etc.)

ar-drone-rest

A node.js REST server for controlling the AR.Drone 2.0.

pyluis

Python interface to Microsoft LUIS (Language Understanding Intelligent Service)

4mapper

Maps Foursquare checkins. Powers http://4mapper.appspot.com/

clj-pid

PID controller in Clojure.

node-planefinder

A node.js module that can get aircraft location information from planefinder.net.

sirc

IRC indexer and search engine that runs on Google AppEngine.

cl-difflib

A Common Lisp library for computing differences between sequences based on the Python difflib module.

docker-rpi-vrs

Docker image for running Virtual Radar Server on Raspberry Pi

braitenberg-vehicles

A simulator for Braitenberg vehicles, as described by A. K. Dewdney.

shrimpdroid

Android app in Clojure to control an AR.Drone.

clj-opencv-examples

Examples of using OpenCV from Clojure

chernoff-faces

Lisp & Java code to draw Chernoff faces, a technique for displaying multivariate data in the shape of a human face.

tracon

Detects aircraft interceptions, in real-time or after the fact.

java-mode-s-beast

Java library that decodes Mode-S Beast messages containing Mode S/ADS-B information.

ac-statevec

Build aircraft state vectors from pings

foolseye

Protecting journalistic integrity with image processing & crowd-sourcing.

mefingram

Metafilter infodump n-gram tools

faa-registration-data

word-freqs

A simple Ajax demo; Uses web.py, dojo, matplotlib & simplejson.

threadless-corpus

Collections of images for testing & evaluating object recognition algorithms.

clj-viterbi

Viterbi search in clojure.

cl-html-diff

A Common Lisp library for generating a human-readable diff of two HTML documents.

rid

Regressive imagery dictionary, a content analysis coding scheme designed to measure primordial vs. conceptual thinking.

secretmetafilter

Web app that highlights metafilter.com discussions that are still active on older posts (runs in Google AppEngine).

lsp

Lisp Server Pages, a simple Common Lisp version of JSP.

node-ar-pnav

Experiments with potential field-based navigation techniques for the AR.Drone.

fireflysync

"Firefly Synchronization in Ad Hoc Networks"

lisp-spotlight-indexer

Plugin that indexes Common Lisp code in Mac OS X Spotlight.

wischk

A Unix and DOS checkers-playing program.

clj-gflags

Google flags/gflags for clojure

turboshrimp-tracker

Ground control station for AR.Drone with OpenCV object tracking.

whatsoverhead-in-space

ten90

Packaging the guts of dump1090 into a library.

r4

Wrapper around p4 command-line client to add extra functionality and ease-of-use.

blubber_bot

Firmware for Jed Berk's Blubber Bot (a robot blimp).

sane-cliki

Fixes HTML encoding bugs in CLiki, the Common Lisp wiki

sklounst

Stores streaming SBS-1 messages in a database.

morning-improv-scraper

Code to scrape Scott McCloud's Morning Improv blog and convert to RSS for LiveJournal.

better-ning-feeds

Adds content to content-free ning RSS/Atom feeds.

commoncrawl_cdx

API and utilities for accessing the Common Crawl CDX index.

civileyesmesite

Drones deterring police misconduct. See http://www.civileyes.me/

claim

A Common Lisp library for interfacing to AOL Instant Messenger (AIM) using the TOC protocol.

darpa-gc-forum-scraper

Lisp code to scrape the DARPA Grand Challenge discussion forum and convert it to RSS.

ground-stop-anomalies

Code to detect certain anomalies that may have occurred during the FAA ground stop of 1/10/2022

familiar

closure-externs-generator

honeybee

Bot to play Travian.

lml

Lisp Markup Language, an s-expression based markup language.

socialaircraft

What if every aircraft had a twitter account?

aircraft_icao_country

lein-awsuberwar

Patches lein ring uberwar to not discard .ebextensions.