Warning
DeepSpeech doesn't seem to be maintained anymore by Mozilla. Coqui is one of the projects that continued building on top of this awesome project. Check out the GO bindings for Coqui here.
Astideepspeech
Golang bindings for Mozilla's DeepSpeech speech-to-text library.
astideepspeech
is compatible with version v0.9.0
of DeepSpeech
.
Installation
Install DeepSpeech
- fetch an up-to-date
native_client.<your system>.tar.xz
matching your system from DeepSpeech's "releases" - extract its content to /tmp/deepspeech/lib
- download
deepspeech.h
from https://github.com/mozilla/DeepSpeech/raw/v0.9.0/native_client/deepspeech.h - copy it to /tmp/deepspeech/include
- export CGO_LDFLAGS="-L/tmp/deepspeech/lib/"
- export CGO_CXXFLAGS="-I/tmp/deepspeech/include/"
- export LD_LIBRARY_PATH=/tmp/deepspeech/lib/:$LD_LIBRARY_PATH
Alternatively, copy the downloaded libdeepspeech.so
and deepspeech.h
files
to directories that are searched by default, e.g. /usr/local/lib
and
/usr/local/include
, respectively.
Install astideepspeech
Run the following command:
$ go get -u github.com/asticode/go-astideepspeech/...
Example
Get the pre-trained model and scorer
Run the following commands:
$ mkdir /tmp/deepspeech
$ cd /tmp/deepspeech
$ wget https://github.com/mozilla/DeepSpeech/releases/download/v0.9.0/deepspeech-0.9.0-models.pbmm
$ wget https://github.com/mozilla/DeepSpeech/releases/download/v0.9.0/deepspeech-0.9.0-models.scorer
Get the audio files
Run the following commands:
$ cd /tmp/deepspeech
$ wget https://github.com/mozilla/DeepSpeech/releases/download/v0.9.0/audio-0.9.0.tar.gz
$ tar xvfz audio-0.9.0.tar.gz
Use the client
Run the following commands (make sure $GOPATH/bin
is in your $PATH
):
$ cd /tmp/deepspeech
$ deepspeech -model deepspeech-0.9.0-models.pbmm -scorer deepspeech-0.9.0-models.scorer -audio audio/2830-3980-0043.wav
Text: experience proves this
$ deepspeech -model deepspeech-0.9.0-models.pbmm -scorer deepspeech-0.9.0-models.scorer -audio audio/4507-16021-0012.wav
Text: why should one hall on the way
$ deepspeech -model deepspeech-0.9.0-models.pbmm -scorer deepspeech-0.9.0-models.scorer -audio audio/8455-210777-0068.wav
Text: your power is sufficient i said