Elleo/gst-deepspeech

Stars: 169
Rank: 224,453 (top 5%)
Language: C++
License: Other
Created: almost 7 years ago
Updated: over 2 years ago

NOTE: This plugin is now deprecated in favour of the coqui-stt branch in gst-plugins-bad: https://gitlab.freedesktop.org/philn/gstreamer/-/tree/coqui-stt/subprojects/gst-plugins-bad/ext/coqui

GStreamer DeepSpeech Plugin

DeepSpeech is a speech recognition project created by Mozilla.

This project provides a GStreamer element that can be placed into an audio pipeline, where it reports any recognised speech via bus messages. It automatically segments audio based on configurable silence thresholds, making it suitable for continuous dictation.

Here are a couple of example pipelines using gst-launch.

To perform speech recognition on a file, printing all bus messages to the terminal:

gst-launch-1.0 -m filesrc location=/path/to/file.ogg ! decodebin ! audioconvert ! audiorate ! audioresample ! deepspeech ! fakesink
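If the same pipeline needs to be launched from a script, the command line can be assembled programmatically. The following is only a sketch: the helper name build_file_pipeline is hypothetical, and it assumes the silence-threshold and silence-length properties can be set on the deepspeech element in a file pipeline in the same way as in a microphone pipeline.

```python
def build_file_pipeline(path, silence_threshold=None, silence_length=None):
    """Return the argument list for a gst-launch-1.0 file pipeline.

    build_file_pipeline is a hypothetical helper, not part of the plugin.
    """
    # The deepspeech element, with optional property overrides.
    deepspeech = ["deepspeech"]
    if silence_threshold is not None:
        deepspeech.append(f"silence-threshold={silence_threshold}")
    if silence_length is not None:
        deepspeech.append(f"silence-length={silence_length}")

    # Each stage is one element plus its properties.
    stages = [
        ["filesrc", f"location={path}"],
        ["decodebin"],
        ["audioconvert"],
        ["audiorate"],
        ["audioresample"],
        deepspeech,
        ["fakesink"],
    ]

    # -m asks gst-launch-1.0 to print bus messages to the terminal.
    argv = ["gst-launch-1.0", "-m"]
    for index, stage in enumerate(stages):
        if index > 0:
            argv.append("!")  # link this element to the previous one
        argv.extend(stage)
    return argv


if __name__ == "__main__":
    print(" ".join(build_file_pipeline("/path/to/file.ogg")))
```

The resulting list can be passed to subprocess.run on a machine where GStreamer and the plugin are installed.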

To perform speech recognition on audio recorded from the default system microphone, with changes to the silence thresholds:

gst-launch-1.0 -m pulsesrc ! audioconvert ! audiorate ! audioresample ! deepspeech silence-threshold=0.3 silence-length=20 ! fakesink
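To give a rough intuition for what silence-threshold and silence-length control, here is a minimal sketch of silence-based segmentation. It is an illustration only and not the plugin's actual implementation: it assumes that loudness is measured per frame on a 0 to 1 scale, that a frame below the threshold counts as silent, and that an utterance ends after silence_length consecutive silent frames. The units the plugin really uses are not stated here, and the function name segment_by_silence is hypothetical.

```python
def segment_by_silence(levels, silence_threshold=0.3, silence_length=20):
    """Split per-frame loudness values into (start, end) utterance ranges.

    Illustrative only: `levels` holds one loudness value per frame
    (an assumed 0..1 scale). `end` is exclusive.
    """
    segments = []
    start = None      # index of the first loud frame of the current utterance
    silent_run = 0    # consecutive silent frames seen inside the utterance

    for i, level in enumerate(levels):
        if level >= silence_threshold:
            if start is None:
                start = i
            silent_run = 0
        elif start is not None:
            silent_run += 1
            if silent_run >= silence_length:
                # Close the utterance just after its last loud frame.
                segments.append((start, i - silent_run + 1))
                start = None
                silent_run = 0

    if start is not None:
        # The input ended mid-utterance; drop any trailing silent frames.
        segments.append((start, len(levels) - silent_run))
    return segments


if __name__ == "__main__":
    levels = [0.0] * 3 + [0.8] * 5 + [0.1] * 2 + [0.9] * 4 + [0.0] * 4 + [0.7] * 2
    print(segment_by_silence(levels, silence_threshold=0.3, silence_length=3))
```

In this model, a lower silence_length splits speech into shorter utterances at brief pauses, while a higher silence_threshold treats quieter audio as silence.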

pied

Pied makes it simple to install and manage text-to-speech Piper voices for use with Speech Dispatcher.

cutespotify

A Qt 5 Spotify client, based on MeeSpot, with support for Ubuntu Touch and SailfishOS.

ibus-deepspeech

IBus plugin to allow any Linux application to make use of speech recognition

gst-opencv

This project has now been merged into gstreamer-plugins-bad. Please check out the source code from there and file any bug reports in the GStreamer bug tracker.

baby_elephant

A Mastodon client for Wear OS

petition_generator

Generate UK government style petitions using GPT2

qt-osm-map-providers

Easily set up a Qt OSM map providers repository to allow the use of tile servers that require an API key

gst-musicxml2midi

A GStreamer element for converting MusicXML into MIDI audio (suitable for direct synthesis via the wildmidi or timidity elements). Packages are available at:

qml-box2d-game-template

A template for creating QML + Box2D based games for desktop and mobile platforms such as Ubuntu Touch.

gst-aubio

GStreamer plugins making use of the aubio audio labelling library

vegan2d

Editor for the Bacon2D game engine

libQtSpotify

Qt wrapper around libspotify

abalone

A speech-recognition-based input method for GNU/Linux desktops

empathy

Telepathy IM/VoIP client

flutter-emojiexample

An example project, showing how to render emoji within Linux Flutter apps

scribe

Scribe is a device for automatically captioning live performances

fedinp

Displays Radio Free Fedi now playing information on an RGB matrix

myntan

Beautiful mind mapping for Linux, capable of syncing with Mindly