  • Stars: 444
  • Rank 98,300 (Top 2 %)
  • Language: Python
  • License: Apache License 2.0
  • Created about 3 years ago
  • Updated 3 months ago

Repository Details

Production First and Production Ready End-to-End Keyword Spotting Toolkit

WeKws

The goal of this toolkit is to...

Small-footprint keyword spotting (KWS), or more specifically wake-up word (WuW) detection, is a typical and important module in Internet of Things (IoT) devices. It lets users control IoT devices hands-free. A WuW detection system usually runs locally and persistently on the device, so it requires low power consumption, few model parameters, and low computational complexity, and it must detect predefined keywords in a streaming fashion, i.e., with low latency.
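
To make the streaming requirement concrete, here is a minimal sketch of frame-by-frame detection. It assumes an acoustic model (not shown) already emits one keyword posterior per frame; the function name `stream_detect`, the moving-average smoothing, and the default `window` and `threshold` values are illustrative assumptions, not the toolkit's actual decoding logic.

```python
from collections import deque
from typing import Iterable, Iterator, Tuple


def stream_detect(
    posteriors: Iterable[float],
    window: int = 3,
    threshold: float = 0.8,
) -> Iterator[Tuple[int, float]]:
    """Yield (frame_index, smoothed_score) each time the keyword fires.

    Hypothetical helper: `posteriors` is a stream of per-frame keyword
    probabilities from some model. Only the last `window` frames are kept,
    so memory stays constant and a decision is made as each frame arrives.
    """
    recent = deque(maxlen=window)  # bounded history: old frames fall out
    armed = True  # fire once per keyword occurrence, not once per frame
    for index, posterior in enumerate(posteriors):
        recent.append(posterior)
        score = sum(recent) / len(recent)  # moving-average smoothing
        if armed and len(recent) == window and score >= threshold:
            armed = False
            yield index, score
        elif score < threshold:
            armed = True  # score dropped again, ready for the next keyword


if __name__ == "__main__":
    frames = [0.1, 0.2, 0.9, 0.95, 0.9, 0.9, 0.1, 0.1, 0.1]
    for index, score in stream_detect(frames):
        print(f"keyword detected at frame {index} (score {score:.2f})")
```

Because the detector is a generator over an iterable, it can consume frames as they are produced, and the added latency is bounded by the smoothing window.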

Typical Scenario

We are going to support the following typical applications of wake-up word detection:

  • Single wake-up word
  • Multiple wake-up words
  • Customizable wake-up word
  • Personalized wake-up word, i.e., a combination of wake-up word detection and voiceprint recognition
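
The multiple and personalized scenarios can be sketched as a single decision step. This is only an illustration under stated assumptions: the per-keyword scores and speaker embeddings are taken as given, and `decide_wake_up`, the keyword names, the cosine-similarity check, and both thresholds are hypothetical rather than part of the toolkit.

```python
import math
from typing import Dict, Optional, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0


def decide_wake_up(
    keyword_scores: Dict[str, float],
    kws_threshold: float = 0.8,
    speaker_embedding: Optional[Sequence[float]] = None,
    enrolled_embedding: Optional[Sequence[float]] = None,
    speaker_threshold: float = 0.7,
) -> Optional[str]:
    """Return the accepted keyword, or None if the device should stay asleep.

    Multiple wake-up words: pick the best-scoring keyword above the threshold.
    Personalized wake-up word: if a voiceprint is enrolled, additionally
    require the current speaker's embedding to be close to it.
    """
    if not keyword_scores:
        return None
    keyword, score = max(keyword_scores.items(), key=lambda item: item[1])
    if score < kws_threshold:
        return None
    if enrolled_embedding is not None:
        if speaker_embedding is None:
            return None  # voiceprint required but not available
        similarity = cosine_similarity(speaker_embedding, enrolled_embedding)
        if similarity < speaker_threshold:
            return None  # right keyword, wrong speaker
    return keyword
```

Leaving `enrolled_embedding` unset gives the plain single or multiple wake-up word behavior; passing it turns on the voiceprint gate.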

Installation

  • Clone the repo
git clone https://github.com/wenet-e2e/wekws.git
cd wekws
  • Create a Conda environment and install the dependencies
conda create -n wekws python=3.8
conda activate wekws
pip install -r requirements.txt
conda install pytorch=1.10.0 torchaudio=0.10.0 cudatoolkit=11.1 -c pytorch -c conda-forge
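
After installation, a quick sanity check of the environment can be useful. The sketch below is not part of the repository: `parse_version`, `installed_version`, and `check_environment` are hypothetical helpers, and treating the pinned versions as minimums is an assumption.

```python
import importlib
import re
from typing import Dict, Optional, Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn '1.10.0+cu111' into (1, 10, 0), dropping local build suffixes."""
    parts = []
    for piece in version.split("+")[0].split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at the first non-numeric component
        parts.append(int(match.group()))
    return tuple(parts)


def installed_version(module_name: str) -> Optional[str]:
    """Return the module's __version__, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", None)


def check_environment(
    minimums: Dict[str, str],
) -> Dict[str, Tuple[Optional[str], bool]]:
    """Map each module name to (found_version, meets_minimum)."""
    report = {}
    for name, minimum in minimums.items():
        found = installed_version(name)
        ok = found is not None and parse_version(found) >= parse_version(minimum)
        report[name] = (found, ok)
    return report
```

Calling `check_environment({"torch": "1.10.0", "torchaudio": "0.10.0"})` inside the activated `wekws` environment reports the version found for each package and whether it meets the assumed minimum.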

Dataset

We plan to support a variety of open-source wake-up word datasets.

All well-trained models on these datasets will be made publicly available.

Runtime

We plan to support a variety of hardware and platforms, including:

  • Web browser
  • x86
  • Android
  • Raspberry Pi

Discussion

Chinese users can scan the QR code on the left to follow the official WeNet account. We also created a WeChat group for better discussion and quicker response; please scan the QR code on the right to join the chat group.

Reference

  • Mining Effective Negative Training Samples for Keyword Spotting (github, paper)
  • Max-pooling Loss Training of Long Short-term Memory Networks for Small-footprint Keyword Spotting (paper)
  • A depthwise separable convolutional neural network for keyword spotting on an embedded system (github, paper)
  • Hello Edge: Keyword Spotting on Microcontrollers (github, paper)
  • An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling (github, paper)
