  • Language: Python
  • License: Apache License 2.0

WeKws

Roadmap | Paper

Production First and Production Ready End-to-End Keyword Spotting Toolkit.

The goal of this toolkit is to...

Small-footprint keyword spotting (KWS), or more specifically wake-up word (WuW) detection, is a typical and important module in Internet of Things (IoT) devices. It gives users a hands-free way to control those devices. A WuW detection system usually runs locally and persistently on the device, so it needs low power consumption, few model parameters, and low computational complexity. It must also detect the predefined keyword in a streaming fashion, i.e., with low latency.
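To make the streaming requirement concrete, here is a minimal sketch of how a detector might turn per-frame keyword scores into wake-up events. It is not WeKws code: the function name `detect_keyword`, the moving-average smoothing, and the `window` and `threshold` defaults are all assumptions chosen for illustration, and the scores would in practice come from an acoustic model that is not shown here.

```python
from collections import deque
from typing import Iterable, Iterator, Tuple


def detect_keyword(
    frame_scores: Iterable[float],
    window: int = 5,
    threshold: float = 0.8,
) -> Iterator[Tuple[int, float]]:
    """Yield (frame_index, smoothed_score) each time the keyword fires.

    Hypothetical helper: frame_scores are assumed to be per-frame keyword
    posteriors in [0, 1] produced by some model, consumed one at a time.
    """
    recent = deque(maxlen=window)  # only the last `window` scores are kept
    armed = True                   # avoids firing repeatedly on one utterance
    for index, score in enumerate(frame_scores):
        recent.append(score)
        smoothed = sum(recent) / len(recent)
        if armed and smoothed >= threshold:
            armed = False
            yield index, smoothed
        elif smoothed < threshold:
            armed = True           # re-arm once the score drops again


if __name__ == "__main__":
    scores = [0.1, 0.2, 0.9, 0.95, 0.9, 0.2, 0.1]
    for index, smoothed in detect_keyword(scores, window=3, threshold=0.8):
        print(f"keyword detected at frame {index} (score {smoothed:.2f})")
```

Because the loop only holds the last `window` scores, memory use is constant and a decision is available as soon as each frame arrives, which is the property the low-latency requirement asks for.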

Typical Scenario

We are going to support the following typical wake-up word applications:

  • Single wake-up word
  • Multiple wake-up words
  • Customizable wake-up word
  • Personalized wake-up word, i.e. combination of wake-up word detection and voiceprint

Installation

  • Clone the repo
git clone https://github.com/wenet-e2e/wekws.git
cd wekws
conda create -n wekws python=3.8
conda activate wekws
pip install -r requirements.txt
conda install pytorch=1.10.0 torchaudio=0.10.0 cudatoolkit=11.1 -c pytorch -c conda-forge
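After running the commands above, a quick check like the following can confirm that the environment contains the expected packages. This is only a sketch using the Python standard library; the helper name `installed_versions` is hypothetical, and it assumes the packages register themselves under the distribution names `torch` and `torchaudio`.

```python
from importlib import metadata
from typing import Dict, Iterable, Optional


def installed_versions(packages: Iterable[str]) -> Dict[str, Optional[str]]:
    """Map each distribution name to its installed version, or None if absent."""
    versions: Dict[str, Optional[str]] = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Distribution names are an assumption; adjust if your install differs.
    for name, version in installed_versions(["torch", "torchaudio"]).items():
        print(f"{name}: {version or 'not installed'}")
```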

Dataset

We plan to support a variety of open source wake-up word datasets, including but not limited to:

All well-trained models on these datasets will be made publicly available.

Runtime

We plan to support a variety of hardware and platforms, including:

  • Web browser
  • x86
  • Android
  • Raspberry Pi

Discussion

Chinese users can scan the QR code on the left to follow the official WeNet account. We have also created a WeChat group for discussion and quicker responses. Please scan the QR code on the right to join the chat group.

Reference

  • Mining Effective Negative Training Samples for Keyword Spotting (github, paper)
  • Max-pooling Loss Training of Long Short-term Memory Networks for Small-footprint Keyword Spotting (paper)
  • A depthwise separable convolutional neural network for keyword spotting on an embedded system (github, paper)
  • Hello Edge: Keyword Spotting on Microcontrollers (github, paper)
  • An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling (github, paper)
