• Stars
    star
    483
  • Rank 90,445 (Top 2 %)
  • Language
    C++
  • License
    MIT License
  • Created almost 3 years ago
  • Updated 4 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.

Rapid ASR

  • 🎉 推出知识星球RapidAI私享群,这里的提问会优先得到回答和支持,也会享受到RapidAI组织后续持续优质的服务。欢迎大家的加入。
  • Paraformer模型出自阿里达摩院Paraformer语音识别-中文-通用-16k-离线-large-pytorch
  • 本仓库仅对模型做了转换,只采用ONNXRuntime推理引擎。该项目核心代码已经并入FunASR
  • 项目仍会持续更新,欢迎关注。
  • QQ群号:645751008

📖文档导航

📆TODO以及任务认领

  • 参见这里:link

🎨整体框架

flowchart LR

A([wav]) --RapidVad--> B([各个小段的音频]) --RapidASR--> C([识别的文本内容]) --RapidPunc--> D([最终识别内容])

📣更新日志

详情 - 2023-08-21 v2.0.4 update: - 添加whl包支持 - 更新文档 - 2023-02-25 - 添加C++版本推理,使用onnxruntime引擎,预/后处理代码来自: [FastASR](https://github.com/chenkui164/FastASR) - 2023-02-14 v2.0.3 update: - 修复librosa读取wav文件错误 - 修复fbank与torch下fbank提取结果不一致bug - 2023-02-11 v2.0.2 update: - 模型和推理代码解耦(`rapid_paraformer`和`resources`) - 支持批量推理(通过`resources/config.yaml`中`batch_size`指定) - 增加多种输入方式(`Union[str, np.ndarray, List[str]]`) - 2023-02-10 v2.0.1 update: - 添加对输入音频为噪音或者静音的文件推理结果捕捉。

More Repositories

1

RapidOCR

Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle. (将PaddleOCR模型做了转换,采用ONNXRuntime推理,速度很快)
Python
2,639
star
2

RapidLaTeXOCR

Formula recognition based on LaTeX-OCR and ONNXRuntime.
Python
266
star
3

LabelConvert

🔄 A tool for object detection and image segmentation dataset format conversion.
Python
266
star
4

Knowledge-QA-LLM

QA based on local knowledge and LLM.
Python
189
star
5

RapidStructure

版面分析 | 表格识别 | 文档方向分类
Python
177
star
6

RapidOcrOnnx

rapidocr onnx cpp
C++
146
star
7

TableStructureRec

整理目前开源的表格识别模型,完善前后处理,模型转换为ONNX
Python
130
star
8

RapidOCRPDF

Based on RapidOCR, extract the PDF content.
Python
119
star
9

RapidLayout

Analysis of Chinese and English layouts 中英文版面分析
Python
81
star
10

RapidOcrAndroidOnnx

C++
53
star
11

RapidOcrNcnn

RapidOCR ncnn 推理
C++
50
star
12

PaddleOCRModelConvert

Convert the model in PaddleOCR to ONNX format
Python
45
star
13

LLM-EXAM

大模型中文测试题库-民间版本
45
star
14

RapidTTS

A cross platform implementation of Text-to-Speech based on ONNXRuntime.
Python
32
star
15

OnnxruntimeBuilder

Onnxruntime Builder
Shell
31
star
16

RapidOCRCSharp

C#
28
star
17

OpenCVBuilder

OpenCV Custom Builder
Shell
22
star
18

Paddle2OnnxConvertor

Convert paddle model to onnx model
Python
20
star
19

RapidPunc

A library for adding punctuation into a text from ASR.
C++
17
star
20

RapidOcrAndroidOnnxCompose

opencv onnxruntime ocr android demo, jetpack compose + kotlin
Kotlin
13
star
21

RapidVoice

The engineering implementation of SenseVoice (from Alibaba)
8
star
22

RapidTable

源自PP-Structure的表格识别算法,模型转换为ONNX,推理引擎采用ONNXRuntime,部署简单,无内存泄露问题。
Python
8
star
23

RapidOcrOnnxJvm

RapidOcr onnx java kotlin jni test
Kotlin
7
star
24

RapidOcrAndroidNcnn

C++
6
star
25

RapidLayoutRecover

针对文档类图像,整合版面分析、文字识别、表格识别和公式识别结果,还原版面布局信息。
Python
5
star
26

keyframe_extractor

To extract key frames from a video.
Python
5
star
27

paraformer_simple

Python
4
star
28

LLM-DOC

大模型研究院资料馆
4
star
29

RapidOcrNcnnJvm

RapidOcr ncnn java kotlin jni
Kotlin
4
star
30

RapidAudioKit

It's for the repository of audio resampling tools
C
3
star
31

RapidImgUtil

Image processing library to add some new formats and other supports.
C++
2
star
32

RapidOcrNcnnLibTest

rapid ocr ncnn lib test
CMake
2
star
33

RapidOCRDocs

RapidOCR Document
HTML
2
star
34

VoiceCut

1
star
35

paddleocr2ncnn

Batchfile
1
star
36

RapidAIWebSite

HTML
1
star
37

RapidOcrOnnxLibTest

rapidocr onnx cpp lib test
CMake
1
star
38

RapidPix2Pix

Inference code based on the onnxruntime about pix2pix
Python
1
star