RapidAI/RapidASR

Stars
483
Rank 91,050 (Top 2 %)
Language
C++
License
MIT License
Created almost 3 years ago
Updated 7 months ago

RapidAI/RapidASR

RapidAI

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

商用级开源语音自动识别程序库，开箱即用，全平台支持，中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.

Rapid ASR

🎉 推出知识星球RapidAI私享群，这里的提问会优先得到回答和支持，也会享受到RapidAI组织后续持续优质的服务。欢迎大家的加入。
Paraformer模型出自阿里达摩院Paraformer语音识别-中文-通用-16k-离线-large-pytorch。
本仓库仅对模型做了转换，只采用ONNXRuntime推理引擎。该项目核心代码已经并入FunASR。
项目仍会持续更新，欢迎关注。
QQ群号：645751008

📖文档导航

语音识别：
- rapid_paraformer:
  - rapid_paraformer-Python
  - rapid_C++/C
- rapid_wenet
  - Python
  - C++
- rapid_paddlespeech-Python
标点符号
- RapidPunc

📆TODO以及任务认领

参见这里：link

🎨整体框架

flowchart LR

A([wav]) --RapidVad--> B([各个小段的音频]) --RapidASR--> C([识别的文本内容]) --RapidPunc--> D([最终识别内容])

📣更新日志

详情

- 2023-08-21 v2.0.4 update: - 添加whl包支持 - 更新文档 - 2023-02-25 - 添加C++版本推理，使用onnxruntime引擎，预/后处理代码来自： [FastASR](https://github.com/chenkui164/FastASR) - 2023-02-14 v2.0.3 update: - 修复librosa读取wav文件错误 - 修复fbank与torch下fbank提取结果不一致bug - 2023-02-11 v2.0.2 update: - 模型和推理代码解耦（`rapid_paraformer`和`resources`） - 支持批量推理（通过`resources/config.yaml`中`batch_size`指定） - 增加多种输入方式（`Union[str, np.ndarray, List[str]]`） - 2023-02-10 v2.0.1 update: - 添加对输入音频为噪音或者静音的文件推理结果捕捉。

RapidOCR

Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle. （将PaddleOCR模型做了转换，采用ONNXRuntime推理，速度很快）

RapidLaTeXOCR

Formula recognition based on LaTeX-OCR and ONNXRuntime.

LabelConvert

🔄 A tool for object detection and image segmentation dataset format conversion.

Knowledge-QA-LLM

QA based on local knowledge and LLM.

RapidStructure

版面分析 | 表格识别 | 文档方向分类

RapidOcrOnnx

rapidocr onnx cpp

TableStructureRec

整理目前开源的表格识别模型，完善前后处理，模型转换为ONNX

RapidOCRPDF

Based on RapidOCR, extract the PDF content.

RapidLayout

Analysis of Chinese and English layouts 中英文版面分析

RapidOcrAndroidOnnx

RapidOcrNcnn

RapidOCR ncnn 推理

PaddleOCRModelConvert

Convert the model in PaddleOCR to ONNX format

LLM-EXAM

大模型中文测试题库-民间版本

RapidTTS

A cross platform implementation of Text-to-Speech based on ONNXRuntime.

OnnxruntimeBuilder

Onnxruntime Builder

RapidOCRCSharp

OpenCVBuilder

OpenCV Custom Builder

Paddle2OnnxConvertor

Convert paddle model to onnx model

RapidPunc

A library for adding punctuation into a text from ASR.

RapidOcrAndroidOnnxCompose

opencv onnxruntime ocr android demo, jetpack compose + kotlin

RapidVoice

The engineering implementation of SenseVoice (from Alibaba)

RapidTable

源自PP-Structure的表格识别算法，模型转换为ONNX，推理引擎采用ONNXRuntime，部署简单，无内存泄露问题。

RapidOcrOnnxJvm

RapidOcr onnx java kotlin jni test

RapidOcrAndroidNcnn

RapidLayoutRecover

针对文档类图像，整合版面分析、文字识别、表格识别和公式识别结果，还原版面布局信息。

keyframe_extractor

To extract key frames from a video.

paraformer_simple

LLM-DOC

大模型研究院资料馆

RapidOcrNcnnJvm

RapidOcr ncnn java kotlin jni

RapidAudioKit

It's for the repository of audio resampling tools

RapidImgUtil

Image processing library to add some new formats and other supports.

RapidOcrNcnnLibTest

rapid ocr ncnn lib test

RapidOCRDocs

RapidOCR Document

VoiceCut

paddleocr2ncnn

RapidAIWebSite

RapidOcrOnnxLibTest

rapidocr onnx cpp lib test

RapidPix2Pix

Inference code based on the onnxruntime about pix2pix