  • Stars
    189
  • Rank 204,649 (Top 5 %)
  • Language
    Python
  • License
    Apache License 2.0
  • Created over 1 year ago
  • Updated 10 months ago

Repository Details

QA based on local knowledge and LLM.

🧐 Knowledge QA LLM

📣 We're looking for front-end developers interested in knowledge QA with LLMs to help us separate the front end from the back end in the current implementation.

Introduction

  • Question answering based on a local knowledge base + LLM.
  • Reason:
  • Advantage:
    • The project is fully modular and does not depend on the langchain library; each part can be easily replaced, and the code is simple and easy to understand.
    • Apart from the large language model interface, which must be deployed separately, all other parts can run on CPU.
    • Supports documents in common formats, including txt, md, pdf, docx, pptx, and excel. Other document types can be supported through customization.
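
To make the "local knowledge base + LLM" flow concrete, here is a minimal sketch of the retrieval and prompt-assembly steps. It is not the project's code: the function names (`embed`, `retrieve`, `build_prompt`) are hypothetical, and the word-count "embedding" is a toy stand-in for a real embedding model. The final call to the separately deployed LLM is left out.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding (assumption): lowercase word counts instead of a real model."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, chunks: list[str], top_k: int = 2) -> list[str]:
    """Return the top_k knowledge-base chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:top_k]


def build_prompt(query: str, context: list[str]) -> str:
    """Assemble the prompt that would be sent to the LLM interface."""
    joined = "\n".join(f"- {c}" for c in context)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{joined}\n\nQuestion: {query}"
    )


# Hypothetical knowledge-base chunks, e.g. extracted from txt/pdf/docx files.
chunks = [
    "RapidOCR converts PaddleOCR models to ONNX.",
    "The knowledge base supports pdf and docx documents.",
    "Bananas are rich in potassium.",
]
question = "Which documents does the knowledge base support?"
top = retrieve(question, chunks, top_k=1)
prompt = build_prompt(question, top)
```

Because each step is a plain function, any one of them (the embedding, the ranking, the prompt template) can be swapped without touching the others, which is the kind of modularity the list above describes.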

Demo

⚠️ If you have a Baidu account, you can visit the online demo based on ERNIE Bot.

Documentation

Full documentation is available on the docs site (in Chinese).

TODO

  • Support keyword + vector hybrid search.
  • Vue.js-based UI.
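
One common way to combine keyword and vector results is reciprocal rank fusion (RRF). The sketch below illustrates that general technique only; it is not the project's planned design, and the function name, the document IDs, and the constant `k = 60` (a frequently used default) are assumptions.

```python
def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document scores 1 / (k + rank) for every list it appears in;
    ties are broken alphabetically so the output is deterministic.
    """
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical results from a keyword search and a vector search.
keyword_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_b", "doc_d", "doc_a"]
fused = reciprocal_rank_fusion([keyword_hits, vector_hits])
print(fused)  # ['doc_b', 'doc_a', 'doc_d', 'doc_c']
```

RRF uses only rank positions, so keyword scores and vector similarities do not need to be on the same scale.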

Contributing

  • Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
  • Please make sure to update tests as appropriate.

If you want to sponsor the project, click the Buy me a coffee image and include a note (e.g. your GitHub account name) so that you can be added to the sponsorship list below.

License

Apache 2.0

More Repositories

1

RapidOCR

An OCR toolkit for multiple programming languages, based on ONNXRuntime, OpenVINO and PaddlePaddle. (PaddleOCR models are converted and run with ONNXRuntime for fast inference.)
Python
2,639
star
2

RapidASR

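A commercial-grade open-source automatic speech recognition library: works out of the box, supports all platforms, and recognizes mixed Chinese and English speech. A cross-platform implementation of ASR inference, based on ONNXRuntime and FunASR, with a set of simpler APIs for calling ASR models.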
C++
483
star
3

RapidLaTeXOCR

Formula recognition based on LaTeX-OCR and ONNXRuntime.
Python
266
star
4

LabelConvert

🔄 A tool for object detection and image segmentation dataset format conversion.
Python
266
star
5

RapidStructure

Layout analysis | Table recognition | Document orientation classification
Python
177
star
6

RapidOcrOnnx

rapidocr onnx cpp
C++
146
star
7

TableStructureRec

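A collection of currently open-source table recognition models, with improved pre- and post-processing and models converted to ONNX.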
Python
130
star
8

RapidOCRPDF

Based on RapidOCR, extract the PDF content.
Python
119
star
9

RapidLayout

Layout analysis for Chinese and English documents.
Python
81
star
10

RapidOcrAndroidOnnx

C++
53
star
11

RapidOcrNcnn

RapidOCR inference with ncnn.
C++
50
star
12

PaddleOCRModelConvert

Convert the model in PaddleOCR to ONNX format
Python
45
star
13

LLM-EXAM

A Chinese-language test question bank for large language models (community edition).
45
star
14

RapidTTS

A cross platform implementation of Text-to-Speech based on ONNXRuntime.
Python
32
star
15

OnnxruntimeBuilder

Onnxruntime Builder
Shell
31
star
16

RapidOCRCSharp

C#
28
star
17

OpenCVBuilder

OpenCV Custom Builder
Shell
22
star
18

Paddle2OnnxConvertor

Convert paddle model to onnx model
Python
20
star
19

RapidPunc

A library for adding punctuation into a text from ASR.
C++
17
star
20

RapidOcrAndroidOnnxCompose

opencv onnxruntime ocr android demo, jetpack compose + kotlin
Kotlin
13
star
21

RapidVoice

The engineering implementation of SenseVoice (from Alibaba)
8
star
22

RapidTable

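A table recognition algorithm derived from PP-Structure. The model is converted to ONNX and run with ONNXRuntime, which keeps deployment simple and avoids memory leaks.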
Python
8
star
23

RapidOcrOnnxJvm

RapidOcr onnx java kotlin jni test
Kotlin
7
star
24

RapidOcrAndroidNcnn

C++
6
star
25

RapidLayoutRecover

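For document images: combines the results of layout analysis, text recognition, table recognition, and formula recognition to recover the page layout.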
Python
5
star
26

keyframe_extractor

To extract key frames from a video.
Python
5
star
27

paraformer_simple

Python
4
star
28

LLM-DOC

Resource library of the large model research institute.
4
star
29

RapidOcrNcnnJvm

RapidOcr ncnn java kotlin jni
Kotlin
4
star
30

RapidAudioKit

A repository of audio resampling tools.
C
3
star
31

RapidImgUtil

Image processing library to add some new formats and other supports.
C++
2
star
32

RapidOcrNcnnLibTest

rapid ocr ncnn lib test
CMake
2
star
33

RapidOCRDocs

RapidOCR Document
HTML
2
star
34

VoiceCut

1
star
35

paddleocr2ncnn

Batchfile
1
star
36

RapidAIWebSite

HTML
1
star
37

RapidOcrOnnxLibTest

rapidocr onnx cpp lib test
CMake
1
star
38

RapidPix2Pix

pix2pix inference code based on ONNXRuntime.
Python
1
star