• Stars
    star
    4
  • Rank 3,304,323 (Top 66 %)
  • Language
  • Created about 8 years ago
  • Updated about 6 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Images and groundtruth text for Sanskrit for OCR evaluation

More Repositories

1

tessdata_shreetest

finetuned traineddata files for tesseract 4.0.0 for testing
Shell
157
star
2

tess5train-fonts

Files and Scripts to run Tesseract 5 LSTM Training using fonts
HTML
76
star
3

tessdata_ocrb

tesseract 4 traineddata for MRZ using OCR-B fonts
Shell
75
star
4

tessdata_ssd

Tesseract 4 traineddata for recognizing Seven Segment Display
Shell
51
star
5

hindi-hunspell

Hindi wordlists, dictionary and affix files in hunspell format
Shell
40
star
6

tessdata_arabic

Finetuned traineddata files for Arabic
Shell
29
star
7

tesstrain-Sanskrit-IAST

Tesseract Traineddata for Sanskrit transliteration
Shell
8
star
8

tessdata_emoji

Traineddata for recognizing emoji icons with Tesseract 4
Shell
8
star
9

tesstrain-ckb

Tesseract4 finetuned traineddata for Central Kurdish/Sorani
HTML
7
star
10

tesstrain-sanPlusMinus

Demo of PlusMinus training for sanskrit for tesseract5
Makefile
6
star
11

tesstrain-akk

Training Tesseract 5 Alpha for Akkadian and Cuneiform
HTML
5
star
12

tesstrain-JSTORArabic

PlusMinus Arabic training using TrainingData/JSTORArabic from OpenITI
HTML
5
star
13

tesstrain-ben

Finetune training for Bengali using makefile, training_text and fonts
Python
3
star
14

tesstrain-xsa

Finetune Training and OCR evaluation of Tesseract for Sabaean language in Ancient South Arabian script
3
star
15

tessdata_coptic

Traineddata files for Tesseract 4.0.0 for Coptic OCR
3
star
16

tessdata_jav_java

Tesseract 4.0.0 training data for Javanese Script (Aksara Jawa)
Shell
3
star
17

tessdata_tamil

Tamil traineddata for testing
2
star
18

kraken_grantha

line images and their ground truth in grantha script
2
star
19

imageshin

Images and Ground Truth files in Hindi for OCR evaluation
2
star
20

imageskan

Tesseract OCR 4.0.0 test tutorial
Shell
1
star
21

kraken_devanagari

Kraken models for Devanagari
Shell
1
star
22

tesstrain-modi

tesseract traineddata for Modi script
HTML
1
star
23

tesstrain-deva

Finetune tesseract traineddata for Devanagari script
HTML
1
star
24

imagesmar

Images and groundtruth text for Marathi for OCR evaluation
1
star
25

tess5training-rajarajan

Shell
1
star
26

tesstrain-bali

FInetuning for Balinese script
Makefile
1
star
27

xetex-itrans

Fork of https://www.ctan.org/tex-archive/macros/xetex/generic/itrans
TeX
1
star