• Stars
    star
    2,417
  • Rank 18,963 (Top 0.4 %)
  • Language
  • License
    Creative Commons ...
  • Created over 8 years ago
  • Updated almost 3 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Links to awesome OCR projects

Awesome OCR

Awesome

This list contains links to great software tools and libraries and literature related to Optical Character Recognition (OCR).

Contributions are welcome, as is feedback.

Software

OCR engines

  • tesseract - The definitive Open Source OCR engine Apache 2.0
  • EasyOCR - OCR engine built on PyTorch by JaidedAI, Apache 2.0
  • ocropus - OCR engine based on LSTM, Apache 2.0
  • ocropus 0.4 - Older v0.4 state of Ocropus, with tesseract 2.04 and iulib, C++
  • kraken - Ocropus fork with sane defaults
  • gocr - OCR engine under the GNU Public License led by Joerg Schulenburg.
  • Ocrad - The GNU OCR. GPL
  • ocular - Machine-learning OCR for historic documents
  • SwiftOCR - fast and simple OCR library written in Swift
  • attention-ocr - OCR engine using visual attention mechanisms
  • RWTH-OCR - The RWTH Aachen University Optical Character Recognition System
  • simple-ocr-opencv and its fork - A simple pythonic OCR engine using opencv and numpy
  • Calamari - OCR Engine based on OCRopy and Kraken
  • doctr - A seamless & high-performing OCR library powered by Deep Learning

Older and possibly abandoned OCR engines

  • Clara OCR - Open source OCR in C GPL
  • Cuneiform - CuneiForm OCR was developed by Cognitive Technologies
  • Eye - an experimental Java OCR (image-to-text) application
  • kognition - An omnifont OCR software for KDE
  • OCRchie - Modular Optical Character Recognition Software
  • ocre - o.c.r. easy
  • xplab - A GTK 2 tool for pattern matching
  • hebOCR - Hebrew character recognition library (previously named hocr, see Wikipedia article) GPL

OCR file formats

hOCR

  • hocr-tools - Tools for doing various useful things with hOCR files, Apache 2.0
  • hocr-spec - hOCR 1.2 specification
  • ocr-transform - CLI tool to convert between hOCR and ALTO, MIT
  • hocr-parser - hOCR Specification Python Parser
  • hOCRTools - hOCR to ALTO conversion XSLT

ALTO XML

TEI

  • TEI-OCR - TEI customization for OCR generated layout and content information
  • TEI SIG on Libraries - Best Practices for TEI in Libraries
  • GDZ - METS/TEI-based GDZ document format

PAGE XML

  • PAGE-XML Schema - XML schema of the PAGE XML format along with documentation and examples
  • omni:us Pages Format (OPF) - XML schema very similar to PAGE XML that has some additional features.
  • py-pagexml - Python library for handling PAGE XML and OPF files.

OCR CLI

  • OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
  • Pdf2PdfOCR - A tool to OCR a PDF (or supported images) and add a text "layer" (a "pdf sandwich") in the original file making it a searchable PDF. GUI included. Tesseract and cuneiform supported.
  • Ocrocis - Project manager interface for Ocropy, see also external project homepage
  • tesseract-recognize - Tesseract-based tool that outputs result in Page XML format (docker image).

OCR GUI

  • moz-hocr-editor - Firefox Addon for editing hOCR files Discontinued
  • qt-box-editor - QT4 editor of tesseract-ocr box files.
  • ocr-gt-tools - Client-Server application for editing OCR ground truth.
  • Paperwork - Using scanners and OCR to grep paper documents the easy way.
  • Paperless - Scan, index, and archive all of your paper documents.
  • gImageReader - gImageReader is a simple Gtk/Qt front-end to tesseract-ocr.
  • VietOCR - A Java/.NET GUI frontend for Tesseract OCR engine, including jTessBoxEditor a graphical Tesseract box data editor
  • PoCoTo - Fast interactive batch corrections of complete OCR error series in OCR'ed historical documents.
  • OCRFeeder - GTK graphical user interface that allows the users to correct characters or bounding boxes, ODT export and more.
  • PRImA PAGE Viewer - Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.
  • LAREX - A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.
  • archiscribe - Web application for transcribing OCR ground truth from Archive.org. Deployed instance available at https://archiscribe.jbaiter.de/, results are available in @jbaiter/archiscribe-corpus.
  • nw-page-editor - Simple app for visual editing of Page XML files. Provides desktop and server docker-based versions.

OCR Preprocessing

OCR as a Service

OCR evaluation

OCR libraries by programming language

Crystal

Elixir

  • tesseract_ocr - Elixir library wrapping the tesseract executable.

Go

  • gosseract - Golang OCR library, wrapping Tesseract-ocr.

Java

  • Tess4J - Java Native Access bindings to Tesseract.
  • tess-two - Tools for compiling Tesseract on Android and Java API.

.Net

Object Pascal

PHP

Python

  • pytesseract - A Python wrapper for Google Tesseract.
  • pyocr - A Python wrapper for Tesseract and Cuneiform.
  • ocrodjvu - A library and standalone tool for doing OCR on DjVu documents, wrapping Cuneiform, gocr, ocrad, ocropus and tesseract
  • tesserocr - A Python wrapper for the tesseract-ocr API

Javascript

  • ocracy - pure javascript lstm rnn implementation based on ocropus
  • gocr.js - Javascript port (emscripten) of gocr
  • ocrad.js - Javascript port (emscripten) of ocrad
  • tesseract.js - Javascript port (emscripten) of Tesseract
  • node-tesseract-ocr - A simple wrapper for the Tesseract OCR package.
  • node-tesseract-native - C++ module for node providing OCR with tesseract and leptonica.

Ruby

  • rtesseract - Ruby library wrapping the tesseract and imagemagick executables.
  • ruby-tesseract - Native Tesseract bindings for Ruby MRI and JRuby
  • ocr_space - API wrapper for free ocr service ocr.space. Includes CLI

Rust

  • tesseract.rs - Rust bindings for tesseract OCR.
  • leptess - Productive and safe Rust bindings/wrappers for tesseract and leptonica.

R

Swift

  • Tesseract OCR iOS - Swift and Objective-C wrapper for Tesseract OCR.
  • SwiftOCR - Fast and simple OCR library written in Swift. Optimized for recognizing short, one line long alphanumeric codes.

OCR training tools

  • glyph-miner - A system for extracting glyphs from early typeset prints
  • ocrodeg - Document image degradation for OCR data augmentation

Datasets

Ground Truth

  • Rescribe - Transcriptions of Caroline Minuscule Manuscripts PDM 1.0

Literature

OCR-related publication and link lists

Blog Posts and Tutorials

OCR Showcases

  • abbyy-finereader-ocr-senate - Using OCR to parse scanned Senate Financial Disclosure forms.
  • cvOCR - An OCR system for recognizing resume or cv text, implemented in Python and C and based on tesseract
  • MathOCR - A printed scientific document recognition system, pre-alpha

Academic articles

2011 and before

2012

2013

2014

2015

2016

2017

2018

More Repositories

1

hocrjs

Working with hOCR in Javascript
HTML
106
star
2

hocr-spec

The hOCR Embedded OCR Workflow and Output Format
HTML
71
star
3

jcurses

Java Curses implementation
Java
22
star
4

canvas-editor

Vue component for editing shapes in a canvas
Vue
20
star
5

makefile-parser

Parser and documentation generator for Makefiles
JavaScript
20
star
6

page-to-alto

Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)
Python
13
star
7

transkribus-to-prima

Convert Transkribus PAGE-XML to standard PAGE-XML
Python
11
star
8

vfs

Virtual File Systems with a node fs-like API
JavaScript
9
star
9

anno-common

Node.JS/Browser Web Annotation Framework
JavaScript
9
star
10

anno-frontend

Vue application for displaying/editing annotations
JavaScript
8
star
11

hocr-spec-python

Validation of hOCR close to the specs
Python
7
star
12

tsht

A tiny shell-script based testing framework
Shell
7
star
13

zts-in-a-box

Zotero Translation Server + Simple Query API + Swagger in Docker
CoffeeScript
6
star
14

ocror-detector

Detect errors in OCR
Python
6
star
15

winston-timer

Extend winston to measure time intervals
CoffeeScript
6
star
16

turtleson

Concise, permissive, TURTLE-like dialect of JSON
CoffeeScript
5
star
17

jsonld-rapper

Create RDF from JSON-LD with rapper
CoffeeScript
5
star
18

grip-docker

Run grip markdown renderer in a docker container
Shell
5
star
19

ocr-xsl

XSLT 2.0 functions for transforming between hOCR, ALTO and ABBYY
XSLT
5
star
20

ocr-fileformat-samples

Samples for common OCR file formats (hOCR, ABBYY, ALTO)
5
star
21

hocr-dom

Extend DOM to handle hOCR
JavaScript
4
star
22

shinclude

Include directives for code/markup comments
Shell
4
star
23

libjcsi

Java Console System Interface
Java
4
star
24

gdxai-btree.vim

Vim Syntax highlighting for gdx-ai behavior tree files
Vim Script
4
star
25

kraken-docker

Docker container for the kraken OCR engine
Makefile
4
star
26

node-vim2html

Create HTML highlighted by Vim's 2html script
CoffeeScript
3
star
27

mpv-config

mpv config
Lua
3
star
28

vdhd-2021-05-05

Demos for OCR-D presentation at OCR@vDHd
Makefile
3
star
29

dhd2022

3
star
30

ocr-models

A registry of models for OCR engines
Makefile
2
star
31

mollusc

Tools for handling line-based ground truth for OCR
JavaScript
2
star
32

neon

neon roguelike engine
2
star
33

ocrad-docker

GNU ocrad in a docker container
Makefile
2
star
34

tesseract-3.03-models

Tesseract 3.03 / 3.04 models
Shell
2
star
35

2019-icdar

2
star
36

object-prune

JavaScript
2
star
37

ucc

Java UniCode Constants (UCC)
Java
1
star
38

heiper

DOI registration microservice supporting dara and datacite
JavaScript
1
star
39

unicode-paint

A Java based Unicode image editor using Lanterna
Java
1
star
40

rlforj

Roguelike Library For Java
1
star
41

ocrrect

CoffeeScript
1
star
42

dh-2022-12-01

HTML
1
star
43

sift-date

Sift expressions to check for older/newer dates
JavaScript
1
star
44

ocrd-dita

Generating user docs for OCR-D from Markdown with DITA
Makefile
1
star
45

rssscrpr

Scrape web content to RSS feeds
HTML
1
star
46

dotfiles

Framework for handling lots and lots of dotfiles
Shell
1
star
47

semtonotes-utils

Enhancing SemToNotes with helper methods
JavaScript
1
star
48

ocrd-docs

OCR-D Documentation
Makefile
1
star
49

js-utils

Javascript utility and best practice functions for browser and Noded.JS
JavaScript
1
star