• This repository has been archived on 11/Jun/2020
  • Language
    Python
  • License
    MIT License
  • Created over 7 years ago
  • Updated over 4 years ago

Repository Details

A TensorFlow implementation of https://github.com/bgshih/crnn

This repository has been archived. Please see all the other (much better) implementations around GitHub

CRNN

A TensorFlow implementation of https://github.com/bgshih/crnn

But what is a CRNN?

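A CRNN is a Convolutional Recurrent Neural Network: convolutional layers extract features from the image and recurrent layers read those features as a sequence, so it can be used for optical character recognition (OCR).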
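To make the "sequence" part concrete, here is a minimal sketch of how per-frame predictions could be turned into text. It assumes CTC-style greedy decoding as described in the original CRNN paper; the function name `ctc_greedy_decode` and the choice of the last class index as the blank are illustrative assumptions, not code taken from this repository.

```python
from itertools import groupby


def ctc_greedy_decode(frame_ids, charset, blank=None):
    """Collapse repeated per-frame class ids, then drop the blank class.

    Assumption: the blank symbol is the class index right after the
    last character of the charset (hypothetical convention).
    """
    if blank is None:
        blank = len(charset)
    # Step 1: merge runs of identical consecutive predictions.
    collapsed = [class_id for class_id, _ in groupby(frame_ids)]
    # Step 2: remove blanks and map the remaining ids to characters.
    return "".join(charset[i] for i in collapsed if i != blank)


charset = "abc"
# Frames: a a <blank> a b b <blank> c
print(ctc_greedy_decode([0, 0, 3, 0, 1, 1, 3, 2], charset))  # aabc
```

The blank between the two runs of `a` is what lets the decoder output a doubled letter instead of merging them into one.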

Requirements

  • TensorFlow (tested with 1.8): pip3 install tensorflow
  • SciPy: pip3 install scipy

What training data was used?

All training data (200,000 examples) was generated with my other project, https://github.com/Belval/TextRecognitionDataGenerator

To do the same, install that project with pip (pip install trdg) and run trdg -c 200000 -w 1 -t 8, where the value passed to -t should match your processor's thread count.
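As a small sketch of picking the thread count automatically, the snippet below builds the same trdg command line with -t taken from the machine. The helper name `build_trdg_command` is hypothetical; only the -c, -w and -t flags and their values come from the text above.

```python
import os


def build_trdg_command(count=200000, w=1, threads=None):
    """Return the trdg invocation from the README as an argument list.

    threads defaults to the number of logical CPUs reported by the OS.
    """
    if threads is None:
        threads = os.cpu_count() or 1  # cpu_count() may return None
    return ["trdg", "-c", str(count), "-w", str(w), "-t", str(threads)]


cmd = build_trdg_command()
print(" ".join(cmd))
# To actually generate the data (requires trdg to be installed):
#   import subprocess; subprocess.run(cmd, check=True)
```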

Pretrained model

A pretrained model is available in CRNN/save. To test it, run python3 run.py -ex ../data/test --test --restore

Specify charset

You can restrict the charset, for example to digits only: python run.py --train -ex ../data/test -it 50000 -cs 0123456789
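To illustrate what a restricted charset implies for the labels, here is a minimal sketch that maps each character to a class index and rejects text outside the charset. The helpers `build_char_index` and `encode_label` are hypothetical and are not taken from run.py; how this repository encodes labels internally is not shown in the source.

```python
def build_char_index(charset):
    """Map every character of the charset to its position (class id)."""
    return {ch: i for i, ch in enumerate(charset)}


def encode_label(text, char_index):
    """Convert a ground-truth string into a list of class ids."""
    missing = sorted(set(text) - set(char_index))
    if missing:
        raise ValueError(f"characters not in charset: {missing}")
    return [char_index[ch] for ch in text]


digits = build_char_index("0123456789")
print(encode_label("2020", digits))  # [2, 0, 2, 0]
```

With a digits-only charset the output layer needs far fewer classes, and any training sample containing letters would have to be filtered out or rejected, as `encode_label` does here.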

Results

It works, but in its current form it is a suboptimal OCR solution because it still makes mistakes. Note that I used a larger character vector than the paper did.

For fun, here is a list of words with their predictions:

Ground truth    Prediction      Image
retardates      retardates      1
pay-roller      poy-roler       2
rhizopodist     rhizospodist    3
theriacas       trenagas        4
semantically    semanticaly     5
dualistic       duaistic        6
high-flying     highi-fling     7
grossify        grsity          8
scutular        scutular        9
crispened       crispened       10
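One way to put numbers on the ten samples listed above is to count exact matches and compute a character error rate from the edit distance. This is a sketch for illustration only; these metrics are not reported by the author, and the ten pairs are far too few to characterize the model.

```python
def levenshtein(a, b):
    """Edit distance (insertions, deletions, substitutions) between a and b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete from a
                            curr[j - 1] + 1,      # insert into a
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]


# (ground truth, prediction) pairs copied from the list above.
pairs = [
    ("retardates", "retardates"), ("pay-roller", "poy-roler"),
    ("rhizopodist", "rhizospodist"), ("theriacas", "trenagas"),
    ("semantically", "semanticaly"), ("dualistic", "duaistic"),
    ("high-flying", "highi-fling"), ("grossify", "grsity"),
    ("scutular", "scutular"), ("crispened", "crispened"),
]

exact = sum(truth == pred for truth, pred in pairs)
edits = sum(levenshtein(truth, pred) for truth, pred in pairs)
chars = sum(len(truth) for truth, _ in pairs)

print(f"exact matches: {exact}/{len(pairs)}")  # exact matches: 3/10
print(f"character error rate: {edits / chars:.3f}")
```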
