  • Stars: 130
  • Rank: 277,575 (Top 6 %)
  • Language: Python
  • Created almost 5 years ago
  • Updated over 4 years ago

Repository Details

Unofficial PyTorch implementation of 2D Attentional Irregular Scene Text Recognizer

2D Attentional Irregular Scene Text Recognizer

This is an unofficial PyTorch implementation of the paper, which converts irregular text with a 2D layout directly into a character sequence using a 2D attentional scheme. The paper uses a relation attention module to capture dependencies within the feature maps and a parallel attention module to decode all characters in parallel.

At present, this implementation does not reach the accuracy reported in the paper. Parts of the code are borrowed from deep-text-recognition-benchmark.

model

result
Tested on ICDAR2019, the model reaches only 51.15% accuracy. I will continue to improve it.

Feature

  1. Outputs the whole text string of an image in a single pass, unlike a seq2seq model, which decodes one character at a time.
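
The single-pass decoding idea can be sketched roughly as follows. This is a minimal illustration and not the code in this repository: the class name `ParallelAttentionSketch`, the layer sizes, the maximum length of 25, and the 38 output classes are all assumptions chosen for the example. The attention form (a softmax over spatial positions of a two-layer projection) only loosely follows the parallel attention module described in the paper, and the relation attention module is omitted.

```python
import torch
import torch.nn as nn


class ParallelAttentionSketch(nn.Module):
    """Illustrative parallel attention decoder (hypothetical, not the repo's code)."""

    def __init__(self, feat_dim=512, hidden_dim=256, max_len=25, num_classes=38):
        super().__init__()
        # Two projections produce one attention map per output position.
        self.w1 = nn.Linear(feat_dim, hidden_dim, bias=False)
        self.w2 = nn.Linear(hidden_dim, max_len, bias=False)
        # One shared classifier is applied to every glimpse.
        self.classifier = nn.Linear(feat_dim, num_classes)

    def forward(self, feats):
        # feats: (batch, H*W, feat_dim), i.e. a flattened 2D feature map.
        scores = self.w2(torch.tanh(self.w1(feats)))          # (B, H*W, T)
        attn = torch.softmax(scores.transpose(1, 2), dim=-1)  # (B, T, H*W)
        glimpses = torch.bmm(attn, feats)                     # (B, T, feat_dim)
        logits = self.classifier(glimpses)                    # (B, T, num_classes)
        return logits, attn


if __name__ == "__main__":
    model = ParallelAttentionSketch()
    feats = torch.randn(2, 8 * 32, 512)  # two images with an 8x32 feature map
    logits, attn = model(feats)
    print(logits.shape)  # all 25 positions are predicted in one forward pass
```

Because every output position has its own attention map, no position waits for the previous character, which is what distinguishes this from step-by-step seq2seq decoding.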

Requirements

PyTorch >= 1.1.0
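
A quick way to check that an installed version satisfies this requirement is sketched below. The helper names `version_tuple` and `meets_minimum` are hypothetical and not part of this repository or of PyTorch. The sketch assumes version strings of the usual form, such as `1.8.0+cu111`.

```python
import re


def version_tuple(version):
    """Turn a string like '1.8.0+cu111' into (1, 8, 0)."""
    # Drop a local build suffix such as '+cu111', then read the leading numbers.
    numbers = re.findall(r"\d+", version.split("+")[0])
    parts = [int(n) for n in numbers[:3]]
    # Pad short versions such as '1.1' so that tuple comparison is well defined.
    parts += [0] * (3 - len(parts))
    return tuple(parts)


def meets_minimum(installed, minimum="1.1.0"):
    """Return True if `installed` is at least `minimum`."""
    return version_tuple(installed) >= version_tuple(minimum)
```

With PyTorch installed, the check would be `meets_minimum(torch.__version__)`.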

Test

  1. Download the pretrained model from Baidu (password: kdah).

  2. Test on the images in the demo_image folder:

python demo.py --image_folder demo_image --saved_model <model_path/best_accuracy.pth>
  3. Some examples. Text recognized by Bert_OCR for each demo image:
       available
       shakesshack
       london
       greenstead
       toast
       merry
       underground
       ronaldo
       bally
       university
  4. Results on benchmark datasets (accuracy, %):

       Dataset       Accuracy
       IIIT5k_3000   84.367
       SVT           79.907
       IC03_860      91.860
       IC03_867      91.465
       IC13_857      88.448
       IC13_1015     86.010
       IC15_1811     65.654
       IC15_2077     63.215
       SVTP          68.527
       CUTE80        81.185

total_accuracy: 78.423
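
For context, per-dataset scores like these are usually word-level accuracies: a prediction counts only if the whole string matches the label. The sketch below assumes that protocol, with case-insensitive comparison over letters and digits as commonly used with deep-text-recognition-benchmark. The function names `normalize` and `word_accuracy` are hypothetical, and this is not the evaluation code that produced the numbers above.

```python
import re


def normalize(text):
    """Lowercase and keep only letters and digits (assumed evaluation protocol)."""
    return re.sub(r"[^0-9a-z]", "", text.lower())


def word_accuracy(predictions, labels):
    """Percentage of predictions that exactly match their label after normalization."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        return 0.0
    correct = sum(normalize(p) == normalize(g) for p, g in zip(predictions, labels))
    return 100.0 * correct / len(labels)


if __name__ == "__main__":
    preds = ["london", "toast", "merry"]
    truth = ["London", "toast", "marry"]
    print(round(word_accuracy(preds, truth), 3))  # two of three match: 66.667
```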


Train

  1. I prepared a small dataset for training. The images and labels are in ./dataset/BAIDU.
python train.py --root ./dataset/BAIDU/images/ --train_csv ./dataset/BAIDU/small_train.txt --val_csv ./dataset/BAIDU/small_train.txt
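
The command above passes the same file to both --train_csv and --val_csv, so the validation score is measured on training data. A held-out split can be made with a sketch like the one below, assuming the label file stores one sample per line (the actual file format is not documented here). The function name `split_lines` and the output file names in the usage note are hypothetical.

```python
import random


def split_lines(lines, val_fraction=0.1, seed=0):
    """Shuffle non-empty lines and split them into (train, val) lists."""
    samples = [line for line in lines if line.strip()]
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    rng.shuffle(samples)
    # Hold out at least one sample when there is more than one to split.
    n_val = max(1, int(len(samples) * val_fraction)) if len(samples) > 1 else 0
    return samples[n_val:], samples[:n_val]


if __name__ == "__main__":
    demo = ["img_%d.jpg\tlabel%d\n" % (i, i) for i in range(10)]
    train, val = split_lines(demo)
    print(len(train), len(val))  # 9 1
```

The two lists could then be written to, for example, my_train.txt and my_val.txt and passed to --train_csv and --val_csv respectively.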

Reference

  1. deep-text-recognition-benchmark
  2. 2D Attentional Irregular Scene Text Recognizer

More Repositories

  1. Attention_ocr.pytorch (Python, 347 stars)
     Implements an encoder-decoder model with attention for OCR.

  2. CenterFace.pytorch (Python, 229 stars)
     Unofficial version of CenterFace, which achieves the best balance between speed and accuracy in face detection.

  3. SRN.pytorch (Python, 184 stars)
     Unofficial PyTorch implementation of Towards Accurate Scene Text Recognition with Semantic Reasoning Networks.

  4. DDRNet.pytorch (Python, 124 stars)
     Unofficial code for Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes, which achieves a state-of-the-art trade-off between accuracy and speed on Cityscapes and CamVid without inference acceleration or extra data.

  5. CLPR.pytorch (Python, 75 stars)
     End-to-end Chinese license plate recognition.

  6. FOTS.pytorch (C++, 67 stars)
     An unofficial implementation of FOTS: Fast Oriented Text Spotting with a Unified Network.

  7. facemask (Python, 26 stars)
     Face mask detection. Joining in the fun: a classifier that detects whether a mask is being worn, similar to Baidu's, but faster.

  8. ocr_annotation (Python, 23 stars)
     A web tool for OCR annotation built with Python and Flask.

  9. Self_cross_entropy (Python, 9 stars)
     A cross_entropy function written in PyTorch that removes abnormal NaN values.

  10. Flask_UI_Pytorch (HTML, 7 stars)
      A sample UI for deep learning built with Flask.

  11. GIOU_EAST (Python, 6 stars)
      Uses the GIoU loss function for EAST.

  12. tensorrt.test (C++, 5 stars)
      Tests of the TensorRT C++/Python API and C++/Python plugins.

  13. dler.collection (C++, 4 stars)
      A collection of study materials.

  14. transformerOcr (1 star)
      Uses a transformer as the decoder model for OCR.

  15. flask_api (Python, 1 star)
      Flask deep learning API demo.

  16. AI.hisi (C, 1 star)
      Deployment, evaluation, and comparison of AI models on hisi development boards.

  17. tensorrt.collection (C++, 1 star)
      Examples and benchmarks for using TensorRT.