• Stars
    star
    184
  • Rank 209,187 (Top 5 %)
  • Language
    Python
  • Created over 4 years ago
  • Updated over 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Unofficial PyTorch implementation of Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

Unofficial PyTorch implementation of the paper, which integrates not only global semantic reasoning module but also parallel visual attention module and visual-semantic fusion decoder.the semanti reasoning network(SRN) can be trained end-to-end.

At present, the accuracy of the paper cannot be achieved. And i borrowed code from deep-text-recognition-benchmark

model

result

IIIT5k_3000 SVT IC03_860 IC03_867 IC13_857 IC13_1015 IC15_1811 IC15_2077 SVTP CUTE80
84.600 83.617 92.907 92.849 90.315 88.177 71.010 68.064 71.008 68.641

total_accuracy: 80.597


Feature

  • predict the character at once time
  • DistributedDataParallel training

Requirements

Pytorch >= 1.1.0

Test

  1. download the evaluation data from deep-text-recognition-benchmark

  2. download the pretrained model from Baidu, Password: d2qn

  3. test on the evaluation data

python test.py --eval_data path-to-data --saved_model path-to-model

Train

  1. download the training data from deep-text-recognition-benchmark

  2. training from scratch

python train.py --train_data path-to-train-data --valid-data path-to-valid-data

Reference

  1. bert_ocr.pytorch
  2. deep-text-recognition-benchmark
  3. 2D Attentional Irregular Scene Text Recognizer
  4. Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

difference with the origin paper

  • use resnet for 1D feature not resnetFpn 2D feature
  • use add not gated unit for visual-semanti fusion decoder

other

It is difficult to achieve the accuracy of the paper, hope more people to try and share

More Repositories

1

Attention_ocr.pytorch

This repository implements the the encoder and decoder model with attention model for OCR
Python
347
star
2

CenterFace.pytorch

unofficial version of centerface, which achieves the best balance between speed and accuracy at face detection
Python
229
star
3

Bert_OCR.pytorch

Unofficial PyTorch implementation of 2D Attentional Irregular Scene Text Recognizer
Python
130
star
4

DDRNet.pytorch

This is the unofficial code of Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes. which achieve state-of-the-art trade-off between accuracy and speed on cityscapes and camvid, without using inference acceleration and extra data
Python
124
star
5

CLPR.pytorch

End to End Chinese License Plate Recognition
Python
75
star
6

FOTS.pytorch

an unofficial implementation of FOTS: Fast Oriented Text Spotting with a Unified Network
C++
67
star
7

facemask

口罩检测。凑个热闹,和百度类似的是否佩戴口罩检测分类,但是速度会更快。
Python
26
star
8

ocr_annotation

using python and flask for ocr annotation web tool
Python
23
star
9

Self_cross_entropy

Write a cross_entropy function in pytorch to remove the abnormal nan value
Python
9
star
10

Flask_UI_Pytorch

create a sample UI for deep learning by using flask
HTML
7
star
11

GIOU_EAST

use giou loss function for east
Python
6
star
12

tensorrt.test

test tensorrt c++/python api, c++/python plugins.
C++
5
star
13

dler.collection

a collection of studing
C++
4
star
14

transformerOcr

use transformer as the decoder model for OCR
1
star
15

flask_api

Flask deep learning API demo
Python
1
star
16

AI.hisi

AI在hisi开发板上的部署,评测,对比
C
1
star
17

tensorrt.collection

some example and benchmark for using tensorrt
C++
1
star