EnsNet: Ensconce Text in the Wild

A synthetic benchmark database for scene text removal has been released by the Deep Learning and Vision Computing Lab of South China University of Technology. The database can be downloaded through the following links:

Description

The training set of the synthetic database contains 8000 images and the test set contains 800 images; all training and test samples are resized to 512 × 512. The code for generating the synthetic dataset, and for producing more synthetic text images, is described in "Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, Synthetic Data for Text Localisation in Natural Images, CVPR 2016" and can be found at https://github.com/ankush-me/SynthText. All the real scene text images are also resized to 512 × 512.

For more details, please refer to our AAAI 2019 paper. arXiv: http://arxiv.org/abs/1812.00723

Requirements

  1. MXNet==1.3.1
  2. Python 2.
  3. NVIDIA GPU + CUDA 8.0.
  4. Matplotlib.
  5. NumPy.

Installation

  1. Clone this repository.
    git clone https://github.com/HCIILAB/Scene-Text-Removal
    

Running

1. Image Preparation

 Refer to the provided example for how to arrange the data.

2. Training

To train our model, you may need to change the dataset path, the network parameters, etc. Then run the following command:

python train.py \
--trainset_path=[the path of dataset] \
--checkpoint=[path save the model] \
--gpu=[use gpu] \
--lr=[Learning Rate] \
--n_epoch=[Number of iterations]
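
For readers who want to see how the flags above fit together, here is a minimal sketch of an argparse parser that accepts the same five options. It is not the parser in train.py: the function name build_train_parser, the argument types, and all default values are assumptions made for illustration, and the actual script may define them differently.

```python
import argparse


def build_train_parser():
    """Build a parser for the flags shown in the training command.

    The types and defaults below are illustrative assumptions,
    not values taken from train.py.
    """
    parser = argparse.ArgumentParser(description="EnsNet training (sketch)")
    parser.add_argument("--trainset_path", type=str, required=True,
                        help="path of the dataset")
    parser.add_argument("--checkpoint", type=str, required=True,
                        help="path where the model is saved")
    parser.add_argument("--gpu", type=int, default=0,
                        help="GPU to use (assumed here to be a device index)")
    parser.add_argument("--lr", type=float, default=0.0002,
                        help="learning rate (placeholder default)")
    parser.add_argument("--n_epoch", type=int, default=100,
                        help="number of training epochs (placeholder default)")
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch
# can be run anywhere; the paths are hypothetical.
args = build_train_parser().parse_args([
    "--trainset_path=./data/train",
    "--checkpoint=./checkpoints",
    "--gpu=0",
    "--lr=0.0002",
    "--n_epoch=500",
])
print(args.trainset_path, args.checkpoint, args.gpu, args.lr, args.n_epoch)
```

Passing the flags as `--name=value`, as in the command above, and as `--name value` are both accepted by argparse.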

3. Testing

To generate results for a set of input images, use test.py. Run the following command:

python test.py \
--test_image=[the path of test images] \
--model=[which model to test] \
--vis=[vis images] \
--result=[path to save the output images]

To evaluate model performance over a dataset, you can find the code for the evaluation metrics in PythonCode.zip.
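
The evaluation code itself lives in PythonCode.zip and is not reproduced here. As a rough illustration of what a full-reference comparison between a text-removed output and its ground-truth image can look like, the sketch below computes mean squared error (MSE) and peak signal-to-noise ratio (PSNR) in plain Python. Whether the official code uses these exact metrics or definitions is an assumption; the function names mse and psnr, and the nested-list image representation, are chosen only for this example.

```python
import math


def mse(img_a, img_b):
    """Mean squared error between two equally sized grayscale images.

    Images are nested lists of pixel values (rows of numbers).
    """
    flat_a = [p for row in img_a for p in row]
    flat_b = [p for row in img_b for p in row]
    if not flat_a or len(flat_a) != len(flat_b):
        raise ValueError("images must be non-empty and the same size")
    total = sum((a - b) ** 2 for a, b in zip(flat_a, flat_b))
    return total / float(len(flat_a))


def psnr(img_a, img_b, max_value=255.0):
    """Peak signal-to-noise ratio in dB; infinite for identical images."""
    err = mse(img_a, img_b)
    if err == 0:
        return float("inf")
    return 10.0 * math.log10(max_value ** 2 / err)


# Tiny 2 x 2 example: one pixel differs by 2, so MSE = 4 / 4 = 1.0.
ground_truth = [[0, 0], [0, 0]]
output = [[0, 0], [0, 2]]
print(mse(ground_truth, output))
print(psnr(ground_truth, output))
```

Lower MSE and higher PSNR indicate an output closer to the ground truth.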

4. Pretrained models

Please download the ImageNet pretrained model vgg16 (PASSWORD: 8tof), and put it under

root/.mxnet/models/
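
Placing the downloaded file can also be scripted. The following is a minimal sketch, assuming the download is a single parameter file on local disk; the helper name install_pretrained is hypothetical, and the file name is not specified by this README, so it is passed in as an argument.

```python
import os
import shutil


def install_pretrained(params_file, models_dir):
    """Copy a downloaded parameter file into models_dir.

    The directory is created first if it does not exist yet.
    Returns the path of the copied file.
    """
    if not os.path.isdir(models_dir):
        os.makedirs(models_dir)
    target = os.path.join(models_dir, os.path.basename(params_file))
    shutil.copy(params_file, target)
    return target
```

Call it with the path of the downloaded vgg16 file and the models directory named above.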

Paper

Please consider citing our paper when you use our database:

@article{zhang2019EnsNet,
  title     = {EnsNet: Ensconce Text in the Wild},
  author    = {Shuaitao Zhang and Yuliang Liu and Lianwen Jin and Yaoxiong Huang and Songxuan Lai},
  journal   = {AAAI},
  year      = {2019}
}

Feedback

Suggestions and opinions on this dataset (both positive and negative) are very welcome. Please contact the authors by sending an email to [email protected].

Copyright

The synthetic database may be used only for non-commercial research purposes.

For commercial use, please contact Dr. Lianwen Jin: [email protected].

Copyright 2018, Deep Learning and Vision Computing Lab, South China University of Technology. http://www.dlvc-lab.net
