Deep Learning for OCR
This is a reading list for deep learning for OCR. I became interested in this topic when I was developing a mobile app for receipt and shopping management. See my homepage for more this app. I am going to release the dataset we collected for this project.
Papers
- Reading text in the wild with convolutional neural networks (2016), M. Jaderberg et al. (DeepMind) [pdf]
- Recursive Recurrent Nets with Attention Modeling for OCR in the Wild, Chen-Yu Lee, Simon Osindero, 2016, pdf
- The Unreasonable Effectiveness of Recurrent Neural Networks, Andrej Karpathy, 2015, blog
- A Comparison of Sequence-Trained Deep Neural Networks and Recurrent Neural Networks Optical Modeling for Handwriting Recognition, Theodore Bluche, Hermann Ney, and Christopher Kermorvant, slsp, 2014. pdf
- Optical Character Recognition (OCR), Marina Samuel, blog
- Text Recognition in Videos using a Recurrent Connectionist Approach, Khaoula Elagouni, Christophe Garcia, Franck Mamalet1 , and Pascale Sebillot, 2012. pdf
- A Novel Word Spotting Method Based on Recurrent Neural Networks, Volkmar Frinken, Andreas Fischer, R. Manmatha, and Horst Bunke, pdf
- SEQUENCE TO SEQUENCE LEARNING FOR OPTICAL CHARACTER RECOGNITION, Devendra Kumar Sahu & Mohak Sukhwani, 2015, pdf
- Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention, Theodore Bluche Jerome Louradour, Ronaldo Messina, 2016. pdf
- Recursive Recurrent Nets with Attention Modeling for OCR in the Wild, Chen-Yu Lee, Simon Osindero, CVPR, 2016, pdf
- Reading Scene Text in Deep Convolutional Sequences, Pan He, Weilin Huang, Yu Qiao, Chen Change Loy, and Xiaoou Tang, 2016, pdf
- Multi-digit Number Recognition from Street View, Imagery using Deep Convolutional Neural Networks, Ian J. Goodfellow, Yaroslav Bulatov, Julian Ibarz, Sacha Arnoud, Vinay Shet. 2014. pdf
- ReNet: A Recurrent Neural Network Based Alternative to Convolutional Networks, Francesco Visin, Kyle Kastner,Kyunghyun Cho, Matteo Matteucci,Aaron Courville, Yoshua Bengio. pdf
- End-to-End Text Recognition with Convolutional Neural Networks, Tao Wang, David J. Wu, Adam Coates, Andrew Y. Ng, ICPR, 2012. pdf
- Towards End-to-End Speech Recognition with Recurrent Neural Networks. Alex Graves, Navdeep Jaitly. ICML, 2014. pdf
- PhotoOCR: Reading Text in Uncontrolled Conditions, Alessandro Bissacco, Mark Cummins, Yuval Netzer, Hartmut Neven. pdf
- High Performance OCR for Printed English and Fraktur using LSTM Networks. ICDAR, 2013. pdf
- Image binarization for end-to-end text understanding in natural images, Sergey Milyaev, Olga Barinova, Tatiana Novikova, Pushmeet Kohli, Victor Lempitsky. ICDAR, 2013, pdf
- Deep Features for Text Spotting, European Conference on Computer Vision, M. Jaderberg, A. Vedaldi, A. Zisserman, 2014. pdf, code