• Stars
    star
    120
  • Rank 294,880 (Top 6 %)
  • Language
    Jupyter Notebook
  • License
    GNU General Publi...
  • Created about 6 years ago
  • Updated over 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A Box detection algorithm for any image containing boxes.

A Box detection algorithm for tabularized data.

  • This code is used to extract data which is in tabular format using image processing techniques.
  • When you are working with Optical character recognition(OCR) or any data or object recognition problem, the first thing to do is preprocessing.
  • Here preprocessing means to extract the box where our data is located. After extracting the boxes, any OCR algorithm can be performed on those crops for recognition.

Input Image

  • The problem arises when you have to detect objects which are located in any tables/boxes or in row-column format. If the image is like this then you have to detect boxes and extract them one by one. Now it should be done accurately for all images.

This algorithm helps to detect every boxes accurately and save it in a /Output/ folder. The code is shown in src/box_detection.py

USAGE:

  • Run python src/box_detection.py
  • You can see the output crops on /Output folder

alt text

You can also read the medium article for understanding about the algorithm: https://medium.com/@kananvyas/a-box-detection-algorithm-for-any-image-containing-boxes-756c15d7ed26