• Stars
    star
    3
  • Rank 3,963,521 (Top 79 %)
  • Language
    Shell
  • Created over 5 years ago
  • Updated over 3 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

This Kannada OCR benchmarking dataset contains 250 images, carefully chosen to have various kinds of recognition challenges. Some of the pages have italics and bold characters. Some of them have Halegannada poems and text; others are letterpress-printed pages, where the vowel modifiers appear as separate symbols and do not touch the consonants they go with. Some pages have interspersed English words; still others have tables with a lot of numeric data. In addition, there are old pages containing either a lot of broken characters or many words with two or more characters merged into a single connected component.