• Stars
    star
    1
  • Language
    HTML
  • Created over 3 years ago
  • Updated 7 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

More Repositories

1

Urdu-OCR

Our project is based on one of the most important application of machine learning i.e. pattern recognition. Optical character recognition or optical character reader is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo or from subtitle text superimposed on an image. We are working on developing an OCR for URDU. We studied a couple of research papers related to our project. So far, we have found that Both Arabic and Urdu are written in Perso-Arabic script; at the written level, therefore, they share similarities. The styles of Arabic and Persian writing have a heavy influence on the Urdu script. There are 6 major styles for writing Arabic, Persian and Pashto as well. Urdu is written in Naskh writing style which is most famous of all. Optical character recognition (OCR) is the process of converting an image of text, such as a scanned paper document or electronic fax file, into computer-editable text [1]. The text in an image is not editable: the letters are made of tiny dots (pixels) that together form a picture of text. During OCR, the software analyzes an image and converts the pictures of the characters to editable text based on the patterns of the pixels in the image. After OCR, the converted text can be exported and used with a variety of word-processing, page layout and spreadsheet applications [2]. One of the main aims of OCR is to emulate the human ability to read at a much faster rate by associating symbolic identities with images of characters. Its potential applications include Screen Readers, Refreshable Braille Displays [3], reading customer filled forms, reading postal address off envelops, archiving and retrieving text etc. OCR’s ultimate goal is to develop a communication interface between the computer and its potential users. Urdu is the national language of Pakistan. It is a language that is understood by over 300 million people belonging to Pakistan, India and Bangladesh. Due to its historical database of literature, there is definitely a need to devise automatic systems for conversion of this literature into electronic form that may be accessible on the worldwide web. Although much work has been done in the field of OCR, Urdu and other languages using the Arabic script like Farsi, Urdu and Arabic, have received least attention. This is due in part to a lack of interest in the field and in part to the intricacies of the Arabic script. Owing to this state of indifference, there remains a huge amount of Urdu and Arabic literature unattended and rotting away on some old shelves. The proposed research aims to develop workable solutions to many of the problems faced in realization of an OCR designed specifically for Urdu Noori Nastaleeq Script, which is widely used in Urdu newspapers, governmental documents and books. The underlying processes first isolate and classify ligatures based on certain carefully chosen special, contour and statistical features and eventually recognize them with the aid of Feed-Forward Back Propagation Neural Networks. The input to the system is a monochrome bitmap image file of Urdu text written in Noori Nastaleeq and the output is the equivalent text converted to an editable text file.
Jupyter Notebook
10
star
2

Traffic-Congestion-Avoidance-For-Autonomous-Vehicles-Using-Reinforcement-Learning

The Aim of this project is to design an algorithm that will significantly decrease the trip times of vehicles as well as average car density on the map. Simulations will be used to simulate the Environment. SUMO environment will be utilized to design the road network, apply traffic policies and integrate intelligent algorithms with it. SUMO is a free and open traffic simulation suite that allows modeling of intermodal traffic systems including road vehicles, public transport and pedestrians.
Python
9
star
3

Autoencoder-

Image reconstruction using autoencoder in tensorflow
Jupyter Notebook
1
star
4

Image-Segmentation-and-Detection

Automated plant object segmentation. For each image, automatically segment plant from background.
Jupyter Notebook
1
star
5

Voice_Activity_Detection

Frame-VAD: More Effective and Efficient VAD for More Fine-grained Timestamps
Jupyter Notebook
1
star
6

Spam-Message-Detector-Model

Jupyter Notebook
1
star
7

Arduino-Self-Balancing-Robot

1
star
8

Fine-tuning-image-classification-models-from-image-search

Jupyter Notebook
1
star
9

muhammadsohaib

Config files for my GitHub profile.
1
star
10

Computer-Vision-Annotation-Tool

1
star
11

Introduction-of-Computer-Vision

Jupyter Notebook
1
star
12

Overall-average-sentiment-score

Jupyter Notebook
1
star
13

Detection-Point

Jupyter Notebook
1
star
14

Generate-a-grid-of-pitch-

Python
1
star