• Stars
    star
    752
  • Rank 58,143 (Top 2 %)
  • Language
    Python
  • License
    GNU General Publi...
  • Created almost 5 years ago
  • Updated 8 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A simple resume parser used for extracting information from resumes

pyresparser

A simple resume parser used for extracting information from resumes

Built with โค๏ธŽ and โ˜• by Omkar Pathak


GitHub stars PyPI Downloads GitHub PyPI - Python Version Say Thanks! Build Status codecov

Features

  • Extract name
  • Extract email
  • Extract mobile numbers
  • Extract skills
  • Extract total experience
  • Extract college name
  • Extract degree
  • Extract designation
  • Extract company names

Installation

  • You can install this package using
pip install pyresparser
  • For NLP operations we use spacy and nltk. Install them using below commands:
# spaCy
python -m spacy download en_core_web_sm

# nltk
python -m nltk.downloader words
python -m nltk.downloader stopwords

Documentation

Official documentation is available at: https://www.omkarpathak.in/pyresparser/

Supported File Formats

  • PDF and DOCx files are supported on all Operating Systems
  • If you want to extract DOC files you can install textract for your OS (Linux, MacOS)
  • Note: You just have to install textract (and nothing else) and doc files will get parsed easily

Usage

  • Import it in your Python project
from pyresparser import ResumeParser
data = ResumeParser('/path/to/resume/file').get_extracted_data()

CLI

For running the resume extractor you can also use the cli provided

usage: pyresparser [-h] [-f FILE] [-d DIRECTORY] [-r REMOTEFILE]
                   [-re CUSTOM_REGEX] [-sf SKILLSFILE] [-e EXPORT_FORMAT]

optional arguments:
  -h, --help            show this help message and exit
  -f FILE, --file FILE  resume file to be extracted
  -d DIRECTORY, --directory DIRECTORY
                        directory containing all the resumes to be extracted
  -r REMOTEFILE, --remotefile REMOTEFILE
                        remote path for resume file to be extracted
  -re CUSTOM_REGEX, --custom-regex CUSTOM_REGEX
                        custom regex for parsing mobile numbers
  -sf SKILLSFILE, --skillsfile SKILLSFILE
                        custom skills CSV file against which skills are
                        searched for
  -e EXPORT_FORMAT, --export-format EXPORT_FORMAT
                        the information export format (json)

Notes:

  • If you are running the app on windows, then you can only extract .docs and .pdf files

Result

The module would return a list of dictionary objects with result as follows:

[
  {
    'college_name': ['Marathwada Mitra Mandalโ€™s College of Engineering'],
    'company_names': None,
    'degree': ['B.E. IN COMPUTER ENGINEERING'],
    'designation': ['Manager',
                    'TECHNICAL CONTENT WRITER',
                    'DATA ENGINEER'],
    'email': '[email protected]',
    'mobile_number': '8087996634',
    'name': 'Omkar Pathak',
    'no_of_pages': 3,
    'skills': ['Operating systems',
              'Linux',
              'Github',
              'Testing',
              'Content',
              'Automation',
              'Python',
              'Css',
              'Website',
              'Django',
              'Opencv',
              'Programming',
              'C',
              ...],
    'total_experience': 1.83
  }
]

References that helped me get here

Donation

If you have found my softwares to be of any use to you, do consider helping me pay my internet bills. This would encourage me to create many such softwares ๐Ÿ˜„

PayPal Donate via PayPal!
โ‚น (INR) Donate via Instamojo

Stargazer over time

Stargazers over time

More Repositories

1

pygorithm

A Python module for learning all major algorithms
Python
4,369
star
2

Python-Programs

My collection of Python Programs
Python
988
star
3

Data-Structures-using-Python

This is my repository for Data Structures using Python
Jupyter Notebook
924
star
4

ResumeParser

A simple resume parser used for extracting information from resumes
Python
272
star
5

django-notekeeper

๐Ÿ“‹ A pretty easy django notes app. Have a note, save it!
HTML
72
star
6

A-Simple-Note-Taking-Web-App

๐Ÿ“‹ An easy to use and deploy note taking web app built using Flask
HTML
72
star
7

Django-Gym-Member-Manager

A Gym Member Manager Web App using Django
JavaScript
52
star
8

Smart-Surveillance-System-using-Raspberry-Pi

This is my Third Year Project for face recognition using OpenCV
Python
48
star
9

Django-to-do

A simple Django to do web app
Python
33
star
10

A-Simple-Note-Taking-Terminal-App

A simple terminal note taking application using Python
Python
24
star
11

Django-Bookworm

A fun project to store my learning from each book that I read.
Python
24
star
12

Django-Invoice-Generator

A minimal Invoice Generator using Django
JavaScript
18
star
13

Password-Strength-Evaluator-using-Machine-Learning

My TE Seminar mini project
HTML
15
star
14

Term-Similarity-using-Machine-Learning

This is a small project to find similar terms in corpus of documents
Python
13
star
15

Codechef-Problem-Statement-Downloader

Script for downloading codechef problem statements
Python
10
star
16

omkarpathak.github.io

Personal Website based on Jekyll and Github Pages
HTML
9
star
17

Playing-with-datasets

This is the repo for mini projects to get your hands dirty with Machine Learning
Jupyter Notebook
8
star
18

My-Personal-Website

[DEPRECATED] This is the source code for my website
HTML
8
star
19

CCDSAP-Prep

My Practice Problems for CodeChef Certified Data Structure & Algorithms Programme
Python
6
star
20

Garbage-Level-Monitoring-System

My BE Project using IoT, Cloud and Django
JavaScript
6
star
21

Django-Invoice-Generator-For-Local-Dairy-Shop

A simple web app made to automate invoice generation created to help a local dairy shop
JavaScript
5
star
22

Word-Counter

A Python word counter module to quickly count number of words in a sentence
Python
4
star
23

Tic-Tac-Toe-Using-Minimax

The classic tic-tac-toe game with a minimal AI using Minimax Algorithm
Python
3
star
24

SlowBros

JavaScript
3
star
25

poc

Python
2
star
26

old-website

Code to my old website
HTML
2
star
27

illustrations

My collection of all the illustration I created myself
2
star
28

globchem

Source code for Global Construction Chemical website
CSS
2
star
29

dotfiles

My dotfiles and preferences
Shell
1
star
30

django-rest-todo

A simple Django REST Api to create todo lists
TypeScript
1
star
31

omkarpathak

1
star
32

IoT_Workshop

These are the files that we used for our IoT workshops.
PHP
1
star
33

techminds

A new sample website template
HTML
1
star
34

django-ecommerce-template

A generic template to include all basic features needed for an ecommerce website
Python
1
star
35

Fun-with-Pygame

Some weekend fun with learning Python pygame module
Python
1
star