• Stars
    star
    142
  • Rank 256,971 (Top 6 %)
  • Language
    Python
  • License
    MIT License
  • Created almost 9 years ago
  • Updated over 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Scrapes g4g and creates PDF

GeeksForGeeksScrapper

Scrapes GeeksForGeeks and creates html & PDF for chosen category along with syntax highlighting for the code.

Screenshots

Example of articles of C category as html:

alt tag

Example of articles of C category as pdf:

alt tag

Installation

To use the scrapper, install the following:

$ sudo apt-get install wkhtmltopdf

Then create venv

$ virtualenv /path/to/g4g-env

Switch to venv

$ source /path/to/g4g-env/bin/activate

Now install BeautifulSoup as:

$ pip install beautifulsoup4

or via package manager as:

$ sudo apt-get install python-bs4

or for Python dependencies, you can just install via requirements.txt inside the virtual environment.

$ pip install -r requirements.txt

Run the G4G_Scrapper

$ python g4g.py

Choose the category you want to scrape from the menu and wait for the magic to happen :)

You can find the output as G4G_<category_name>.html and G4G_<category_name>.pdf in the same directory.

Disclaimer: This is strictly for educational purpose only. Author will not be liable for anything, what so ever; you do with this script.

More Repositories

1

django-phone-verify

A Django app to support phone number verification using security code / One-Time-Password (OTP) sent via SMS.
Python
257
star
2

Awesome-Learning

A repo to keep track of Awesome Learning from ideas, books, people and resources
48
star
3

jdf-latex

LaTex template for Joyner Document Format (JDF)
TeX
32
star
4

dotfiles

Customised ~/.* files with β™₯ for my machine :)
Shell
21
star
5

nexus

Streamline & automate the boring parts
Python
11
star
6

git-intro

Learning version control with git
CSS
6
star
7

API-stub-generator

Mock propsed API endpoints with stub
Python
6
star
8

TrackMyLinks

App to Track all your Resources at a single place and makes management efficient.
HTML
5
star
9

TagLask

A light weight webapp made on Flask to index and search user files.
HTML
4
star
10

standup

Automating standups; one task at a time!
Python
3
star
11

CompetitiveProgramming

Solutions to Competitive Programming problems I've solved.
Python
3
star
12

CuriousLearner.github.io

A site about me. Who am I and what I do.
HTML
3
star
13

Weatherango

An App to visualize various weather parameters in any city for specific date range
Python
2
star
14

til

Collection of my daily learning
Python
2
star
15

blog

Random Junkyard for stuff I'm upto. FOSS, Life, Learning in one place :)
Python
2
star
16

Algorithms

Revising the algorithms
C++
1
star
17

goals

Some of the goals accomplished and some are still to be accomplished, but they're happening every day πŸš€
1
star
18

tenzies

A simple game to get all 10 dice to have the same face.
JavaScript
1
star
19

inpycon2016-workshop-qr-codes

Getting name and QR codes for sticker printing for PyCon India 2016
HTML
1
star
20

asyncio-file-upload

Test repository for playing with asyncio
Shell
1
star
21

dj-file-async

Test repo for developing async file uploads
Python
1
star
22

python-projects

Python projects for learning Python
Python
1
star
23

Personal-Python-Scripts

A repo for Personal Python Scripts which I've written from time to time.
Python
1
star
24

AngelHackDelhi2016

Hackathon project for AngelHack Delhi 2016 | Trending Tweeps
JavaScript
1
star
25

Shopster

NFC/RFID/QR Code based shoppping
Java
1
star
26

python-docs-hi-in

Hindi Translations for the Python Documentation
Makefile
1
star
27

dj-init-travis-test

Python
1
star
28

presentations

Contains all presentations I've given from 2017 onwards
CSS
1
star