  • Stars: 259
  • Rank: 157,074 (Top 4%)
  • Language: Python
  • Created over 4 years ago
  • Updated over 1 year ago

Repository Details

Web scraping JavaScript-generated pages using Python and Selenium

Web Scraping JavaScript Generated Pages with Python

This project was created for educational purposes only.

The code shows how to scrape dynamic pages whose content is generated by JavaScript, using Python and Selenium.
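As a rough sketch of that approach (not the project's actual code), the snippet below opens a page in headless Firefox, waits until JavaScript has rendered the target element, and returns its HTML. The function name `fetch_rendered_table_html`, the default `table` selector, and the 10-second timeout are assumptions made for illustration.

```python
def fetch_rendered_table_html(url, css_selector="table", timeout=10):
    """Return the outer HTML of the first element matching css_selector.

    Hypothetical helper for illustration; it is not part of the project.
    """
    # Imported inside the function so this file still loads where
    # Selenium is not installed yet.
    from selenium import webdriver
    from selenium.webdriver.common.by import By
    from selenium.webdriver.firefox.options import Options
    from selenium.webdriver.support import expected_conditions as EC
    from selenium.webdriver.support.ui import WebDriverWait

    options = Options()
    options.add_argument("-headless")  # run Firefox without a visible window
    driver = webdriver.Firefox(options=options)  # relies on Geckodriver
    try:
        driver.get(url)
        # Block until the JavaScript-generated element exists in the DOM,
        # or raise a TimeoutException after `timeout` seconds.
        element = WebDriverWait(driver, timeout).until(
            EC.presence_of_element_located((By.CSS_SELECTOR, css_selector))
        )
        return element.get_attribute("outerHTML")
    finally:
        driver.quit()  # always close the browser, even on failure
```

The returned HTML string can then be parsed with BeautifulSoup or pandas. Waiting explicitly for the element matters here because a plain HTTP request would only see the page before its scripts have run.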

We use the NBA site as the data source: the code extracts player stats and generates a JSON file with several top 10 rankings.

Important: Educational Purposes Only
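The top 10 ranking step described above could look roughly like the sketch below. It uses only the standard library; the record layout, the `PTS` and `AST` column names, and the placeholder players are assumptions for illustration, not data or code from the project.

```python
import json


def top_n(players, stat, n=10):
    """Return the n records with the highest value for stat, best first."""
    ranked = sorted(players, key=lambda p: p[stat], reverse=True)
    return [{"player": p["player"], stat: p[stat]} for p in ranked[:n]]


def build_rankings(players, stats, n=10):
    """Map each stat name to its own top-n list."""
    return {stat: top_n(players, stat, n) for stat in stats}


def save_rankings(rankings, path):
    """Write the rankings dictionary to a JSON file."""
    with open(path, "w", encoding="utf-8") as fp:
        json.dump(rankings, fp, indent=2, ensure_ascii=False)


# Placeholder records standing in for scraped rows (not real stats).
sample = [
    {"player": "Player A", "PTS": 30, "AST": 5},
    {"player": "Player B", "PTS": 25, "AST": 9},
    {"player": "Player C", "PTS": 28, "AST": 7},
]

rankings = build_rankings(sample, ["PTS", "AST"], n=2)
print(json.dumps(rankings, indent=2))
```

Calling `save_rankings(rankings, "ranking.json")` would write the same structure to disk; the file name is also only an example.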

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

What you need to install before running the software:

  • Python 3.x
  • Geckodriver
  • Firefox (you can use another browser)
  • The Python libraries listed under Installing

Installing

A step-by-step guide to getting a development environment running.

Install the following Python libraries:

  • requests2 - Requests is the only Non-GMO HTTP library for Python, safe for human consumption;
  • pandas - A great Python Data Analysis Library;
  • lxml - Library for processing XML and HTML;
  • beautifulsoup4 - Library for pulling data out of HTML and XML files;
  • selenium - An API to write functional/acceptance tests using Selenium WebDriver.

With:

pip install -r requirements.txt

Geckodriver

You can find installation instructions in the official Geckodriver repository.
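Before running the scraper, a quick check like the one below can confirm that the prerequisites are in place. It is only a sketch: the import names are assumptions based on the library list above (beautifulsoup4 is imported as `bs4`), and the helper names are hypothetical.

```python
import importlib.util
import shutil

# Import names assumed for the libraries listed in this README.
REQUIRED_MODULES = ["requests", "pandas", "lxml", "bs4", "selenium"]


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def geckodriver_path():
    """Return the full path to geckodriver if it is on PATH, else None."""
    return shutil.which("geckodriver")


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing Python libraries:", ", ".join(missing))
    else:
        print("All Python libraries found.")
    path = geckodriver_path()
    print("geckodriver:", path if path else "not found on PATH")
```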

Running the code

python webscraping.py

Contributing

Feel free to submit pull requests.

Authors

  • Gabriel Froes - Initial work - Twitter
  • Vanessa Weber - Initial work - Twitter

License

This project is licensed under the GNU General Public License.

Acknowledgments

  • First steps in Python language
  • Create simple and useful things
  • Build content for Código Fonte TV, our YouTube channel.

More Repositories

1

tic-tac-toe

Tic Tac Toe by Código Fonte TV
JavaScript
169
star
2

among-us-start-reactor-game-vanilla-js

Among Us Start Reactor game built with web technologies: HTML5, CSS3, and JavaScript
JavaScript
100
star
3

photo-organizer

Photo Organizer made with Python
Python
99
star
4

youtube-chapter-extractor

Youtube Chapter Extractor
HTML
65
star
5

python-automated-actions

Simplifies project creation for video makers, developers, designers, photographers, or any other professional who needs to automate routine tasks.
Python
55
star
6

wordpress-development-environment-with-docker

Create a fast and powerful WordPress development environment using Docker
PHP
46
star
7

sorteio-websocket

Real-time raffle application using WebSocket
JavaScript
44
star
8

github-actions-ftp

HTML
41
star
9

design-pattern-strategy-na-pratica

Learn to implement the Strategy design pattern
PHP
39
star
10

byebg

Remove photo backgrounds automagically!
Dockerfile
36
star
11

8-ways-looping-arrays-javascript

JavaScript
34
star
12

api-nodejs-docker

Simple example of a Node.js API running in a Docker container
JavaScript
29
star
13

curso-desenvolvimento-plugins-wordpress

WordPress plugin development course
PHP
26
star
14

python-random-youtube-members

Python script for randomly drawing members of a YouTube channel membership club
Python
25
star
15

compilado-cli

CLI for the Compilado project (Código Fonte TV's newsletter and podcast)
Python
23
star
16

observer-pattern

Observer pattern implementation in TypeScript for Código Fonte TV
TypeScript
15
star
17

Youtube-Subscriber-Public-Counts

HTML
15
star
18

design-pattern-facade-na-pratica

Learn to implement the Facade design pattern
TypeScript
11
star
19

flutter_mobx_email_reativo

Mini project demonstrating the MobX library in Flutter (reactive state management)
Dart
10
star
20

my-instagram-plaque

C++
10
star
21

My-Youtube-Plaque

Created to celebrate the Código Fonte TV channel reaching 100K subscribers.
C++
9
star
22

youtube_video_validator

A simple Dart class for validating YouTube video URLs and IDs
Dart
6
star
23

pubsub-pattern

Implementation of the Pub/Sub architectural pattern in TypeScript for Código Fonte TV
TypeScript
5
star
24

clube-membros-youtube

Manage the membership club of a YouTube channel
Python
4
star
25

prompts-incriveis-para-codigo

A collection of incredible and useful prompts for understanding and refactoring other developers' code
C#
2
star
26

ProjetoCDFTV

CSS
1
star