  • Stars: 140
  • Rank 256,179 (Top 6 %)
  • Language: Jupyter Notebook
  • Created over 4 years ago
  • Updated 10 months ago

Data Science for Beginners

A collection of Jupyter Notebooks, HTML and JS code for Data Scientists.

Commentary on the individual scripts can be found on my Medium blog and on my website.

Programming Languages

  • Python 3.X
  • HTML5
  • JavaScript, with a focus on D3.js
  • CSS

Other interesting tutorials can be found on my Observable Profile.

Authors

Documentation

The project is organised into folders, one for each topic covered:

  • Data Collection - data extraction from HTML, Twitter, PDF ...
  • Preprocessing - missing data, duplicates, normalisation, binning ...
  • Data Analysis - full workflows in scikit-learn and PyCaret, as well as overfitting, AutoML, ...
  • Text Analysis - sentiment analysis, ...
  • Data Visualisation - examples in Altair, Plotly, D3.js, ...
  • Data Narrative - how to improve data visualisations.
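
To give a flavour of what the Preprocessing topic covers, here is a minimal sketch of the four steps named above (duplicates, missing data, normalisation, binning). It is not taken from the notebooks in this project: the toy records, the function names and the bin edges are all hypothetical, and it uses only the Python standard library so it can be run anywhere.

```python
from bisect import bisect_left
from statistics import mean

# Hypothetical toy records (not from the repository); None marks a missing value.
rows = [
    {"id": 1, "age": 20},
    {"id": 2, "age": None},
    {"id": 3, "age": 40},
    {"id": 3, "age": 40},  # exact duplicate of the previous record
    {"id": 4, "age": 60},
]


def drop_duplicates(records):
    """Keep the first occurrence of each identical record, preserving order."""
    seen, unique = set(), []
    for rec in records:
        key = tuple(sorted(rec.items()))
        if key not in seen:
            seen.add(key)
            unique.append(rec)
    return unique


def fill_missing(records, field):
    """Replace None in `field` with the mean of the observed values."""
    observed = [r[field] for r in records if r[field] is not None]
    fill = mean(observed)
    return [{**r, field: fill if r[field] is None else r[field]} for r in records]


def min_max(values):
    """Rescale values to the range [0, 1]; assumes they are not all equal."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


def bin_values(values, edges, labels):
    """Label each value by its bin; a value equal to an edge falls in the lower bin."""
    return [labels[bisect_left(edges, v)] for v in values]


clean = fill_missing(drop_duplicates(rows), "age")
ages = [r["age"] for r in clean]
scaled = min_max(ages)
groups = bin_values(ages, edges=[30, 50], labels=["low", "mid", "high"])

print(ages)
print(scaled)
print(groups)
```

Filling with the mean is only one possible imputation strategy, chosen here for brevity; the order of the steps (deduplicate first, then impute) is likewise an assumption of this sketch.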