• Stars
    star
    197
  • Rank 197,722 (Top 4 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created over 6 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)

Data Science Stack - Cookiecutter

Maintainers Wanted

Cookiecutter to launch an awesome Data Science toolstack in Docker.

See it in action

asciicast

Overall Architecture

architecture

Used Variables

The following table provides an overview about parameter, that are queried by cookiecutter (and why)

Name Description Injected in Services
project_name Name of your project 
jupyter_password Password to protect your Jupyter service Jupyter
postgres_db_password Password of standard postgres user Postgres
shared_db_password Password for shared database Airflow
Jupyter
Postgres
superset_db_password Password for superset database Postgres
Superset
superset_admin_password Password for superset admin user Superset
minio_access_key Access key for Minio store Airflow
Apistar
Jupyter
Minio
minio_secret_key Secret key for Minio store Airflow
Apistar
Jupyter
Minio

More Repositories

1

beyond-jupyter

🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
152
star
2

distribution-cheatsheet

📈📄👀A lookup repo for a variety of discrete and continuous distributions (incl. Beta, Binomial, Cauchy, Chi-squared, Geometric, Hypergeometric, Normal & Poisson)
Jupyter Notebook
51
star
3

beyond-jupyter-mooc

All the material from the Udemy course "Beyond Jupyter Notebooks"
Jupyter Notebook
16
star
4

corona-hackathon

🦠🤖🧪#WirVsVirus Corona-Crisis Hackathon - organized by the German government. SWAG (Smart Workforce Allocator Germany) helps to mitigate this imbalance and directly connects employer and employees, building on the principles ease of use, flexibility & optimized allocation.
JavaScript
12
star
5

advanced-statistics

Source code for the module "Advanced Statistics" 📊
Jupyter Notebook
8
star
6

meetif.ai

🥶👫🕸Social Knowledge Graph based on data from meetup.com to generate conversation opening iceabreakers (based on RDF(S), RML, Neo4j & other Technologies)
TypeScript
7
star
7

PySchool

Crossover Python Learning Material
Python
5
star
8

t-47

🧱📦🚀LEGO assembly data and knowledge graph for the T-47 Airspeeder set (incl. Data Set, Knowledeg Gaph & custom Plugins)
Shell
3
star
9

CI-testing

Testing a CI Pipeline
Python
3
star
10

metabolic-query-generator

🤖🧠❓Exam Preparation Query Generator around Human Biology and Metabolic Pathways, such as "What does FBP stand for?' or "Which enzyme is responsible for the transformation from glucose to glucose?"
Java
3
star
11

Superstack

Data Science & Visualization starter kit
3
star
12

nlp-feature-extractor

Feature extractors for NLP
Python
3
star
13

ml-homework

Homework from ML-class
Jupyter Notebook
3
star
14

MarchMadness

NCAA March Madness Competition 2018
Jupyter Notebook
3
star
15

SportsPrediction

Predicting Sport Event outcomes.
Jupyter Notebook
2
star
16

DataMining

Jupyter Notebook
1
star
17

whatsapp_language_model

Python
1
star
18

t-47-procedures

👾🕸🧮Custom Neo4j procedures to ease the construction of and interaction with the T-47 knowledge graph
Java
1
star
19

Programming-for-DataScience

Datascience Exercises in R & Python
Jupyter Notebook
1
star