• Stars
    star
    171
  • Rank 215,215 (Top 5 %)
  • Language
    Jupyter Notebook
  • Created about 1 year ago
  • Updated 7 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Data Science boot camp aims to make the field of data science accessible and understandable to a wide range of individuals, regardless of their background or expertise.

Data Science For EveryOne Mentorship and BootCamp Program

Data Science For Everyone Program is LuxDevHQ program that aims to make the field of data science accessible and understandable to a wide range of people, regardless of their background or expertise.

In the program, we recognize that data science has the potential to bring valuable insights and solutions to various domains and industries, and therefore, it is important to demystify and democratize this field.

Data science involves extracting knowledge and insights from large and complex datasets using various techniques, such as data mining, statistical analysis, machine learning, and visualization. Traditionally, data science has been associated with specialized skills and technical expertise, requiring a strong background in mathematics, statistics, programming, and domain knowledge.

This Free Bootcamp emphasizes the need for clear communication, intuitive visualizations, and user-friendly tools that enable individuals to explore and analyze data without requiring an in-depth understanding of complex algorithms or programming languages.

Overall, we are aiming to bridge the gap between technical experts and non-technical professionals, enabling a broader audience to leverage the power of data and make informed decisions based on evidence and insights derived from data analysis.

Possible Data Science Career Paths


1). Data Scientist.

  • Data scientists are responsible for collecting, cleaning, and analyzing large datasets to extract valuable insights and make data-driven decisions. They use various machine learning and statistical techniques to build predictive models and solve complex problems.

  • Data scientists often work closely with business stakeholders to identify opportunities for leveraging data to drive business growth.

2). Data Anlayst.

  • Data analysts focus on examining data to provide actionable insights to their organizations. They perform data cleaning, data visualization, and basic statistical analysis to help businesses understand trends, patterns, and make informed decisions.
  • Data analysts may work in various industries such as finance, marketing, or healthcare.

3). Data Engineer.

  • Data engineers are responsible for the design, construction, and maintenance of data pipelines and infrastructure. They ensure that data is collected, stored, and made accessible for analysis by data scientists and analysts.
  • Data engineers work with tools like Hadoop, Spark, and databases to manage and process large volumes of data efficiently.

4). Data Architect.

  • Data architects design the overall structure and organization of data within an organization. They create data models, define data standards, and ensure data is stored, integrated, and accessed effectively.
  • Data architects play a critical role in establishing data governance and ensuring data quality.

Note:

  • These are just a few of the many career paths within the data science and analytics field. Depending on your interests and skills, you may also consider roles such as Machine Learning Engineer, Business Intelligence Analyst, Statistician, or even specialized roles like Natural Language Processing (NLP) Engineer or Computer Vision Engineer. The field of data science is continually evolving, so there are always new opportunities and roles emerging as technology advances and businesses become more data-driven. It's important to choose a path that aligns with your interests and career goals.

  • Also note that we will not be able to cover all the topics and concepts but we will set a solid foundation for your data career.

Course Overview - Program information:

  • Duration: 5 Weeks.

  • Learning Mode: Online with weekly project and technical article.

Week 1: Learn the fundamentals of data science.

  • Understand the key concepts of data science and the possible data science career paths.
  • Familiarize yourself with fundamental, that is statistical, mathematical, and programming concepts.
  • Learn the basics of Python and SQL for data manipulation, wrangling, and analysis Introduction to SQL and Python for data science.
  • Understand the fundamentals of data science, statistics, probability, linear algebra, calculus, Python and SQL programming languages.
  • Introduce specific data visualization tools like Matplotlib and Seaborn. Provide hands-on exercises or tutorials for students to create visualizations using these tools.

Tools you will learn on week 1: Python, SQL, pandas, numpy, matplotlib, seaborn and statistics module which was introduced in Python 3.4


Week 1 Classes:

(i). Wednesday, September 27th, 2023 8:00 PM EAT.

(ii) Saturday, September 30th, 2023 10:00 AM EAT.


Week 1 Article: Data Science for Beginners: 2023 - 2024 Complete Roadmap.


Week 1 Projects:

Question 1). Imagine you're working with Sprint, one of the biggest telecom companies in the USA. They're really keen on figuring out how many customers might decide to leave them in the coming months. Luckily, they've got a bunch of past data about when customers have left before, as well as info about who these customers are, what they've bought, and other things like that.

So, if you were in charge of predicting customer churn how would you go about using machine learning to make a good guess about which customers might leave? Like, what steps would you take to create a machine learning model that can predict if someone's going to leave or not?

Question 2). Letโ€™s say youโ€™re a Product Data Scientist at Instagram. How would you measure the success of the Instagram TV product?


Week 2: Learn Basics Data Science Concepts.

  • Learn about data visualization, exploratory data analysis (EDA), and basic statistical measures.

  • Learn about exploratory data analysis, feature engineering, and modelling using real-world data.

  • Learn how problem statements, developing KPIs, Working and collaborating with a remote team, communacation skills, problem solving skills, and to write modern data resume.

    Tools you will learn on week 1: Python, SQL, Pyspark, Problem Solving, and Non-Technical Concepts


Week 2 Classes:

(i). Wednesday, October 4th, 2023 8:00 PM EAT.

  • Lucille Wanjiku, Data Scientist.
  • Recording: Not Available Yet

(ii) Saturday, October 7th, 2023 10:00 AM EAT.

  • Harun Mbaabu Mwenda, Analytics and Data Engineeer.
  • Recording: Not Available Yet

Week 2 Article: Exploratory Data Analysis using Data Visualization Techniques.


Week 2 Project:

Question 1). Read through this case study and solve it https://statso.io/rfm-analysis-case-study/

Question 2). Letโ€™s say we want to build a model to predict booking prices on Airbnb. Between linear regression and random forest regression, which model would perform better and why?


Week 3: Teach Someone Data Science #TeachSomeoneDataScience.

In week 3, you will have some time to explore the different data science career paths and decide which one you want to specialize in. As an assignment, you will find someone new to data science and teach them about it for at least 30 minutes. You will also pick a specific topic in data science and write about it, such as creating a data analysis roadmap on Twitter or LinkedIn.


Week 4: Learn Intermediate Data Science Concepts and Time Series Modeling.

  • Learn dimensionality reduction techniques, a way to reduce the number of features in a dataset without losing too much information. This will be helpful in improving the performance of machine learning models.
  • Learn feature engineering, the process of transforming raw data into features that are more informative and useful for machine learning models.
  • Learn ensemble learning, a technique that combines multiple machine learning models to improve the overall performance.
  • Learn neural networks, a type of machine learning model that can learn complex relationships between features and labels.
  • LearnTime series analysis modelling, the process of analyzing data that is collected over time. This can be used to forecast future trends or identify patterns in the data.

Week 4 Classes:

(i). Wednesday, October 18th, 2023 8:00 PM EAT.

  • ********************, Data Scientist.
  • Recording: Not Available Yet

(ii) Saturday, October 21th, 2023 10:00 AM EAT.

  • ********************, Analytics and Data Engineer.
  • Recording: Not Available Yet

Week 4 Article: The Complete Guide to Time Series Models

Week 4 Project:

Using the Craigslist Vehicles Dataset available on Kaggle (https://www.kaggle.com/datasets/mbaabuharun/craigslist-vehicles), we'd like you to create a Time-Series Model following the approach outlined below.

Here are the key steps:

  • Start by addressing missing values in the dataset. You can handle this by filling in missing values with the median for numerical columns and the mode for categorical columns.
  • Ensure that the data types of the columns are appropriate. Specifically, make sure to convert the 'posting_date' column to a datetime data type.
  • Utilize the 'posting_date' column to create a datetime index for the dataset. This will facilitate the analysis of temporal patterns.
  • With clean data, explore it using various visualizations and statistical analysis techniques. This step is crucial for understanding temporal patterns, identifying seasonal trends, and analyzing demand-supply dynamics by region and vehicle type.
  • Build the time-series chart.
  • Finally, create a GitHub Repository and push your work there, also document your process through each of the steps and demonstrate your understanding by implementing them on the dataset.

Week 5 Project: Exploring Data Engineering and Analytics Engineering with Harun Mbaabu.

  • Clearly differentiate between Data Engineering and Analytics Engineering.
  • Learn about ETL and ELT and when is the Best Time to use which method.
  • Master the modern data stack, how to optimize Python and SQL code, track metrics, and impact as a team play in a data team.
  • Learn how to write modern data profession CVs/resumes, where you apply for data jobs, how to get started freelancing, and how to optimize your chances of being hired.

Week 5 Article: Data Engineering for Beginners: A Step-by-Step Guide


Week 5 Project:

1). Project 1:

As a lead data engineer at Data Science East Africa, you are responsible for building a data engineering pipeline to move weather data from a public API to Azure Synapse Analytics. You will then use Power BI to access the data from Azure Synapse Analytics and create a modern dashboard.

(i). What are the best practices for moving data from a public API to a data lake?

(ii). How can we ensure the security and reliability of the data pipeline?

(iii). Implement this project, optimize your process, and create a GitHub repository where you will document all the processes including Screenshort, and a short video explaining the whole process.

2). Project 5.

In week 4, we performed time series modelling on the Craigslist vehicles dataset, which is available on Kaggle at https://www.kaggle.com/datasets/mbaabuharun/craigslist-vehicles. This project builds on that work. You will need to download the dataset, copy the data using SQL to a local PostgreSQL database, move the data from your local database to Snowflake, perform data transformation with DBT (data build tool), and use your preferred data visualization tool to create a report and dashboard.


Note.

To be eligible for a certificate of completion, you must write four articles of at least 400 words each and complete all four projects. I know you can do this! You are talented nerds and I believe in you. Writing these articles will help you improve your writing skills and learn a lot about the subject matter. They will also be a valuable addition to your portfolio.

Important Road Maps.

1). Preparing for a Data Analyst career? Here's a roadmap:

2). Complete Guide to Becoming a Data Scientist

3). Analytics Engineer Road-map With Free Resources : Modern Data Stack

4). Ultimate Data Engineering Road Map: Become a Data Engineer In 2023.

Bonus: Different Ways to Make Money in Data Science.

This is a very intense program that requires a commitment of up to 20 hours per week. Only those who are willing to put in the time and effort will be able to complete it. By the end of the bootcamp, you will be in a position to build basic end-to-end data projects.

More Repositories

1

Login-System-with-Python-Flask-and-MySQL

Login System with Python framework Flask and MySQL database system.
Python
86
star
2

Python-BoootCamp-Guide

Python BoootCamp by Lux Academy and Data Science East Africa Guide
Jupyter Notebook
69
star
3

BeyonicAPI-Python-Examples

Beyonic API Python official client library simplified examples using Flask, Django and Fast API.
Python
45
star
4

Basics-Of-Computer-Programming

This repository provides a comprehensive introduction to computer programming for beginners.
31
star
5

Awesome-Analytics-Engineering

An awesome Analytics Engineering repository to learn and apply for real world problems.
29
star
6

Fast-API-and-Docker-BootCamp

This repository contains learning resources for Python Fast API Framework and Docker, Build High Performing Apps With Python BootCamp by Lux Academy and Data Science East Africa.
Python
26
star
7

Data-Cleaning-With-Python

Cleaning data in data scince (Detecting Missing Values)
Python
23
star
8

Data-Structures-and-Algorithms

This repository contains over 200 data structures and algorithm questions and their solution for interview preparation from different websites and Cracking the Coding Interview book by Laakmann Gayle McDowell
Python
22
star
9

Adding-Storyblok-to-NextJS-like-a-Pro

Adding Headless CMS to NextJS like a Pro, this repository contains code examples and guide on how to integrate Storyblok, a headless CMS to NextJS.
JavaScript
21
star
10

Data-Processing-With-Python-Guide

Jupyter Notebook
19
star
11

Clean-JavaScript-Code

Software engineering principles, from Robert C. Martin's book Clean Code, adapted for JavaScript. This is not a style guide. It's a guide to producing readable, reusable, and refactorable software in JavaScript.
16
star
12

Deploy-Machine-Learning-Model-WIth-Flask-and-Deta-Space

A step-by-step guide to build a credit card fraud detection machine learning model and deploying it using Flask and deta.sh.
Python
15
star
13

Flask-and-Docker-Application-Demo

Lux Academy & Data Science East Africa Python Boot Camp, Building and Deploying Flask Application Using Docker Demo App.
Python
12
star
14

AUTO-EDA

Python library with code example to perform EDA and data cleaning.
Python
11
star
15

Machine-Learing-Models-Using-Python

Introduction To Machine Learning with Python All You Need To Know About Machine Learning with Python, with examples and use cases.
Python
11
star
16

Django-Instagram-clone

This is an instagram clone project developed using DJANGO2.22 Framework and Bootstrap 4, to replicate Original Instagram with the aim of demonstrating level of understanding of the various concepts in django.
Python
9
star
17

Data-Science-Ultimate-Guide

Data Science 2022/2023 Class Resources
Jupyter Notebook
8
star
18

100-Days-Of-JavaScript-Boot-Camp-By-Lux-Academy

100 Days of JavaScript Boot Camp by Lux Academy Learning Resources.
CSS
6
star
19

Data-Build-Tool-Ultimate-Guide

5
star
20

Location-Based-Web-App-With-Python

A Location-Based Web App With Django, GeoDjango,spatial database (PostgreSQL and PostGIS) and delopying it on Heroku.
5
star
21

Lux-Tech-Resume-Builder

Angular Resume Builder
TypeScript
4
star
22

common_python_algorithms.

Common algorithms Implemented Python.
Python
4
star
23

Lux-Voting-System

An online Voting System (online voting software) developed in Python microframework flask, JavaScript library reactjs, bootstrap,CSS and HTML.
Python
4
star
24

Flask_voting_app

A votiing system developed in python flask and JavaScript
HTML
3
star
25

Factorial-using-C-.

c++ code to claculate Factorial of a given number Design and analysis of algorithm
C++
3
star
26

JavaScript-2021

JavaScript
3
star
27

django_log-in-project

a log-in project written in django 2.2 with form validation, python3.7, css,html, javascript and (jquery and bootstrap libraries)
HTML
3
star
28

2022-Python3-Essentials-Bootcamp-Curriculum

3
star
29

Personal-Resume.

A simple but functional resume
HTML
3
star
30

Machine-Learning-Using-Tensorflow

alc challange two but i made a chatapp
Python
3
star
31

SQLAlchemy-for-Database-Operations-In-Python

Jupyter Notebook
3
star
32

lux-school

JavaScript
3
star
33

DSC-Murang-a-University-of-Technology-Website

MaProD club Developers student Club official website.
CSS
3
star
34

Guide-to-Acing-SQL-Interviews

Comprehensive guide to excel in SQL interviews, covering concepts, queries, performance tuning, and more.
3
star
35

pyspark-machine-learning

Building Machine Learning Models with PySpark's pyspark.ml Library: A Comprehensive Guide
Jupyter Notebook
3
star
36

creating-a-landing-page-Using-Django

Python css and Html
HTML
3
star
37

Fetch-API-demo

CSS
3
star
38

Complete-Guide-for-SDE-Preparation-Sheet

SDE Sheet is a list of the most important topics or the most popular questions that are asked in the Software Development Engineer Interviews.
3
star
39

DataScience-and-Machine-Laerning-at-MurangaUniversity-of-Technology.-

3
star
40

Deploying-ML-Model-Using-Flask

Jupyter Notebook
2
star
41

Javascript-chaos-game

A web based game developed in HTML and JavaScript
JavaScript
2
star
42

smartcard-challange

Andele dev-training first challange HTML,CSS and ECMAScript6
JavaScript
2
star
43

Data-Analyst-Assessment

Jupyter Notebook
2
star
44

DSEA-EDA-WITH-PYTHON-CLASS

2
star
45

PyPractice

Python
2
star
46

python3-args-and-Kwargs

Understanding Args and Kwargs in Python3
2
star
47

Lux-Academy

A personal profile lux_cv
CSS
2
star
48

Full-Stack-In-Flask

Python for Full-Stack web development Using Flask Microframework.
HTML
2
star
49

React-Demo

JavaScript
2
star
50

Lux-FLask-Version-1.00

Lux ACademy flask version
JavaScript
2
star
51

python-data-types

Python
2
star
52

Tic-Tac-Toe-Angular-andTypeScript

TypeScript
2
star
53

BLOG-Django-2.2-blog

A simple fully functional blog developed in python(Django 2.2), bootstrap 4, css and Html.
Python
2
star
54

form_validation_using_Jquery

HTML
2
star
55

Statistics_using_Python

Major calculation in statistics using python
Python
2
star
56

Harun_MbaabuFlak_resume.

Flask(Python3 Micro framework and W3.css) Resume.
HTML
2
star
57

Data-Science-in-Action-DSAIC-Hands-on-Hackathon-Experience

2
star
58

Geo-Search--DevC-program

Geo-Search is a tool that allows people to search for places and extract information such as timezone, map and local weather conditions like temperature, wind speed, humidity, and precipitation.
2
star
59

python-data-visualization-plots

An introduction to data visualization using python Matplotlib
Python
2
star
60

Flask_python_blog

A functional blog developed in python (flask frameweork), HTML & CSS, JavaScript and W3.CSS framework
HTML
2
star
61

Challange

CSS
1
star
62

ML-JS-Demo

JavaScript
1
star
63

MaProD-Website-Devevelopment-tutorials.

JavaScript
1
star
64

Yahoo-Finance-Data-Analysis-using-Apache-Spark

Jupyter Notebook
1
star
65

DataScience-East-Africa-UI

CSS
1
star
66

MaProD-JavaScript-and-ES6-Meet_Up-Talk

HTML
1
star
67

React-Tutorial

JavaScript
1
star
68

PythonFastAPI-

Python
1
star
69

Craigslist-Used-Vehicles--Time-Series-Project

Jupyter Notebook
1
star
70

Perfomance-Dashboard

Performance Dashboard Challenge.
Jupyter Notebook
1
star
71

AIRPLANE_DETECTION_FOR_SATELLITES

Python
1
star
72

covid-19-estimator

JavaScript
1
star
73

Netpap-Hotspot-app

HTML
1
star
74

KeDevs

CSS
1
star
75

JavaScript-Queue-Data-Structure

JavaScript Queue Data Structure
JavaScript
1
star
76

flask-MySql

Python
1
star
77

Dom-Assessment

HTML
1
star
78

Intermediate-Coding-Challenges-

Python
1
star
79

HarunMbaabu

1
star
80

FastAPI-Projects-Demos

Python
1
star
81

Modern-Data-Satck-Stack-with-Airbyte-Snowflake-and-DBT

1
star
82

Object-Detection-Project

Object Detection Project
Jupyter Notebook
1
star
83

Data-Engineering-Ultimate-Guide

1
star
84

JavaScript-chat-bot

JavaScript
1
star
85

Hackerrank-SQL-Questions-and-Solutions

1
star
86

Calc-Project-JavaScript-BootCamp

CSS
1
star
87

Graph-in-Python

Python
1
star
88

Web-Scraping-with-Beautiful-Soup

Jupyter Notebook
1
star
89

Portfolio-Resume---JAVASCRIPT

A simple portfolio using react Js
HTML
1
star
90

Practice

Practice
Jupyter Notebook
1
star
91

JavaScript-Boot-Camp-assessment-

JavaScript
1
star
92

NodeJS-Chat-Bot

1
star
93

Getting-Started-With-PySpark-

1
star
94

MaProD-Murang-a-University-of-Technology-CodingCHALLENGE

A coding challange held by mathematics and programming for development coding challenge questions.
Java
1
star
95

Modern-Data-Engineering-Hackathon-Guide

1
star
96

Progressive-Web-App

JavaScript
1
star
97

Fake-News-Detection-Python-Project

Jupyter Notebook
1
star
98

JavaScriptCheatSheet

JavaScript
1
star
99

maprodchallenge

Nice Challlenge I took part in
CSS
1
star
100

Visualize-Decision-Trees-using-Matplotlib

Jupyter Notebook
1
star