Data Science boot camp aims to make the field of data science accessible and understandable to a wide range of individuals, regardless of their background or expertise.

Data Science For Everyone Mentorship and Bootcamp Program

The Data Science For Everyone Program is a LuxDevHQ program that aims to make the field of data science accessible and understandable to a wide range of people, regardless of their background or expertise.

In the program, we recognize that data science has the potential to bring valuable insights and solutions to various domains and industries, and therefore, it is important to demystify and democratize this field.

Data science involves extracting knowledge and insights from large and complex datasets using various techniques, such as data mining, statistical analysis, machine learning, and visualization. Traditionally, data science has been associated with specialized skills and technical expertise, requiring a strong background in mathematics, statistics, programming, and domain knowledge.

This free bootcamp emphasizes the need for clear communication, intuitive visualizations, and user-friendly tools that enable individuals to explore and analyze data without requiring an in-depth understanding of complex algorithms or programming languages.

Overall, we are aiming to bridge the gap between technical experts and non-technical professionals, enabling a broader audience to leverage the power of data and make informed decisions based on evidence and insights derived from data analysis.

Possible Data Science Career Paths


1). Data Scientist.

  • Data scientists are responsible for collecting, cleaning, and analyzing large datasets to extract valuable insights and make data-driven decisions. They use various machine learning and statistical techniques to build predictive models and solve complex problems.

  • Data scientists often work closely with business stakeholders to identify opportunities for leveraging data to drive business growth.

2). Data Analyst.

  • Data analysts focus on examining data to provide actionable insights to their organizations. They perform data cleaning, data visualization, and basic statistical analysis to help businesses understand trends and patterns and make informed decisions.
  • Data analysts may work in various industries such as finance, marketing, or healthcare.

3). Data Engineer.

  • Data engineers are responsible for the design, construction, and maintenance of data pipelines and infrastructure. They ensure that data is collected, stored, and made accessible for analysis by data scientists and analysts.
  • Data engineers work with tools like Hadoop, Spark, and databases to manage and process large volumes of data efficiently.

4). Data Architect.

  • Data architects design the overall structure and organization of data within an organization. They create data models, define data standards, and ensure data is stored, integrated, and accessed effectively.
  • Data architects play a critical role in establishing data governance and ensuring data quality.

Note:

  • These are just a few of the many career paths within the data science and analytics field. Depending on your interests and skills, you may also consider roles such as Machine Learning Engineer, Business Intelligence Analyst, Statistician, or even specialized roles like Natural Language Processing (NLP) Engineer or Computer Vision Engineer. The field of data science is continually evolving, so there are always new opportunities and roles emerging as technology advances and businesses become more data-driven. It's important to choose a path that aligns with your interests and career goals.

  • Also note that we will not be able to cover all the topics and concepts but we will set a solid foundation for your data career.

Course Overview - Program information:

  • Duration: 5 Weeks.

  • Learning Mode: Online, with a weekly project and technical article.

Week 1: Learn the fundamentals of data science.

  • Understand the key concepts of data science and the possible data science career paths.
  • Familiarize yourself with the fundamentals: statistical, mathematical, and programming concepts.
  • Learn the basics of Python and SQL for data manipulation, wrangling, and analysis (an introduction to SQL and Python for data science).
  • Understand the fundamentals of data science, statistics, probability, linear algebra, calculus, and the Python and SQL programming languages.
  • Get introduced to data visualization tools such as Matplotlib and Seaborn, with hands-on exercises or tutorials for creating visualizations using these tools.

Tools you will learn in week 1: Python, SQL, pandas, NumPy, Matplotlib, Seaborn, and the statistics module (introduced in Python 3.4).
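
As a small taste of the statistics module mentioned above, here is a minimal sketch that computes a few basic summary measures. The `daily_sales` figures are made up for illustration and are not part of the course material.

```python
import statistics

# Hypothetical daily sales figures, invented for this example.
daily_sales = [120, 135, 150, 110, 135, 160, 145]

# Basic measures of central tendency and spread from the standard library.
summary = {
    "mean": statistics.mean(daily_sales),
    "median": statistics.median(daily_sales),
    "mode": statistics.mode(daily_sales),
    "stdev": statistics.stdev(daily_sales),  # sample standard deviation
}

for name, value in summary.items():
    print(f"{name}: {value:.2f}")
```

The same measures are available in pandas and NumPy, which are usually more convenient once the data lives in a DataFrame or array.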


Week 1 Classes:

(i). Wednesday, September 27th, 2023 8:00 PM EAT.

(ii). Saturday, September 30th, 2023 10:00 AM EAT.


Week 1 Article: Data Science for Beginners: 2023 - 2024 Complete Roadmap.


Week 1 Projects:

Question 1). Imagine you're working with Sprint, one of the biggest telecom companies in the USA. They're really keen on figuring out how many customers might decide to leave them in the coming months. Luckily, they've got a bunch of past data about when customers have left before, as well as info about who these customers are, what they've bought, and other things like that.

So, if you were in charge of predicting customer churn, how would you go about using machine learning to make a good guess about which customers might leave? Like, what steps would you take to create a machine learning model that can predict if someone's going to leave or not?

Question 2). Let’s say you’re a Product Data Scientist at Instagram. How would you measure the success of the Instagram TV product?


Week 2: Learn Basic Data Science Concepts.

  • Learn about data visualization, exploratory data analysis (EDA), and basic statistical measures.

  • Learn about exploratory data analysis, feature engineering, and modelling using real-world data.

  • Learn about problem statements, developing KPIs, working and collaborating with a remote team, communication skills, problem-solving skills, and how to write a modern data resume.

    Tools you will learn in week 2: Python, SQL, PySpark, problem solving, and non-technical concepts.


Week 2 Classes:

(i). Wednesday, October 4th, 2023 8:00 PM EAT.

  • Lucille Wanjiku, Data Scientist.
  • Recording: Not Available Yet

(ii). Saturday, October 7th, 2023 10:00 AM EAT.

  • Harun Mbaabu Mwenda, Analytics and Data Engineer.
  • Recording: Not Available Yet

Week 2 Article: Exploratory Data Analysis using Data Visualization Techniques.


Week 2 Project:

Question 1). Read through this case study and solve it: https://statso.io/rfm-analysis-case-study/

Question 2). Let’s say we want to build a model to predict booking prices on Airbnb. Between linear regression and random forest regression, which model would perform better and why?


Week 3: Teach Someone Data Science #TeachSomeoneDataScience.

In week 3, you will have some time to explore the different data science career paths and decide which one you want to specialize in. As an assignment, you will find someone new to data science and teach them about it for at least 30 minutes. You will also pick a specific topic in data science and write about it, such as creating a data analysis roadmap on Twitter or LinkedIn.


Week 4: Learn Intermediate Data Science Concepts and Time Series Modeling.

  • Learn dimensionality reduction techniques, a way to reduce the number of features in a dataset without losing too much information. This will be helpful in improving the performance of machine learning models.
  • Learn feature engineering, the process of transforming raw data into features that are more informative and useful for machine learning models.
  • Learn ensemble learning, a technique that combines multiple machine learning models to improve the overall performance.
  • Learn neural networks, a type of machine learning model that can learn complex relationships between features and labels.
  • Learn time series analysis and modelling, the process of analyzing data that is collected over time. This can be used to forecast future trends or identify patterns in the data.
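
To make the ensemble learning idea above concrete, here is a minimal sketch of majority voting. The three "models" are deliberately simple hand-written rules, and the customer fields (`tenure_months`, `complaints`, `monthly_charge`) are hypothetical; in practice the voters would be trained models.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the most common label among the individual predictions."""
    return Counter(predictions).most_common(1)[0][0]


# Three hypothetical rule-based "models" standing in for trained classifiers.
def by_tenure(customer):
    return "churn" if customer["tenure_months"] < 6 else "stay"


def by_complaints(customer):
    return "churn" if customer["complaints"] >= 3 else "stay"


def by_charge(customer):
    return "churn" if customer["monthly_charge"] > 80 else "stay"


def ensemble_predict(customer, models):
    """Ask every model for a label and combine the answers by majority vote."""
    return majority_vote([model(customer) for model in models])


# An odd number of voters avoids ties between the two labels.
models = [by_tenure, by_complaints, by_charge]
new_customer = {"tenure_months": 3, "complaints": 0, "monthly_charge": 90}
prediction = ensemble_predict(new_customer, models)
print(prediction)
```

Voting is only one way to combine models; bagging, boosting, and stacking follow the same principle of pooling several imperfect predictors.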

Week 4 Classes:

(i). Wednesday, October 18th, 2023 8:00 PM EAT.

  • ********************, Data Scientist.
  • Recording: Not Available Yet

(ii). Saturday, October 21st, 2023 10:00 AM EAT.

  • ********************, Analytics and Data Engineer.
  • Recording: Not Available Yet

Week 4 Article: The Complete Guide to Time Series Models

Week 4 Project:

Using the Craigslist Vehicles Dataset available on Kaggle (https://www.kaggle.com/datasets/mbaabuharun/craigslist-vehicles), we'd like you to create a Time-Series Model following the approach outlined below.

Here are the key steps:

  • Start by addressing missing values in the dataset. You can handle this by filling in missing values with the median for numerical columns and the mode for categorical columns.
  • Ensure that the data types of the columns are appropriate. Specifically, make sure to convert the 'posting_date' column to a datetime data type.
  • Utilize the 'posting_date' column to create a datetime index for the dataset. This will facilitate the analysis of temporal patterns.
  • With clean data, explore it using various visualizations and statistical analysis techniques. This step is crucial for understanding temporal patterns, identifying seasonal trends, and analyzing demand-supply dynamics by region and vehicle type.
  • Build the time-series chart.
  • Finally, create a GitHub repository and push your work there. Document your process for each step and demonstrate your understanding by implementing the steps on the dataset.
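
The data preparation steps above could be sketched with pandas roughly as follows. This is one possible approach under stated assumptions, not a reference solution: only the `posting_date` column name comes from the project brief, while `price`, `manufacturer`, and the sample rows are assumptions made for illustration.

```python
import pandas as pd


def prepare_listings(df: pd.DataFrame, date_col: str = "posting_date") -> pd.DataFrame:
    """Fill missing values, parse the date column, and index the frame by date."""
    df = df.copy()

    # Parse the posting date; utc=True normalises differing UTC offsets.
    df[date_col] = pd.to_datetime(df[date_col], errors="coerce", utc=True)
    df = df.dropna(subset=[date_col])  # rows without a usable date cannot be indexed

    numeric_cols = list(df.select_dtypes(include="number").columns)
    other_cols = [c for c in df.columns if c not in numeric_cols and c != date_col]

    # Median for numerical columns, mode for categorical columns.
    for col in numeric_cols:
        df[col] = df[col].fillna(df[col].median())
    for col in other_cols:
        modes = df[col].mode(dropna=True)
        if not modes.empty:
            df[col] = df[col].fillna(modes.iloc[0])

    return df.set_index(date_col).sort_index()


# Tiny made-up sample standing in for the real dataset.
sample = pd.DataFrame(
    {
        "posting_date": [
            "2021-04-01T10:00:00-0500",
            "2021-04-01T12:30:00-0500",
            "2021-04-03T09:15:00-0500",
            "2021-04-03T18:45:00-0500",
        ],
        "price": [5000.0, None, 7000.0, 9000.0],
        "manufacturer": ["ford", "ford", None, "toyota"],
    }
)

indexed = prepare_listings(sample)
daily_counts = indexed.resample("D").size()  # number of listings per day
print(daily_counts)
```

With a datetime index in place, a series such as `daily_counts` can be charted (for example with `daily_counts.plot()`, which requires Matplotlib) to look for temporal patterns.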

Week 5: Exploring Data Engineering and Analytics Engineering with Harun Mbaabu.

  • Clearly differentiate between Data Engineering and Analytics Engineering.
  • Learn about ETL and ELT and when to use each method.
  • Master the modern data stack and learn how to optimize Python and SQL code, track metrics, and make an impact as a team player in a data team.
  • Learn how to write modern data professional CVs/resumes, where to apply for data jobs, how to get started freelancing, and how to optimize your chances of being hired.
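
To illustrate the ETL versus ELT distinction above, here is a minimal sketch using Python's built-in sqlite3 module as a stand-in for a real warehouse. The table names and the raw weather readings are hypothetical. In the ETL path the rows are cleaned in Python before loading; in the ELT path the raw rows are loaded first and then transformed with SQL inside the database.

```python
import sqlite3

# Hypothetical raw readings: messy city names, temperatures stored as text.
RAW_ROWS = [("nairobi", "21.5"), ("Mombasa ", "30.1"), ("KISUMU", "27.0")]


def etl(conn, rows):
    """ETL: transform in Python first, then load only the cleaned rows."""
    cleaned = [(city.strip().title(), float(temp)) for city, temp in rows]
    conn.execute("CREATE TABLE weather_etl (city TEXT, temp_c REAL)")
    conn.executemany("INSERT INTO weather_etl VALUES (?, ?)", cleaned)


def elt(conn, rows):
    """ELT: load the raw rows untouched, then transform with SQL in the database."""
    conn.execute("CREATE TABLE weather_raw (city TEXT, temp_c TEXT)")
    conn.executemany("INSERT INTO weather_raw VALUES (?, ?)", rows)
    conn.execute(
        "CREATE TABLE weather_elt AS "
        "SELECT UPPER(SUBSTR(TRIM(city), 1, 1)) || LOWER(SUBSTR(TRIM(city), 2)) AS city, "
        "CAST(temp_c AS REAL) AS temp_c "
        "FROM weather_raw"
    )


def fetch(conn, table):
    """Read a cleaned table back in a stable order."""
    return conn.execute(f"SELECT city, temp_c FROM {table} ORDER BY city").fetchall()


conn = sqlite3.connect(":memory:")
etl(conn, RAW_ROWS)
elt(conn, RAW_ROWS)
etl_rows = fetch(conn, "weather_etl")
elt_rows = fetch(conn, "weather_elt")
print(etl_rows)
print(elt_rows)
```

Both paths end with the same cleaned table. The practical difference is where the transformation runs and whether the raw data is kept: ELT retains `weather_raw`, so the transformation can be revised and rerun later without extracting the data again.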

Week 5 Article: Data Engineering for Beginners: A Step-by-Step Guide


Week 5 Projects:

1). Project 1:

As a lead data engineer at Data Science East Africa, you are responsible for building a data engineering pipeline to move weather data from a public API to Azure Synapse Analytics. You will then use Power BI to access the data from Azure Synapse Analytics and create a modern dashboard.

(i). What are the best practices for moving data from a public API to a data lake?

(ii). How can we ensure the security and reliability of the data pipeline?

(iii). Implement this project, optimize your process, and create a GitHub repository where you document the whole process, including screenshots and a short video explaining it.

2). Project 2.

In week 4, we performed time series modelling on the Craigslist vehicles dataset, which is available on Kaggle at https://www.kaggle.com/datasets/mbaabuharun/craigslist-vehicles. This project builds on that work. You will need to download the dataset, copy the data into a local PostgreSQL database using SQL, move the data from your local database to Snowflake, perform data transformation with dbt (data build tool), and use your preferred data visualization tool to create a report and dashboard.


Note.

To be eligible for a certificate of completion, you must write four articles of at least 400 words each and complete all four projects. I know you can do this! You are talented nerds and I believe in you. Writing these articles will help you improve your writing skills and learn a lot about the subject matter. They will also be a valuable addition to your portfolio.

Important Road Maps.

1). Preparing for a Data Analyst career? Here's a roadmap:

2). Complete Guide to Becoming a Data Scientist

3). Analytics Engineer Road-map With Free Resources : Modern Data Stack

4). Ultimate Data Engineering Road Map: Become a Data Engineer In 2023.

Bonus: Different Ways to Make Money in Data Science.

This is a very intense program that requires a commitment of up to 20 hours per week. Only those who are willing to put in the time and effort will be able to complete it. By the end of the bootcamp, you will be in a position to build basic end-to-end data projects.
