• Stars
    star
    357
  • Rank 119,149 (Top 3 %)
  • Language
    Python
  • License
    MIT License
  • Created over 4 years ago
  • Updated 4 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Apache Spark 3 - Spark Programming in Python for Beginners

Apache Spark 3 - Spark Programming in Python for Beginners

This is the central repository for all the materials related to Apache Spark 3 - Spark Programming in Python for Beginners
Course by Prashant Pandey.
You can get the full course at Apache Spark Course @ Udemy.

Apache Spark 3 - Spark Programming in Python for Beginners

Description

I am creating Apache Spark 3 - Spark Programming in Python for Beginners course to help you understand the Spark programming and apply that knowledge to build data engineering solutions. This course is example-driven and follows a working session like approach. We will be taking a live coding approach and explain all the needed concepts along the way.

Who should take this Course?

I designed this course for software engineers willing to develop a Data Engineering pipeline and application using the Apache Spark. I am also creating this course for data architects and data engineers who are responsible for designing and building the organization’s data-centric infrastructure. Another group of people is the managers and architects who do not directly work with Spark implementation. Still, they work with the people who implement Apache Spark at the ground level.

Spark and source code version

This Course is using the Apache Spark 3.x. I have tested all the source code and examples used in this Course on Apache Spark 3.0.0 open-source distribution.

More Repositories

1

ApacheKafkaTutorials

Example Code for Kafka Tutorials @ Learning Journal
Java
175
star
2

Kafka-Streams-Real-time-Stream-Processing

This is the central repository for all materials related to Kafka Streams : Real-time Stream Processing! Book by Prashant Pandey.
Java
160
star
3

Spark-Streaming-In-Python

Apache Spark 3 - Structured Streaming Course Material
Python
120
star
4

SparkProgrammingInScala

Apache Spark Course Material
Scala
84
star
5

Apache-Kafka-For-Absolute-Beginners

This is the central repository for all the materials related to Apache Kafka For Absolute Beginners Course by Prashant Pandey.
Java
72
star
6

Spark-Tutorials

Code and Notebooks for Spark Tutorials for Learning Journal @ Youtube
Jupyter Notebook
55
star
7

Spark-Streaming-In-Scala

Apache Spark 3 - Structured Streaming Course Material
Scala
42
star
8

Kafka-Streams-Master-Class

41
star
9

Apache-Spark-and-Databricks-Stream-Processing-in-Lakehouse

Python
34
star
10

ScalaTutorials

Initial Commit
Jupyter Notebook
27
star
11

HadoopTutorials

Hadoop tutorial Files. For detailed Tutorials visit www.youtube.com/learningjournalin
Java
26
star
12

Kafka-Streams-with-Spring-Cloud

Java
24
star
13

Python-Foundation-Course

Jupyter Notebook
20
star
14

apache-kafka-in-python

Python
9
star
15

Confluent-Kafka-with-Spring-Boot

Java
7
star
16

Codility-Test-Scala-Solutions

Solutions for Codility Programming Problems in Scala
Scala
7
star
17

NumPy-Crash-Course

This is the central repository for all the materials related to NumPy Crash Course by Prashant Pandey.
Jupyter Notebook
4
star
18

aws-certified-cloud-practitioner

Java
3
star
19

AivenEx

Aiven Platform Demo
Java
3
star
20

SBDL

Python
3
star
21

databricks-course

1
star
22

Azure-Databricks-for-Data-Engineers-Resources

1
star