Learning Spark 2nd Edition
Welcome to the GitHub repo for Learning Spark 2nd Edition.
Chapters 2, 3, 6, and 7 contain stand-alone Spark applications. You can build all the JAR files for each chapter by running the Python script: python build_jars.py
Alternatively, cd into a chapter directory and build its JARs as described in that chapter's README. Also add $SPARK_HOME/bin to your $PATH so that you don't have to prefix spark-submit with $SPARK_HOME/bin/ when running these stand-alone applications.
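As a minimal sketch of the PATH setup described above, the following lines could go in your shell profile. The fallback location /opt/spark is only an assumption for illustration; point SPARK_HOME at wherever Spark is actually installed on your machine.

```shell
# Assumption: Spark is unpacked at /opt/spark unless SPARK_HOME is already set.
# Replace the fallback path with your real Spark installation directory.
export SPARK_HOME="${SPARK_HOME:-/opt/spark}"

# Put Spark's bin directory at the front of PATH so that spark-submit
# can be invoked by name instead of as $SPARK_HOME/bin/spark-submit.
export PATH="$SPARK_HOME/bin:$PATH"
```

With this in place, running `command -v spark-submit` in a new shell should report the copy under $SPARK_HOME/bin, provided Spark is installed there.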
For all the other chapters, we have provided notebooks in the notebooks folder. We have also included notebook equivalents for a few of the stand-alone Spark applications in the aforementioned chapters.
Have Fun, Cheers!