Demo notebooks for Apache Zeppelin
Update your Zeppelin instance with all notebooks in this repo:
- If you are using hortonworks sandbox, you can execute command below to get latest notebooks:
curl -sSL https://raw.githubusercontent.com/hortonworks-gallery/zeppelin-notebooks/master/update_all_notebooks.sh | sudo -u zeppelin -E sh
- Please note that if you have changed or created your own notebooks they will be moved to a folder: /opt/incubator-zeppelin/notebook/old_yyyyMMdd-HHmmss, you can copy them back to /opt/incubator-zeppelin/notebook/ directory
Importing specific notebooks:
Copy the "json" link URL from the table below and paste it into Zeppelin's import from URL tool.
Adding notebooks:
You can add notebooks by making Github pull request, but we request you also update the table below by adding a record for your notebook. To generate the 'view' link for the notebook:
- Uploading the notebook into this repo and obtain the url (it should look like this: https://github.com/hortonworks-gallery/zeppelin-notebooks/blob/master/2A94M5J1Z/note.json)
- Generate a link to preview the notebook by pasting the url from #1 into ZeppelinHub viewer (https://www.zepl.com/viewer). This will generate a preview page for your notebook (at a url that looks something like https://www.zepl.com/viewer/notebooks/aHR0cHM6Ly9yYXcuZ2l0aHVidXNlcmNvbnRlbnQuY29tL2hvcnRvbndvcmtzLWdhbGxlcnkvemVwcGVsaW4tbm90ZWJvb2tzL21hc3Rlci8yQTk0TTVKMVovbm90ZS5qc29u/)
- Copy the url from #2 of the notebook Include the preview link under the 'View' link in table below. You can follow examples from existing notebooks for this at the markdown version of README here
Demos:
Description | Components | Code | View | Author | Comments |
---|---|---|---|---|---|
Starter Bank demo | Spark/SparkSQL | json | view | NFLabs | Default notebook installed by Zeppelin |
Australian Spending dataset | Spark/SparkSQL | json | view | Ned Shawa | |
Australian Spending dataset | Hive | json | view | Ned Shawa | |
Map Earthquake dataset | Spark/SparkSQL | json | view | NFLabs (updated by Mark Laney) | |
IoT Trucking demo | Spark/SparkSQL | json | view | Nauman Fakhar, Dhruv Kumar | IoT Truck Demo from Hadoop Summit |
Intra-day stock prices | Phoenix | json | view | Ali Bajwa | |
Magellan: Geospatial Analytics Using Spark | Spark | json | view | Ram Sriharsha | Magellan Blog as Zeppelin Notebook. Requires Spark 1.4.1 See here for more details |
Twitter sentiment analysis using Spark Streaming | Spark streaming | json | view | Guilherme Braccialli | |
PySpark tutorial: Analyzing network intrusion dataset with Python and Spark | PySpark | json | view | Saptak Sen | |
Single view demo | Hive/Sqoop/HDF | json | view | Ali Bajwa/Ajay Singh | Notebook version of Single view lab |
Predicting Airline Delays | Spark/MLLib | json | view | Ofer Mendelevitch | Related to this Blog post and part 2 |
Sensors Predictive Analysis | SparkSQL/PySpark | json | view | Vedant Jain | Adding Predictions to this Blog post |
Analyzing Server Web Logs | SparkSQL/PySpark | json | view | Paul Hargis | Parse and Analyze HTTP Web Server Logs (ported from Databricks) |
HDFS Space Usage Visualization | HDFS / D3.js | json | view | Guilherme Braccialli | A D3.js Sunburst visualization to explore HDFS space utilization and other metadata info, more details |
Credit card fraud prevention demo | Spark/MLLib | json | view | Kirk Haslbeck | more details |