• Stars
    star
    236
  • Rank 170,450 (Top 4 %)
  • Language
    Jupyter Notebook
  • License
    GNU Affero Genera...
  • Created over 2 years ago
  • Updated 2 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Tutorials for the Hopsworks Platform

πŸ‘¨πŸ»β€πŸ« Hopsworks Tutorials

We are happy to welcome you to our collection of tutorials dedicated to exploring the fundamentals of Hopsworks and Machine Learning development. In addition to offering different types of use cases and common subjects in the field, it facilitates navigation and use of models in a production environment using Hopsworks Feature Store.

βš™οΈ How to run the tutorials:

For the tutorials to work, you will need a Hopsworks account. To do so, go to app.hopsworks.ai and create one. With a managed account, just run the Jupyter notebook from within Hopsworks.

Generally the notebooks contain the information you will need on how to interact with the Hopsworks Platform.

If you have an app.hopsworks.ai account; you may connect to Hopsworks with the following line; this will prompt you with a link to your Token which will link to the feature store.

import hopsworks
 
project = hopsworks.login()
fs = project.get_feature_store()

In some cases, you may also need to install Hopsworks; to be able to work with the package. Simply start your notebook with:

!pip install -U hopsworks --quiet

The walkthrough and tutorials are provided in the form of Python notebooks, you will therefore need to run a jupyter environment or work within a colaboratory notebook in google; the later option might lead to some minor errors being displayed or libraries might require different library versions to work.

✍🏻 Concepts:

In order to understand the tutorials you need to be familiar with general concepts of Machine Learning and Python development. You may find some useful information in the Hopsworks documentation.

πŸ—„οΈ Table of Content:

  • Basic Tutorials:
    • QuickStart: Introductory tutorial to get started quickly.
    • Churn: Predict customers that are at risk of churning.
    • Fraud Batch: Detect Fraud Transactions (Batch use case).
    • Fraud Online: Detect Fraud Transactions (Online use case).
    • Iris: Classify iris flower species.
    • Loan Approval: Predict loan approvals.
  • Advanced Tutorials:
  • Integrations:
    • BigQuery Storage Connector: Create an External Feature Group using BigQuery Storage Connector.
    • Google Cloud Storage: Create an External Feature Group using GCS Storage Connector.
    • Redshift: Create an External Feature Group using Redshift Storage Connector.
    • Snowflake: Create an External Feature Group using Snowflake Storage Connector.
    • DBT Tutorial with BigQuery: Perform feature engineering in DBT on BigQuery.
    • WandB: Build a machine learning model with Weights & Biases.
    • Great Expectations: Introduction to Great Expectations concepts and classes which are relevant for integration with the Hopsworks MLOps platform.
    • Monitoring: How to implement feature monitoring in your production pipeline.
    • Bytewax: Real time feature computation using Bytewax.
    • Apache Beam: Real time feature computation using Apache Beam, Google Cloud Dataflow and Hopsworks Feature Store.
    • Apache Flink: Real time feature computation using Apache Flink and Hopsworks Feature Store.

πŸ“ Feedbacks & Comments:

We welcome feedbacks and suggestions, you can contact us on any of the following channels:

More Repositories

1

hopsworks

Hopsworks - Data-Intensive AI platform with a Feature Store
Java
1,125
star
2

rondb

This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB.
C++
574
star
3

hops-examples

Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Jupyter Notebook
117
star
4

maggy

Distribution transparent Machine Learning experiments on Apache Spark
Python
89
star
5

feature-store-api

Python - Java/Scala API for the Hopsworks feature store
Python
53
star
6

aml_end_to_end

AML End to End Example
Jupyter Notebook
50
star
7

hops-tensorflow

HopsYARN Tensorflow Framework.
Python
33
star
8

hops-util-py

Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.ai
Python
27
star
9

hopsworks-chef

Chef Cookbook for Hopsworks
Ruby
12
star
10

hops-docs

Documentation for Hopsworks and Hops
11
star
11

hopsworks-iot

Scala
8
star
12

hopsworks-api

Python SDK to interact with the Hopsworks API
Python
8
star
13

machine-learning-api

Hopsworks Machine Learning Api πŸš€ Model management with a model registry and model serving
Python
8
star
14

hops-hadoop-chef

Chef cookbook for Hops Hadoop
Ruby
7
star
15

hops-util

Utility Library for Hopsworks
Java
6
star
16

ndb-chef

Chef cookbook for MySQL Cluster (NDB)
Ruby
6
star
17

karamel-chef

This chef cookbook installs Karamel. Used by Vagrant to provision multi-node clusters.
Shell
6
star
18

flink-chef

Chef cookbook for Apache Flink.
Ruby
5
star
19

zeppelin-chef

Cookbook for installing Zeppelin/Spark
HTML
4
star
20

terraform-provider-hopsworksai

Hopsworks.ai Terraform provider
Go
4
star
21

hopslog-chef

Karamelized wrapper cookbook for installing Kibana and Logstash to work with Hopsworks.
Ruby
4
star
22

quartz

Logical Clocks Design System β€”Β NPM Package
TypeScript
3
star
23

spark-chef

Apache Spark chef cookbook
Ruby
3
star
24

ePipe

ePipe is a metadata system for HopsFS that provides replicated-metadata-as-a-service.
C++
3
star
25

sysbench-0.4.12

Sysbench tree for benchmarking iRoNDB
Shell
3
star
26

dr-elephant-chef

Chef cookbook to install Dr Elephant for Hadoop.
HTML
3
star
27

terraform-hopsworksai-helpers

Terraform module that creates the required cloud resources for Hopsworks.ai clusters on AWS and AZURE.
HCL
3
star
28

hops-kafka-authorizer

Kafka Authorization Engine for Hopsworks
Java
3
star
29

logicalclocks.github.io

Hopsworks documentation
2
star
30

kzookeeper

Karamelized, wrapper Chef cookbook for zookeeper
Ruby
2
star
31

flyingduck-chef

Chef cookbook for the installation of Flying Duck (Arrow Flight Server with DuckDB)
Ruby
2
star
32

elasticsearch-chef

Karamelized Chef cookbook that installs ElasticSearch.
Ruby
2
star
33

kube-hops-chef

Karamelized cookbook to deploy Kubernetes on the Hops platform
HTML
2
star
34

dela-chef

Dela is a p2p service for sharing datasets between Hadoop/Kafka Clusters.
Ruby
2
star
35

hopsworks-cloud-sdk

SDK for integrating Hopsworks with different cloud solutions
Python
1
star
36

hopsmonitor-chef

Wrapper chef cookbook Prometheus
HTML
1
star
37

cloud-chef

Cookbook to setup Hopsworks Cloud AMIs
Ruby
1
star
38

conda-chef

Karamelized Chef cookbook for installing Anaconda python package manager
Ruby
1
star
39

git-nullmerge

Git command to find and merge identical trees
Shell
1
star
40

mysqld_exporter

Exporter for MySQL server metrics
Go
1
star
41

tensorflow-chef

This is Chef Cookbook (Karamelized) to install Tensorflow using chef solo. Tensorflow is a Google framework for deep learning.
Ruby
1
star