• This repository has been archived on 27/Dec/2021
  • Stars
    star
    41
  • Rank 668,415 (Top 14 %)
  • Language
    Shell
  • License
    Apache License 2.0
  • Created almost 10 years ago
  • Updated over 9 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

This is an introduction of Apache Spark DataFrames.

More Repositories

1

tensorflow-serving-example

Examples to server tensorflow models with tensorflow serving
Python
96
star
2

machine-learning-microservice-python

Example to implement machine learning microservice with gRPC and Docker in Python
Python
81
star
3

action-sqlfluff

Run sqlfluff with reviewdog to check or format styles
Shell
67
star
4

dbt-artifacts-parser

A dbt artifacts parser in python
Python
66
star
5

bigquery-to-datastore

Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
Java
58
star
6

google-cloud-deep-learning-kit

Create a GPU instance on GCP with Jupyter + Keras(Tensorflow) + Nvidia Docker
Shell
39
star
7

click-through-rate-prediction

Kaggle's click through rate prediction with Spark Pipeline API
Scala
23
star
8

dbt-unittest

A dbt package provides macros for unit testing, inspired by python's unittest module
Shell
23
star
9

spark-streaming-with-google-cloud-example

an example of integrating Spark Streaming with Google Pub/Sub and Google Datastore
Scala
17
star
10

dbt-artifacts-loader

Load dbt artifacts uploaded to GCS to BigQuery in order to track historical dbt results
Python
17
star
11

dbt-airflow-macros

Dbt package for Apache Airflow inspired macros
Shell
15
star
12

kuromoji-for-bigquery

Tokenize Japanese text on BigQuery with Kuromoji in Apache Beam/Google Dataflow at scale
Java
14
star
13

spark-ranking-algorithms

Ranking algorithms for Spark machine learning pipeline
Scala
14
star
14

auto-sklearn-examples

auto-sklearn examples on Jupyter notebooks
Jupyter Notebook
13
star
15

bisecting-kmeans

An implementation of Bisecting KMeans Clustering which is a kind of Hierarchical Clustering algorithm on Spark
Scala
12
star
16

gihyo-spark-book-example

技術評論社「詳解Apache Spark」のサンプルコード
Scala
10
star
17

dbt-bigquery-project-template

Python
9
star
18

gpt-code-review-action

A GitHub Action for GPT to review a pull request
Python
8
star
19

spark-kuromoji-tokenizer

Kuromoji Tokenizer for Spark DataFrames
Scala
6
star
20

google-log-aggregation-example

Example to aggregate logs from Google Pub/Sub to date-partitioned BigQuery on Dataflow
Java
5
star
21

kotlin-spark-example

An example to use Apache Spark in Kotlin
Kotlin
4
star
22

dbt-ops

A set of dbt macros to maintain dbt projects
Python
4
star
23

dbt-gcp-billing

Reusable dbt models to deal BigQuery tables of Google Cloud billing
Makefile
4
star
24

SPARK-5992-LSH-design-doc

3
star
25

click-custom-multi-commands-example

Example to dynamically load sub-commands with click
Python
3
star
26

lightdash-client-python

A python-based client to call Lightdash APIs
Python
3
star
27

python-grpc-example

Example to use gRPC in python with docker
Python
3
star
28

tensorflow-hub-with-ml-engine

Example of transfer learning with tensorflow-hub and Google ML Engine
Python
3
star
29

P1_Facial_Keypoints

Udacity Compyter Vision Nanodegree Project 1 Facial Keypoints Detection
HTML
2
star
30

polyaxon-tf-distributed-training

Python
2
star
31

bisecting-kmeans-blog

2
star
32

lightdash-ops

A python-based CLI to operate Lightdash
Python
2
star
33

convert-to-draft-action

A custom GitHub Action to convert a pull request to draft if all workflows aren't passed
JavaScript
2
star
34

freeze-optimize-tf-model-example

Python
1
star
35

mxnet-in-keras-example

Example of Apache MXNet in Keras
Jupyter Notebook
1
star
36

deep-learning-with-keras

Deep Learning with Keras
Python
1
star
37

.vim

Vim Script
1
star
38

python-rest-apis

RESTful API with various web frameworks with machine learning in python
Python
1
star
39

spark-deep-neural-network

Shell
1
star
40

R-functions

1
star
41

dbt-issue-with-multiple-service-accounts-on-bigquery

Reproduce an issue of dbt with multiple service accounts on BigQuery
HCL
1
star
42

terraform-google-copy-bq-datasets

A terraform module to copy BigQuery datasets across regions
HCL
1
star
43

action-terrascan

Run terrascan with reviewdog on pull requests to enforce security best practices
Shell
1
star