DSAID (@dsaidgovsg)

Top repositories

1

airflow-pipeline

An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Python
171
star
2

terraform-modules

Reusable Terraform modules
HCL
78
star
3

multimodal-learning-hands-on-tutorial

Jupyter Notebook
61
star
4

k-shortest-path

Implements K shortest path algorithms for networkx
Python
16
star
5

vigilantgantry

Face segmentation algorithm that the VigilantGantry uses to identify potential missed out causes by current thermal systems (due to occlusion from fringe, cap, head-gear).
Python
16
star
6

python-spark

Docker image for a Python installation with Spark, Hadoop and Sqoop binaries
15
star
7

TDBSCAN

TDBSCAN with Move Ability: Spatiotemporal Density Clustering
Python
9
star
8

spark-geo-privacy

Geospatial privacy functions for Apache Spark
Scala
6
star
9

yarn-reverse-proxy

Reverse proxy for the status pages of a YARN cluster
Shell
5
star
10

nomad-parametric-autoscaler

A customizable Nomad/EC2 auto-scaling service
JavaScript
5
star
11

folium-resource-server

Simple server to just host folium JS and CSS resources
JavaScript
4
star
12

stack-2022-differential-privacy-workshop

Jupyter Notebook
4
star
13

spark-k8s-addons

Dockerfile setup to install cloud related utilities onto the standard Spark K8s Docker images
Dockerfile
3
star
14

registrywatcher

Go
2
star
15

zeppelin

Zeppelin Dockerfile set-up with a wrapping dynamic GitHub releases JAR loader
Dockerfile
2
star
16

spark-custom-addons

Dockerfile Set-up to add dependencies into `spark-custom` images
Dockerfile
2
star
17

spark-k8s

CI set-up to generate Spark with Kubernetes Docker images
Shell
2
star
18

sg-tileserver-gl

Repackaged repository to build Docker image for Singapore tiles only
Shell
2
star
19

python-node

python-node
2
star
20

spark-base

Dockerfile setup for Spark set-up, imbued with varying degree of Python data science packages
Shell
2
star
21

spark-jupyterhub

Experimental and opinionated set-up to conveniently get going JupyterHub
Shell
1
star
22

zeppelin-jar-loader

Provides a simple JAR loader for dynamic loading of JAR files at the start for Zeppelin
Scala
1
star
23

data-privacy-workshop

Jupyter Notebook
1
star
24

spark-custom

Dockerfile set-up for building Spark releases from source code
Dockerfile
1
star
25

ljumphost

Proper jumphost set-up and instructions repository
Shell
1
star
26

benchmarking-differential-privacy-tools

Jupyter Notebook
1
star
27

spark-k8s-addons-ds

Dockerfile
1
star
28

kms-reencrypt

Python boto3 script to recursively KMS re-encrypt objects
Python
1
star
29

datavis-examples

Repo with sample dataset and codes to show some simple visualization.
Jupyter Notebook
1
star
30

ilytics

gunicorn, flask, and darknet model
C
1
star
31

folium-override-server

Webserver to re-generate uploaded folium map to use custom URLs
Python
1
star
32

ura-subzones

Simple library for dealing with URA subzones
JavaScript
1
star