• Stars
    star
    432
  • Rank 97,305 (Top 2 %)
  • Language
    JavaScript
  • License
    GNU General Publi...
  • Created over 5 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

All-in-one AI container for rapid prototyping

header image

GitHub last commit

All-in-one AI development container for rapid prototyping, compatible with the nvidia-docker GPU-accelerated container runtime as well as JupyterHub. This is designed as a lighter and more portable alternative to various cloud provider "Deep Learning Virtual Machines". Get up and running with a wide range of machine learning and deep learning tasks by pulling and running the container on your workstation, on the cloud or within JupyterHub.

What's included?

frameworks

ide

Using the AI Lab Container

This image can be used together with NVIDIA GPUs on workstation, servers, cloud instances. It can also be used via JupyterHub deployments as no additional ports are required things like for TensorBoard. Please note that the following instructions assume you already have the NVIDIA drivers and container runtime already installed. If not, here are some quick instructions.

Pulling the container

docker pull nvaitc/ai-lab:20.03

Running an interactive shell (bash)

nvidia-docker run --rm -it nvaitc/ai-lab:20.03 bash

Run Jupyter Notebook

The additional command line flags define the following options:

  • forward port 8888 to your host machine
  • mount /home/$USER as the working directory (/home/jovyan)
nvidia-docker run --rm \
 -p 8888:8888 \
 -v /home/$USER:/home/jovyan \
 nvaitc/ai-lab:20.03

Run JupyterLab by replacing tree with lab in the browser address bar.

There is a default blank password for the Jupyter interface. To set your own password, pass it as an environment variable NB_PASSWD as follows:

nvidia-docker run --rm \
 -p 8888:8888 \
 -v /home/$USER:/home/jovyan \
 -e NB_PASSWD='mypassword' \
 nvaitc/ai-lab:20.03

Run Batch Job

It is also perfectly possible to run a batch job with this container, be it on a workstation or as part of a larger cluster with a scheduler that can schedule Docker containers.

nvidia-docker run --rm bash nvaitc/ai-lab:20.03 -c 'echo "Hello world!" && python3 script.py'

Additional Instructions

For extended instructions, please take a look at: INSTRUCTIONS.md.

INSTRUCTIONS.md contains full instructions and addresses common questions on deploying to public cloud (GCP/AWS), as well as using PyTorch DataLoader or troubleshooting permission issues with some setups.

If you have any ideas or suggestions, please feel free to open an issue.

FAQ

1. Can I modify/build this container myself?

Sure! The Dockerfile is provided in this repository. All you need is a fast internet connection and about 1h of time to build this container from scratch.

Should you only require some extra packages, you can build your own Docker image using nvaitc/ai-lab as the base image.

For a detailed guide, check out BUILD.md.

2. Do you support MXNet/some-package?

See Point 1 above to see how to add MXNet/some-package into the container. I had chosen not to distribute MXNet/some-package with the container as it is less widely used and is large in size, and can be easily installed with pip since the environment is already properly configured. If you have a suggestion for a package that you would like to see added, open an issue.

3. Do you support multi-node or multi-GPU tasks?

Multi-GPU has been tested with tf.distribute and Horovod, and it works as expected. Multi-node has not been tested.

4. Can I get hardware accelerated GUI (OpenGL) applications?

Yes! Be sure to pull the vnc version of the container e.g. nvaitc/ai-lab:20.03-vnc and use the "New" menu in Jupyter Notebook to launch a new VNC Desktop. This will allow you to use a virtual desktop interface. Next, you need to allow the container to access your host X server (this may be a security concern for some people).

xhost +local:root
nvidia-docker --rm run \
 -e "DISPLAY" \
 -v /tmp/.X11-unix:/tmp/.X11-unix:rw \
 -p 8888:8888 \
 -v /home/$USER:/home/jovyan \
 nvaitc/ai-lab:20.03-vnc

Next, start your application adding vglrun in front of the application command (e.g. vglrun glxgears). You can see a video of SuperTuxKart running in the VNC desktop here.

5. How does this contrast with NGC containers?

NVIDIA GPU Cloud (NGC) features NVIDIA tuned, tested, certified, and maintained containers for deep learning and HPC frameworks that take full advantage of NVIDIA GPUs on supported systems, such as NVIDIA DGX products. We recommend the use of NGC containers for performance critical and production workloads.

The AI Lab container was designed for students and researchers. The container is primarily designed to create a frictionless experience (by including all frameworks) during the initial prototyping and exploration phase, with a focus on iteration with fast feedback and less focus on deciding on specific approaches or frameworks. This is not an official NVIDIA product!

If you would like to use NGC containers in an AI Lab like container, there is an example of how you can build one yourself. Take a look at tf-amp.Dockerfile. Do note that you are restricted from distributing derivative images from NGC containers in a public Docker registry.

6. What GPUs do you support?

The container supports compute capability 6.0, 6.1, 7.0, 7.5:

  • Pascal (P100, GTX 10-series)
  • Volta (V100, Titan V)
  • Turing (T4, RTX 20-series)

7. Any detailed system requirements?

  1. Ubuntu 18.04+, or a close derivative distro
  2. NVIDIA drivers (>=418, or >=410 Tesla-ready driver)
  3. NVIDIA container runtime (nvidia-docker)
  4. NVIDIA Pascal, Volta or Turing GPU
    • If you have a GTX 10-series or newer GPU, you're fine
    • K80 and GTX 9-series cards are not supported

Support

  • Core Maintainer: Timothy Liu (tlkh)
  • This is not an official NVIDIA product!
  • The website, its software and all content found on it are provided on an “as is” and “as available” basis. NVIDIA/NVAITC does not give any warranties, whether express or implied, as to the suitability or usability of the website, its software or any of its content. NVIDIA/NVAITC will not be liable for any loss, whether such loss is direct, indirect, special or consequential, suffered by any party as a result of their use of the libraries or content. Any usage of the libraries is done at the user’s own risk and the user will be solely responsible for any damage to any computer system or loss of data that results from such activities.
  • Please open an issue if you encounter problems or have a feature request

Adapted from the Jupyter Docker Stacks

GitHub contributors GitHub

More Repositories

1

asitop

Perf monitoring CLI tool for Apple Silicon
Python
2,866
star
2

tf-metal-experiments

TensorFlow Metal Backend on Apple Silicon Experiments (just for fun)
Jupyter Notebook
265
star
3

text-emotion-classification

Archived - not answering issues
Jupyter Notebook
200
star
4

prowler

Distributed Network Vulnerability Scanner
Python
121
star
5

m1-cpu-benchmarks

Jupyter Notebook
44
star
6

SmartBin

Spring 2018 - 10.009 Digital World 1D Project
Python
42
star
7

pycon-sg19-tensorflow-tutorial

PyCon SG 2019 Tutorial: Optimizing TensorFlow Performance
Jupyter Notebook
25
star
8

t2t-tuner

Convenient Text-to-Text Training for Transformers
Jupyter Notebook
19
star
9

depsep-conv-benchmarks

Code for Depth-wise Separable Convolutions: Performance Investigations
Jupyter Notebook
19
star
10

fake-news-chrome-extension

Chrome Extension to help fight Online Misinformation
JavaScript
16
star
11

onprem-gpu-cluster-setup

On-prem GPU Cluster Setup
Shell
8
star
12

rhh-2017-crowd-tracking

Red Hat Hackathon Singapore 2017 - Camera-based Crowd Tracking Solution
Python
8
star
13

transformers-benchmarking

just for fun
Jupyter Notebook
8
star
14

shortcuts

Painless curl | bash installs. Not that you should!
Shell
7
star
15

awesome-tf2-implementations

List of official/unofficial TF2 implementations of models
7
star
16

hyperconverged-private-cloud-guide

A guide to building a hyper-converged private cloud on commodity hardware
6
star
17

prowler-dashboard

Dashboard for Prowler
HTML
6
star
18

fake-news-web-api

Backend endpoint for the Fake News Chrome Extension
Python
5
star
19

milair-dataset

Military Aircraft Image Dataset
5
star
20

nbvscode

VS Code in Jupyter
Python
3
star
21

50.012-dask-network-project

50.012: Networks Project (2019)
Jupyter Notebook
2
star
22

paraphrase-metrics

ACL 2022 paper "Towards Better Characterization of Paraphrases"
Jupyter Notebook
2
star
23

xfmers

Quickly initialize bespoke Transformers
Python
2
star
24

t5-fp16-surgery

T5 FP16 Surgery
Jupyter Notebook
2
star
25

simple-knowledge-graph

Some simple knowledge graph experiments
Jupyter Notebook
2
star
26

atomic-orbitals

Jupyter Notebook
1
star
27

sg-rainmap-predictor

SG Rain Areas Prediction
Jupyter Notebook
1
star
28

NVStatsRecorder

Python-based NVIDIA GPU Stats Recorder
Python
1
star
29

libsutd

Singapore University of Technical Difficulty
Python
1
star
30

hoax-images

Dataset of images commonly used for online hoaxes
1
star
31

tf-pipeline-model-parallel

TensorFlow Pipeline Model Parallel Experiments
Python
1
star
32

endgame

If we find something it's the endgame (also, Avengers yay)
HTML
1
star
33

pbs-demo-sutd

PBS Demo for SUTD HPC
Python
1
star
34

cyber-range-automation

Cyber-range Automation
Python
1
star
35

reverse-image-search

Reverse Image Search (Database)
Jupyter Notebook
1
star
36

serverless-transformers

Serve HuggingFace Transformer models via Cloud Run
Python
1
star
37

Cayenne-PlantMonitor

ESP8266 Plant Monitor using Cayenne
C++
1
star
38

dnn_animations

Animations for explaining DNN layers
1
star
39

sg-rainmap-dataset

Singapore Rain Areas Dataset crawled from NEA
Jupyter Notebook
1
star
40

browser-trafficgen

Realistic browser-based traffic generation with Python
Python
1
star