  • Language: Jupyter Notebook
  • License: MIT License

dash-120million-taxi-app

Explore 120 million taxi trips in real time with Dash and Vaex

Available at https://dash.vaex.io/

Running this app

Clone the repo:

$ git clone https://github.com/vaexio/dash-120million-taxi-app

Run in debug mode:

$ python app.py

Make sure the data is downloaded from S3:

$ python getdata.py

(You can ignore the warning WARNING:ThreadPoolExecutor-1_31:urllib3.connectionpool:Connection pool is full, discarding connection: vaex.s3.amazonaws.com)
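If the warning clutters the output, one option is to raise the log level of the logger that emits it. The sketch below assumes the message comes from the standard urllib3.connectionpool logger, as its prefix suggests. Where to place these lines (for example near the top of getdata.py) is an assumption, not something this README documents.

```python
import logging

# The warning's prefix names the logger: urllib3.connectionpool.
# Raising its level to ERROR hides WARNING-level messages such as
# "Connection pool is full, discarding connection".
logging.getLogger("urllib3.connectionpool").setLevel(logging.ERROR)
```

This only hides the message; it does not change how the download behaves.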

Run in production mode (if the app reads its data from S3, make sure the data is downloaded first):

$ VAEX_NUM_THREADS=8 gunicorn -w 16 app:server -b 0.0.0.0:8050
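Because VAEX_NUM_THREADS applies per worker process, the command above can request many more threads than it first appears. The following is a rough sanity check, not part of the app; the helper name total_compute_threads is hypothetical.

```python
import os


def total_compute_threads(workers: int, threads_per_worker: int) -> int:
    """Each gunicorn worker is a separate process with its own thread pool."""
    return workers * threads_per_worker


# Values taken from the command above: -w 16 and VAEX_NUM_THREADS=8.
total = total_compute_threads(workers=16, threads_per_worker=8)
cores = os.cpu_count() or 1  # os.cpu_count() can return None
print(f"{total} threads requested on a machine with {cores} logical cores")
```

With the values shown, 16 workers times 8 threads gives 128 threads in total, so it may be worth comparing that figure with the core count of the target machine before settling on both numbers.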

Settings

Change the Dash app's settings through environment variables:

$ export TAXI_PATH=/data/taxi/yellow_taxi_2012_zones.hdf5  # change the default S3 file
$ export VAEX_NUM_THREADS=16      # change the number of threads per process/worker
$ export DASH_CACHE_TIMEOUT=240   # increase the cache timeout to 4 minutes
$ export DASH_CACHE_TIMEOUT=-1    # disable the cache (useful for benchmarking)
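As an illustration of how such variables are typically consumed, here is a minimal sketch that reads them into a dictionary. It is not taken from app.py: the function name load_settings, the dictionary keys, and the 60-second fallback timeout are all assumptions made for the example.

```python
import os
from typing import Mapping


def load_settings(env: Mapping[str, str] = os.environ) -> dict:
    """Read the three variables from this section into a plain dict."""
    # Hypothetical fallback of 60 seconds; the app's real default is not stated here.
    timeout = int(env.get("DASH_CACHE_TIMEOUT", "60"))
    threads = env.get("VAEX_NUM_THREADS")
    return {
        # None means "keep the app's default S3 file".
        "taxi_path": env.get("TAXI_PATH"),
        "num_threads": int(threads) if threads is not None else None,
        "cache_timeout": timeout,
        # -1 is the value this section uses to disable the cache.
        "cache_enabled": timeout != -1,
    }


settings = load_settings({"TAXI_PATH": "/data/taxi/yellow_taxi_2012_zones.hdf5",
                          "DASH_CACHE_TIMEOUT": "-1"})
print(settings)
```

Passing the environment as an argument keeps the function easy to test without touching the real process environment.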