• Stars
    star
    1,997
  • Rank 23,197 (Top 0.5 %)
  • Language
    Python
  • License
    MIT License
  • Created over 6 years ago
  • Updated about 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Semantic segmentation on aerial and satellite imagery. Extracts features such as: buildings, parking lots, roads, water, clouds

Note: Robosat is neither maintained not actively developed any longer by Mapbox. See this issue.
The main developers (@daniel-j-h, @bkowshik) are no longer with Mapbox.

RoboSat

Generic ecosystem for feature extraction from aerial and satellite imagery RoboSat pipeline extracting buildings from aerial imagery Berlin aerial imagery, segmentation mask, building outlines, simplified GeoJSON polygons

Table of Contents

  1. Overview
  2. Installation
  3. Usage
  4. Extending
  5. Contributing
  6. License

Overview

RoboSat is an end-to-end pipeline written in Python 3 for feature extraction from aerial and satellite imagery. Features can be anything visually distinguishable in the imagery for example: buildings, parking lots, roads, or cars.

Have a look at

The tools RoboSat comes with can be categorized as follows:

  • data preparation: creating a dataset for training feature extraction models
  • training and modeling: segmentation models for feature extraction in images
  • post-processing: turning segmentation results into cleaned and simple geometries

Tools work with the Slippy Map tile format to abstract away geo-referenced imagery behind tiles of the same size.

The data preparation tools help you with getting started creating a dataset for training feature extraction models. Such a dataset consists of aerial or satellite imagery and corresponding masks for the features you want to extract. We provide convenient tools to automatically create these datasets downloading aerial imagery from the Mapbox Maps API and generating masks from OpenStreetMap geometries but we are not bound to these sources.

The modelling tools help you with training fully convolutional neural nets for segmentation. We recommend using (potentially multiple) GPUs for these tools: we are running RoboSat on AWS p2/p3 instances and GTX 1080 TI GPUs. After you trained a model you can save its checkpoint and run prediction either on GPUs or CPUs.

The post-processing tools help you with cleaning up the segmentation model's results. They are responsible for denoising, simplifying geometries, transforming from pixels in Slippy Map tiles to world coordinates (GeoJSON features), and properly handling tile boundaries.

If this sounds almost like what you need, see the extending section for more details about extending RoboSat. If you want to contribute, see the contributing section for more details about getting involved with RoboSat.

Installation

We provide pre-built Docker images for both CPU as well as GPU environments on Docker Hub in mapbox/robosat.

Using a CPU container to show all available sub-commands

docker run -it --rm -v $PWD:/data --ipc=host --network=host mapbox/robosat:latest-cpu --help

Using a GPU container (requires nvidia-docker on the host) to train a model

docker run --runtime=nvidia -it --rm -v $PWD:/data --ipc=host mapbox/robosat:latest-gpu train --model /data/model.toml --dataset /data/dataset.toml --workers 4

Arguments

  • --runtime=nvidia enables the nvidia-docker runtime for access to host GPUs
  • --ipc=host is required for shared memory communication between workers
  • --network=host is required for network communication in the download tool
  • -v $PWD:/data makes the current directory on the host accessible at /data in the container

For installation from source (requires installing dependencies) see the Dockerfiles in the docker/ directory.

Usage

The following describes the tools making up the RoboSat pipeline. All tools can be invoked via

./rs <tool> <args>

Also see the sub-command help available via

./rs --help
./rs <tool> --help

Most tools take a dataset or model configuration file. See examples in the configs directory. You will need to adapt these configuration files to your own dataset, for example setting your tile resolution (e.g. 256x256 pixel). You will also need to adapt these configuration files to your specific deployment setup, for example using CUDA and setting batch sizes.

rs extract

Extracts GeoJSON features from OpenStreetMap to build a training set from.

The result of rs extract is a GeoJSON file with the extracted feature geometries.

The rs extract tool walks OpenStreetMap .osm.pbf base map files (e.g. from Geofabrik) and gathers feature geometries. These features are for example polygons for parking lots, buildings, or roads.

rs cover

Generates a list of tiles covering GeoJSON features to build a training set from.

The result of rs cover is a file with tiles in (x, y, z) Slippy Map tile format covering GeoJSON features.

The rs cover tool reads in the GeoJSON features generated by rs extract and generates a list of tiles covering the feature geometries.

rs download

Downloads aerial or satellite imagery from a Slippy Map endpoint (e.g. the Mapbox Maps API) based on a list of tiles.

The result of rs download is a Slippy Map directory with aerial or satellite images - the training set's images you will need for the model to learn on.

The rs download tool downloads images for a list of tiles in (x, y, z) Slippy Map tile format generated by rs cover.

The rs download tool expects a Slippy Map endpoint where placeholders for {x}, {y}, and {z} are formatted with each tile's ids. For example, for the Mapbox Maps API: https://api.mapbox.com/v4/mapbox.satellite/{z}/{x}/{y}@2x.webp?access_token=TOKEN.

rs rasterize

Rasterizes GeoJSON features into mask images based on a list of tiles.

The result of rs rasterize is a Slippy Map directory with masks - the training set's masks you will need for the model to learn on.

The rs rasterize tool reads in GeoJSON features and rasterizes them into single-channel masks with a color palette attached for quick visual inspection.

rs train

Trains a model on a training set made up of (image, mask) pairs.

The result of rs train is a checkpoint containing weights for the trained model.

The rs train tool trains a fully convolutional neural net for semantic segmentation on a dataset with (image, mask) pairs generated by rs download and rs rasterize. We recommend using a GPU for training: we are working with the AWS p2 instances and GTX 1080 TI GPUs.

Before you can start training you need the following.

  • You need a dataset which you should split into three parts: training and validation for rs train to train on and to calculate validation metrics on and a hold-out dataset for final model evaluation. The dataset's directory need to look like the following.

    dataset
    โ”œโ”€โ”€ training
    โ”‚ย ย  โ”œโ”€โ”€ images
    โ”‚ย ย  โ””โ”€โ”€ labels
    โ””โ”€โ”€ validation
        โ”œโ”€โ”€ images
        โ””โ”€โ”€ labels
    
  • You need to calculate label class weights with rs weights on the training set's labels

  • You need to add the path to the dataset's directory and the calculated class weights and statistics to the dataset config.

rs export

Exports a trained model in ONNX format for prediction across different backends (like Caffe2, TensorFlow).

The result of rs export is an ONNX GraphProto .pb file which can be used with the ONNX ecosystem.

Note: the rs predict tool works with .pth checkpoints. In contrast to these .pth checkpoints the ONNX models neither depend on PyTorch or the Python code for the model class and can be used e.g. in resource constrained environments like AWS Lambda.

rs predict

Predicts class probabilities for each image tile in a Slippy Map directory structure.

The result of rs predict is a Slippy Map directory with a class probability encoded in a .png file per tile.

The rs predict tool loads the checkpoint weights generated by rs train and predicts semantic segmentation class probabilities for a Slippy Map dataset consisting of image tiles.

rs masks

Generates segmentation masks for each class probability .png file in a Slippy Map directory structure.

The result of rs masks is a Slippy Map directory with one single-channel image per tile with a color palette attached for quick visual inspection.

The rs masks tool loads in the .png tile segmentation class probabilities generated by rs predict and turns them into segmentation masks. You can merge multiple Slippy Map directories with class probabilities into a single mask using this tool in case you want to make use of an ensemble of models.

rs features

Extracts simplified GeoJSON features for segmentation masks in a Slippy Map directory structure.

The result of rs features is a GeoJSON file with the extracted simplified features.

The rs features tool loads the segmentation masks generated by rs masks and turns them into simplified GeoJSON features.

rs merge

Merges close adjacent GeoJSON features into single features.

The result of rs merge is a GeoJSON file with the merged features.

The rs merge tool loads GeoJSON features and depending on a threshold merges adjacent geometries together.

rs dedupe

Deduplicates predicted features against existing OpenStreetMap features.

The result of rs dedupe is a GeoJSON file with predicted features which are not in OpenStreetMap.

The rs dedupe deduplicates predicted features against OpenStreetMap.

Note: rs extract to generate a GeoJSON file with OpenStreetMap features.

rs serve

Serves tile masks by providing an on-demand segmentation tileserver.

The rs serve tool implements a Slippy Map raster tileserver requesting satellite tiles and applying the segmentation model on the fly.

Notes: useful for visually inspecting the raw segmentation masks on the fly; for serious use-cases use rs predict and similar.

rs weights

Calculates class weights for a Slippy Map directory with masks.

The result of rs weights is a list of class weights useful for rs train to adjust the loss based on the class distribution in the masks.

The rs weights tool computes the pixel-wise class distribution on the training dataset's masks and outputs weights for training.

rs compare

Prepares images, labels and predicted masks, side-by-side for visual comparison.

The result of rs compare is a Slippy Map directory with images that have the raw image on the left, the label in the middle and the prediction on the right.

rs subset

Filters a Slippy Map directory based on a list of tile ids.

The result of rs subset is a Slippy Map directory filtered by tile ids.

The main use-case for this tool is hard-negative mining where we want to filter false positives from a prediction run.

Extending

There are multiple ways to extend RoboSat for your specific use-cases. By default we use Mapbox aerial imagery from the Maps API and feature masks generated from OpenStreetMap geometries. If you want to bring your own imagery, masks, or features to extract, the following will get you started.

Bring your own imagery

RoboSat's main abstraction is the Slippy Map tile format. As long as your imagery is geo-referenced and you can convert it to a Slippy Map directory structure to point the command lines to, you are good to go. Make sure imagery and masks are properly aligned.

Bring your own masks

RoboSat's main abstraction is the Slippy Map tile format. As long as you can convert your masks to a Slippy Map directory structure to point the command lines to, you are good to go. Masks have to be single-channel .png files with class indices starting from zero. Make sure imagery and masks are properly aligned.

Add support for feature in pre-processing

Pre-processing (rs extract) is responsible for turning OpenStreetMap geometries and tags into polygon feature masks. If you want to add a new feature based on geometries in OpenStreetMap you have to:

And that's it! From there on the pipeline is fully generic.

Add support for feature in post-processing

Post-processing (rs features) is responsible for turning segmentation masks into simplified GeoJSON features. If you want to add custom post-processing for segmentation masks you have to:

And that's it! From there on the pipeline is fully generic.

Contributing

We are thankful for contributions and are happy to help; that said there are some constraints to take into account:

  • For non-trivial changes you should open a ticket first to outline and discuss ideas and implementation sketches. If you just send us a pull request with thousands of lines of changes we most likely won't accept your changeset.
  • We follow the 80/20 rule where 80% of the effects come from 20% of the causes: we strive for simplicity and maintainability over pixel-perfect results. If you can improve the model's accuracy by two percent points but have to add thousands of lines of code we most likely won't accept your changeset.
  • We take responsibility for changesets going into master: as soon as your changeset gets approved it is on us to maintain and debug it. If your changeset can not be tested, or maintained in the future by the core developers we most likely won't accept your changeset.

License

Copyright (c) 2018 Mapbox

Distributed under the MIT License (MIT).

More Repositories

1

mapbox-gl-js

Interactive, thoroughly customizable maps in the browser, powered by vector tiles and WebGL
JavaScript
10,264
star
2

pixelmatch

The smallest, simplest and fastest JavaScript pixel-level image comparison library
JavaScript
6,055
star
3

mapbox-gl-native

Interactive, thoroughly customizable maps in native Android, iOS, macOS, Node.js, and Qt applications, powered by vector tiles and OpenGL
C++
4,297
star
4

tippecanoe

Build vector tilesets from large collections of GeoJSON features.
C++
2,423
star
5

awesome-vector-tiles

Awesome implementations of the Mapbox Vector Tile specification
2,311
star
6

delaunator

An incredibly fast JavaScript library for Delaunay triangulation of 2D points
JavaScript
2,255
star
7

earcut

The fastest and smallest JavaScript polygon triangulation library for your WebGL apps
JavaScript
2,174
star
8

supercluster

A very fast geospatial point clustering library for browsers and Node.
JavaScript
2,061
star
9

mapbox.js

Mapbox JavaScript API, a Leaflet Plugin
HTML
1,902
star
10

geojson.io

A quick, simple tool for creating, viewing, and sharing spatial data
JavaScript
1,740
star
11

geojson-vt

Slice GeoJSON into vector tiles on the fly in the browser
JavaScript
1,731
star
12

flamebearer

Blazing fast flame graph tool for V8 and Node ๐Ÿ”ฅ
JavaScript
1,634
star
13

maki

A POI Icon Set
JavaScript
1,475
star
14

polylabel

A fast algorithm for finding the pole of inaccessibility of a polygon (in JavaScript and C++)
C++
1,312
star
15

togeojson

convert KML and GPX to GeoJSON, without the fuss
JavaScript
1,185
star
16

mapbox-studio-classic

JavaScript
1,136
star
17

node-pre-gyp

Node.js tool for easy binary deployment of C++ addons
JavaScript
1,071
star
18

webgl-wind

Wind power visualization with WebGL particles
JavaScript
943
star
19

geobuf

A compact binary encoding for geographic data.
JavaScript
917
star
20

mapbox-navigation-ios

Turn-by-turn navigation logic and UI in Swift on iOS
Swift
861
star
21

earcut.hpp

Fast, header-only polygon triangulation
C
850
star
22

mapbox-gl-draw

Draw tools for mapbox-gl-js
JavaScript
827
star
23

Fingertips

Touch indicators on external displays for iOS applications.
Swift
809
star
24

XcodeClangFormat

Format code in Xcode 8+ with clang-format
Objective-C++
808
star
25

vector-tile-spec

Mapbox Vector Tile specification
805
star
26

pbf

A low-level, lightweight protocol buffers implementation in JavaScript.
JavaScript
746
star
27

mapbox-android-demo

Google Play demo app for the Mapbox Maps SDK for Android
Java
704
star
28

mbutil

Importer and Exporter of MBTiles
Python
694
star
29

osm-bright

A Carto template for OpenStreetMap data
CartoCSS
690
star
30

mapbox-unity-sdk

Mapbox Unity SDK - https://www.mapbox.com/unity/
C#
677
star
31

mapboxgl-jupyter

Use Mapbox GL JS to visualize data in a Python Jupyter notebook
Python
661
star
32

carto

fast CSS-like map stylesheets
JavaScript
653
star
33

mapbox-sdk-js

A JavaScript client to Mapbox services, supporting Node, browsers, and React Native
JavaScript
652
star
34

concaveman

A very fast 2D concave hull algorithm in JavaScript
JavaScript
640
star
35

leaflet-omnivore

universal format parser for Leaflet & Mapbox.js
JavaScript
625
star
36

mapbox-react-examples

Example patterns for building React apps with Mapbox GL JS
JavaScript
615
star
37

martini

A JavaScript library for real-time RTIN terrain mesh generation
JavaScript
609
star
38

polyline

polyline encoding and decoding in javascript
JavaScript
604
star
39

mapbox-navigation-android

Mapbox Navigation SDK for Android
Kotlin
572
star
40

mbtiles-spec

specification documents for the MBTiles tileset format
569
star
41

tiny-sdf

Browser-side SDF font generator
HTML
562
star
42

mapnik-vector-tile

Mapnik implemention of Mapbox Vector Tile specification
C++
546
star
43

storytelling

Storytelling with maps template
HTML
541
star
44

mapbox-gl-leaflet

binding from Mapbox GL JS to the Leaflet API
JavaScript
518
star
45

tilelive

fast interface to tiles with pluggable backends - NOT ACTIVELY MAINTAINED
JavaScript
514
star
46

cheap-ruler

Fast approximations for common geodesic measurements ๐ŸŒ
JavaScript
416
star
47

mapbox-java

The Mapbox Java SDK โ€“ Java wrappers around Mapbox APIs and other location data
Java
403
star
48

jni.hpp

A modern, type-safe, header-only, C++14 wrapper for JNI
C++
388
star
49

mercantile

Spherical mercator tile and coordinate utilities
Python
381
star
50

mapbox-maps-android

Interactive, thoroughly customizable maps in native Android powered by vector tiles and OpenGL.
Kotlin
368
star
51

variant

C++11/C++14 Variant
C++
365
star
52

leaflet-image

leaflet maps to images
JavaScript
360
star
53

mapbox-gl-geocoder

Geocoder control for mapbox-gl-js using Mapbox Geocoding API
JavaScript
357
star
54

mbview

View mbtiles locally
EJS
353
star
55

csv2geojson

magically convert csv files to geojson files
JavaScript
353
star
56

mbxmapkit

DEPRECATED - Lightweight Mapbox integration with MapKit on iOS
Objective-C
336
star
57

DEPRECATED-mapbox-ios-sdk

REPLACED โ€“ use https://www.mapbox.com/ios-sdk instead
Objective-C
325
star
58

mapbox-sdk-py

Python SDK for Mapbox APIs **DEVELOPMENT IS TEMPORARILY PAUSED, SEE CONTRIBUTING.md**
Python
319
star
59

potpack

A tiny rectangle packing JavaScript library (for sprite layouts)
JavaScript
314
star
60

mapbox-maps-ios

Interactive, thoroughly customizable maps for iOS powered by vector tiles and Metal
Swift
313
star
61

vector-tile-js

Parses vector tiles with JavaScript
JavaScript
308
star
62

node-mbtiles

mbtiles utility, renderer, and storage backend for tilelive
JavaScript
285
star
63

mapbox-maps-flutter

Interactive, thoroughly customizable maps for Flutter powered by Mapbox Maps SDK
Swift
282
star
64

mapbox-ar-unity

DEPRECATED! A place to create/learn with Unity, ARKit/ARCore, and Mapbox!
C#
279
star
65

geo-googledocs

Tools to integrate Mapbox with Google Docs
JavaScript
276
star
66

delatin

A fast JavaScript terrain mesh generation tool based on Delaunay triangulation
JavaScript
273
star
67

hubdb

a github-powered database
JavaScript
272
star
68

flutter-mapbox-gl

Moved to https://github.com/tobrun/flutter-mapbox-gl
Java
271
star
69

mapbox-gl-styles

Prebuilt Mapbox GL styles for use in Mapbox GL JS or the Mapbox Mobile SDKs and as a starting point for custom maps built with Mapbox Studio
JavaScript
268
star
70

simplestyle-spec

A simple styling convention for GeoJSON data
266
star
71

postgis-vt-util

postgres helper functions for making vector tiles
PLpgSQL
265
star
72

gzip-hpp

Gzip header-only C++ library
C++
265
star
73

protozero

Minimalist protocol buffer decoder and encoder in C++
C++
261
star
74

sphericalmercator

Spherical Mercator math in Javascript
JavaScript
259
star
75

shp-write

create and write to shapefiles in pure javascript
JavaScript
254
star
76

geojsonhint

IMPORTANT: development of this project has been paused, see the README (Validate GeoJSON against the specification)
JavaScript
253
star
77

mason

Cross platform package manager for C/C++ apps
Python
252
star
78

wellknown

GeoJSON-emitting WKT parser for browsers and node
JavaScript
249
star
79

Hecate

Fast Geospatial Feature Storage API
Rust
247
star
80

pyskel

Skeleton of a Python package
Python
243
star
81

mapbox-plugins-android

Mapbox Android Plugins are a collection of libraries that extend our other SDKs, helping you design powerful mapping features while the plugins handle most of the heavy lifting.
Java
240
star
82

mapbox-gl-directions

Directions plugin for mapbox-gl-js using Mapbox Directions API.
JavaScript
236
star
83

tilejson-spec

JSON format for describing map tilesets.
234
star
84

mapping

OpenStreetMap contributions from the data team at Mapbox
JavaScript
233
star
85

tilebelt

simple tile utilities
JavaScript
230
star
86

geojson-merge

Merge multiple GeoJSON files into one FeatureCollection.
JavaScript
229
star
87

mapbox-arkit-ios

Utilities for combining Mapbox maps and location services with ARKit in your applications.
Swift
224
star
88

mapbox-scenekit

Swift
224
star
89

react-native-mapbox-ar

Location based augmented reality components using React Native, Viro and Mapbox
Objective-C
221
star
90

mapbox-gl-native-android

Interactive, thoroughly customizable maps in native Android powered by vector tiles and OpenGL
Java
211
star
91

mapbox-gl-native-ios

Interactive, thoroughly customizable maps for iOS powered by vector tiles and OpenGL
Objective-C++
211
star
92

Simple-KML

Simple KML is a simple & lightweight parsing library for KML written in Objective-C for the iOS platform.
Objective-C
208
star
93

turf-swift

A Swift language port of Turf.js.
Swift
205
star
94

react-colorpickr

A themeable colorpicker with HSL and RGB support for React
TypeScript
205
star
95

eternal

A C++14 compile-time/constexpr map and hash map with minimal binary footprint
C++
201
star
96

node-fontnik

Fonts โ‡ข protobuf-encoded SDF glyphs
JavaScript
201
star
97

ecs-watchbot

Make robots do your work for you
JavaScript
194
star
98

leaflet-pip

point in polygon intersections for leaflet
JavaScript
194
star
99

tile-cover

Generate the minimum number of tiles to cover a geojson geometry
JavaScript
183
star
100

MapboxStatic.swift

Static map snapshots with overlays in Swift or Objective-C on iOS, macOS, tvOS, and watchOS
Swift
183
star