• Stars: 163
• Rank: 229,656 (Top 5 %)
• Language: Python
• License: Apache License 2.0
• Created almost 6 years ago
• Updated 5 days ago

Repository Details

Batch generation from xarray datasets

xbatcher: Batch Generation from Xarray Datasets

Xbatcher is a small library for iterating Xarray DataArrays and Datasets in batches. The goal is to make it easy to feed Xarray objects to machine learning libraries such as PyTorch or TensorFlow. View the docs for more info.

Installation

Xbatcher can be installed from PyPI as:

python -m pip install xbatcher

Or via Conda as:

conda install -c conda-forge xbatcher

Or from source as:

python -m pip install git+https://github.com/xarray-contrib/xbatcher.git

Note

The required dependencies installed with Xbatcher are Xarray, Dask, and NumPy. You will need to separately install TensorFlow or PyTorch to use those data loaders or Xarray accessors. Review the installation instructions for more details.

Documentation

Documentation is hosted on ReadTheDocs: https://xbatcher.readthedocs.org

License

Apache License 2.0, see LICENSE file.

Acknowledgements

This work was funded in part by:

NASA ACCESS19-0049: Pangeo ML: Open Source Tools and Pipelines for Scalable Machine Learning Using NASA Earth Observation Data

This work was motivated by many conversations in the Pangeo community and the Pangeo ML working group.

More Repositories

1. xskillscore: Metrics for verifying forecasts (Python, 222 stars)
2. xarray-tutorial: Xarray Tutorials (Jupyter Notebook, 172 stars)
3. datatree: WIP implementation of a tree-like hierarchical data structure for xarray (Python, 169 stars)
4. xarray_leaflet: An xarray extension for tiled map plotting (Python, 161 stars)
5. cf-xarray: An accessor for xarray objects that interprets CF attributes (Python, 155 stars)
6. flox: Fast & furious GroupBy operations for dask.array (Python, 123 stars)
7. xpublish: Publish Xarray Datasets via a REST API (Python, 116 stars)
8. pint-xarray: Interface for using pint with xarray, providing convenience accessors (Python, 101 stars)
9. xeofs: Comprehensive EOF analysis in Python with xarray: A versatile, multidimensional, and scalable tool for advanced climate data analysis (Python, 98 stars)
10. xvec: Vector data cubes for Xarray (Python, 93 stars)
11. xarray-simlab: Xarray extension and framework for computer model simulations (Python, 73 stars)
12. cupy-xarray: Interface for using cupy in xarray, providing convenience accessors (Python, 65 stars)
13. xwrf: A lightweight interface for working with the Weather Research and Forecasting (WRF) model output in Xarray (Python, 58 stars)
14. xarray-regrid: Regridding utility for xarray (Python, 58 stars)
15. xoak: Xarray extension that provides tree-based indexes used for selecting irregular, n-dimensional data (Python, 57 stars)
16. xdggs: Xarray extension for DGGS (Python, 54 stars)
17. xarray-schema: Schema validation for Xarray objects (Python, 39 stars)
18. sphinx-autosummary-accessors: Sphinx extension to document pandas and xarray accessors (Python, 13 stars)
19. xarray.dev: The Xarray landing page (JavaScript, 12 stars)
20. cubed-xarray: Interface for using cubed with xarray (Python, 11 stars)
21. issue-from-pytest-log: Create issues from pytest-reportlog files (Python, 10 stars)
22. xarray-contrib: Central repository for xarray-contrib organization (9 stars)
23. xncml: Tools for manipulating NcML (NetCDF Markup Language) files with/for xarray (Python, 7 stars)
24. ci-trigger: A GitHub action to detect trigger keywords in the summary line of commit messages (Shell, 3 stars)
25. xwrf-data: Data repository for xwrf documentation, tutorials, testing (Python, 2 stars)
26. xarray-array-testing: Testing framework for testing duck array compatibility with xarray (Python, 1 star)