Parallel computing with task scheduling
Python Stream Processing
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Run MapReduce jobs on Hadoop or Amazon Web Services
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Run Python in Apache Storm topologies. Pythonic API, CLI tooling, and a topology DSL.