• Stars
    star
    397
  • Rank 105,003 (Top 3 %)
  • Language
    Python
  • License
    MIT License
  • Created almost 15 years ago
  • Updated 11 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Parallel programming with Python

Jug: A Task-Based Parallelization Framework

Jug allows you to write code that is broken up into tasks and run different tasks on different processors.

https://static.pepy.tech/personalized-badge/jug?period=total&units=international_system&left_color=black&right_color=blue&left_text=Downloads

It uses the filesystem to communicate between processes and works correctly over NFS, so you can coordinate processes on different machines.

Jug is a pure Python implementation and should work on any platform.

Python versions 3.5 and above are supported.

Website: http://luispedro.org/software/jug

Documentation: https://jug.readthedocs.org/

Video: On vimeo or showmedo

Mailing List: https://groups.google.com/group/jug-users

Testimonials

"I've been using jug with great success to distribute the running of a reasonably large set of parameter combinations" - Andreas Longva

Install

You can install Jug with pip:

pip install Jug

or use, if you are using conda, you can install jug from conda-forge using the following commands:

conda config --add channels conda-forge
conda install jug

Citation

If you use Jug to generate results for a scientific publication, please cite

Coelho, L.P., (2017). Jug: Software for Parallel Reproducible Computation in Python. Journal of Open Research Software. 5(1), p.30.

https://doi.org/10.5334/jors.161

Short Example

Here is a one minute example. Save the following to a file called primes.py (if you have installed jug, you can obtain a slightly longer version of this example by running jug demo on the command line):

from jug import TaskGenerator
from time import sleep

@TaskGenerator
def is_prime(n):
    sleep(1.)
    for j in range(2,n-1):
        if (n % j) == 0:
            return False
    return True

primes100 = [is_prime(n) for n in range(2,101)]

This is a brute-force way to find all the prime numbers up to 100. Of course, this is only for didactical purposes, normally you would use a better method. Similarly, the sleep function is so that it does not run too fast. Still, it illustrates the basic functionality of Jug for embarassingly parallel problems.

Type jug status primes.py to get:

Task name                  Waiting       Ready    Finished     Running
----------------------------------------------------------------------
primes.is_prime                  0          99           0           0
......................................................................
Total:                           0          99           0           0

This tells you that you have 99 tasks called primes.is_prime ready to run. So run jug execute primes.py &. You can even run multiple instances in the background (if you have multiple cores, for example). After starting 4 instances and waiting a few seconds, you can check the status again (with jug status primes.py):

Task name                  Waiting       Ready    Finished     Running
----------------------------------------------------------------------
primes.is_prime                  0          63          32           4
......................................................................
Total:                           0          63          32           4

Now you have 32 tasks finished, 4 running, and 63 still ready. Eventually, they will all finish and you can inspect the results with jug shell primes.py. This will give you an ipython shell. The primes100 variable is available, but it is an ugly list of jug.Task objects. To get the actual value, you call the value function:

In [1]: primes100 = value(primes100)

In [2]: primes100[:10]
Out[2]: [True, True, False, True, False, True, False, False, False, True]

What's New

Version 2.2.3 (26 May 2023) - Fix jug shell for newer versions of IPython

Version 2.2.2 (19 July 2022) - Fix jug cleanup when packs are used (jug pack)

Version 2.2.1 (19 May 2022) - Fix bug with jug cleanup and the redis backend (#86)

Version 2.2.0 (3 May 2022)

  • Add jug pack subcommand
  • Make get_tasks() return a copy of the tasks inside jug shell
  • Remove six dependency

Version 2.1.1 (18 March 2021)

  • Include requirements files in distribution

Version 2.1.0 (18 March 2021)

  • Improvements to webstatus (by Robert Denham)
  • Removed Python 2.7 support
  • Fix output encoding for Python 3.8
  • Fix bug mixing mapreduce() & status --cache
  • Make block_access (used in mapreduce()) much faster (20x)
  • Fix important redis bug
  • More precise output in cleanup command

Version 2.0.2 (Thu Jun 11 2020)

  • Fix command line argument parsing

Version 2.0.1 (Thu Jun 11 2020)

  • Fix handling of JUG_EXIT_IF_FILE_EXISTS environmental variable
  • Fix passing an argument to jug.main() function
  • Extend --pdb to exceptions raised while importing the jugfile (issue #79)

version 2.0.0 (Fri Feb 21 2020)

  • jug.backend.base_store has 1 new method 'listlocks'
  • jug.backend.base_lock has 2 new methods 'fail' and 'is_failed'
  • Add 'jug execute --keep-failed' to preserve locks on failing tasks.
  • Add 'jug cleanup --failed-only' to remove locks from failed tasks
  • 'jug status' and 'jug graph' now display failed tasks
  • Check environmental exit variables by default (suggested by Renato Alves, issue #66)
  • Fix 'jug sleep-until' in the presence of barrier() (issue #71)

For older version see ChangeLog file or the full history.

More Repositories

1

BuildingMachineLearningSystemsWithPython

Source Code for the book Building Machine Learning Systems with Python
Python
2,109
star
2

mahotas

Computer Vision in Python
Python
811
star
3

milk

MILK: Machine Learning Toolkit
Python
610
star
4

diskhash

Diskbased (persistent) hashtable
C
143
star
5

django-gitcms

A git based cms for django
Python
71
star
6

imread

Read images to numpy arrays
C++
69
star
7

nixml

NIX + YAML for easy to use reproducible environments
Python
60
star
8

pymorph

Python Morphology Toolbox
Python
47
star
9

talk-python-intro

Introduction to Python (Jupyter based)
Jupyter Notebook
30
star
10

hex

Reimplementation of TeX in Haskell: pre-alpha
Haskell
30
star
11

python-image-tutorial

Python image tutorial (based on ipython notebooks)
Jupyter Notebook
29
star
12

milksets

Machine Learning Toolkit Datasets: A collection of UCI datasets with a Python interface
Python
26
star
13

Programming-for-Scientists

Source Material for a course on Programming targeted at scientists
TeX
25
star
14

pythonvision_org

django-gitcms files for http://pythonvision.org
CSS
17
star
15

mergedirs

Merge two directories without losing files
Python
16
star
16

luispedro_org

jekyll files for luispedro.org
JavaScript
12
star
17

Coelho2009_ISBI_NuclearSegmentation

Reproducible research archive for "Nuclear segmentation in microscope cell images"
TeX
9
star
18

Coelho2021_GMGCv1

Python
9
star
19

libertarian-welfare

Libertarian Welfare State: A Book I'm Writing
Python
7
star
20

dot-link

Dotted Suffix Trees
C++
7
star
21

HBC

My Version of Hal Daumé's Hierarchical Bayesian Compiler
Haskell
7
star
22

android-fuse

Mount an android device using FUSE
Python
5
star
23

pyslic

PySLIC
Python
5
star
24

irstlm

Fork of IRSTLM
C++
5
star
25

safeio

Haskell Library for safe (atomic) IO
Haskell
3
star
26

PenalizedRegression

Shell
3
star
27

vita

My Vita
TeX
3
star
28

gitpointer

Github suggest
Python
3
star
29

elgreco

Graphical Models
C++
3
star
30

mahotas-paper

Paper about mahotas
Python
3
star
31

base-user

dot files and the like so that I can set up a new computer with a couple of command line calls
TeX
3
star
32

readmagick

Read images in Python using ImageMagick++ : SUPERCEDED BY IMREAD
C++
3
star
33

Coelho2015_NetsDetermination

Reproducible code archive for "Automatic Determination of NET (Neutrophil Extracellular Traps) Coverage in Fluorescent Microscopy Images" by Coelho et al.
Python
3
star
34

website.content

Content for My Website
2
star
35

talk-scientific-communication

Talk on Scientific Oral Communication
HTML
2
star
36

safeout

Simple atomic writing for Python
Python
2
star
37

conduit-algorithms

Conduit based algorithms
Haskell
2
star
38

unpack

Unpack zip/7z/tar.* archives with a consistent interface
Python
2
star
39

programming

Programming: An Introduction. A introductory book on computer programming.
2
star
40

imcol

Image Collection Management
Python
2
star
41

waldo

Waldo Project
Python
2
star
42

beiraproject

Beira Project Stuff
Shell
2
star
43

jug-presentations

presentations about jug
TeX
2
star
44

gitwc

Plot number of chars, words, and lines across time in a repository
Python
2
star
45

Hanu

Utilities for numerics in Haskell
Haskell
2
star
46

jug-paper

A future paper about jug (written in the open)
TeX
1
star
47

fna2faa.rs

Rust
1
star
48

outsort

Generic sorting of large datasets (using temporary files as temporary space)
Haskell
1
star
49

imm-mirna

Shell
1
star
50

particles

Python
1
star
51

StructureFunctionOceanTutorial

Jupyter Notebook
1
star
52

ML-for-microbial-communities

Jupyter Notebook
1
star
53

refsweb

Web interface to refs
Python
1
star
54

rbit

rabbit mail
Python
1
star
55

tutorial-unit-testing

HTML
1
star
56

whim

What Have I Missed
CoffeeScript
1
star
57

alist

Append List in Haskell
Haskell
1
star
58

refs

Bibtex Reference Management Software
Python
1
star
59

vision

Vision and Image Processing Library for Python [superceded by mahotas—use that]
Python
1
star
60

tutorial-cluster-usage

JavaScript
1
star
61

cq

Code Quarterly
Haskell
1
star
62

mlsegment

C++
1
star
63

rabbit_blog

1
star
64

beiraproject_org

django-gitcms repo for beiraproject.org
Python
1
star
65

TestingNGLESS

Haskell
1
star
66

blog_luispedro_org

CSS
1
star
67

NGH

Next Generation Sequence Handling in Haskell
Haskell
1
star
68

redundant100

Haskell
1
star
69

image-difftool

Image diff for git on the command line
Python
1
star
70

binary-instances

Haskell instances of Data.Binary
Haskell
1
star
71

anscombe

Python
1
star
72

fautils

Haskell
1
star
73

metarabbit

Posts for metarabbit.wordpress.com
Python
1
star
74

fragile

Lightweight command line unit testing tool for nodejs (with coffescript support)
JavaScript
1
star