  • Language: Ruby
  • License: MIT License

Database backed asynchronous priority queue -- Extracted from Shopify

Delayed::Job

Delayed_job (or DJ) encapsulates the common pattern of asynchronously executing longer tasks in the background.

It is a direct extraction from Shopify where the job table is responsible for a multitude of core tasks. Amongst those tasks are:

  • sending massive newsletters
  • image resizing
  • http downloads
  • updating smart collections
  • updating solr, our search server, after product changes
  • batch imports
  • spam checks

Setup

The library revolves around a delayed_jobs table, which can be created by using:


  script/generate delayed_job

The created table looks as follows:


  create_table :delayed_jobs, :force => true do |table|
    table.integer  :priority, :default => 0      # Allows some jobs to jump to the front of the queue
    table.integer  :attempts, :default => 0      # Provides for retries, but still fail eventually.
    table.text     :handler                      # YAML-encoded string of the object that will do work
    table.string   :last_error                   # reason for last failure (See Note below)
    table.datetime :run_at                       # When to run. Could be Time.now for immediately, or sometime in the future.
    table.datetime :locked_at                    # Set when a client is working on this object
    table.datetime :failed_at                    # Set when all retries have failed (actually, by default, the record is deleted instead)
    table.string   :locked_by                    # Who is working on this object (if locked)
    table.timestamps
  end

On failure, the job is scheduled to run again after 5 + N ** 4 seconds, where N is the number of attempts made so far.

The default MAX_ATTEMPTS is 25. After this, the job is either deleted (the default) or left in the database with “failed_at” set.
With the default of 25 attempts, the last retry comes about 20 days after the first failure, and the last interval is almost 100 hours.
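
The figures above can be checked with a few lines of Ruby. This is only a sketch: it assumes that a retry is scheduled after each of the first 24 failures (the 25th failure being final) and that N is the attempt count at the time of rescheduling. The names retry_delay and max_attempts are illustrative and are not part of the library.

```ruby
# Delay in seconds before the next retry, following the formula above:
# 5 seconds + N ** 4, where N is the number of attempts made so far.
def retry_delay(attempts)
  5 + attempts**4
end

max_attempts = 25 # mirrors the default MAX_ATTEMPTS

# Assumption: a retry is scheduled after each of the first 24 failures.
delays = (1...max_attempts).map { |n| retry_delay(n) }

total_days = delays.sum / 86_400.0
last_hours = delays.last / 3_600.0

puts format('total wait: %.1f days', total_days)
puts format('last interval: %.1f hours', last_hours)
```

Under these assumptions the total wait comes to roughly 20.4 days and the final interval to roughly 92 hours, which matches the numbers quoted above.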

The default MAX_RUN_TIME is 4.hours. If your job takes longer than that, another computer could pick it up. It’s up to you to
make sure your job doesn’t exceed this time. You should set this to the longest time you think the job could take.
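
To illustrate why a long-running job can be picked up by another worker, here is a minimal sketch of a lock-expiry check. It is an assumption about how such a check might look, not the library's actual query: JobRow and available? are hypothetical names, and the real library performs this selection in SQL against the delayed_jobs table.

```ruby
MAX_RUN_TIME = 4 * 60 * 60 # seconds; stands in for the default of 4.hours

# Hypothetical in-memory stand-in for a row of the delayed_jobs table.
JobRow = Struct.new(:run_at, :locked_at, :locked_by)

# A job can be picked up when it is due and is either unlocked, locked by
# this same worker, or has been locked for longer than MAX_RUN_TIME.
def available?(job, worker_name, now = Time.now)
  return false if job.run_at > now
  return true if job.locked_at.nil?
  return true if job.locked_by == worker_name

  job.locked_at < now - MAX_RUN_TIME
end

now = Time.now
stale = JobRow.new(now - 60, now - 5 * 60 * 60, 'worker-0') # locked 5 hours ago
fresh = JobRow.new(now - 60, now - 60, 'worker-0')          # locked 1 minute ago

puts available?(stale, 'worker-1', now)
puts available?(fresh, 'worker-1', now)
```

In this sketch the stale lock no longer protects the job, so a second worker would start it even if the first worker is still running it. That is why MAX_RUN_TIME should be set to the longest time a job could take.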

By default, it will delete failed jobs (and it always deletes successful jobs). If you want to keep failed jobs, set
Delayed::Job.destroy_failed_jobs = false. The failed jobs will be marked with non-null failed_at.

Here is an example of changing job parameters in Rails:


  # config/initializers/delayed_job_config.rb
  Delayed::Job.destroy_failed_jobs = false
  silence_warnings do
    Delayed::Job.const_set("MAX_ATTEMPTS", 3)
    Delayed::Job.const_set("MAX_RUN_TIME", 5.minutes)
  end

Note: If your error messages are long, consider changing the last_error column to :text instead of :string (which has a 255 character limit).

Usage

Jobs are simple Ruby objects with a method called perform. Any object which responds to perform can be stuffed into the jobs table.
Job objects are serialized to YAML so that they can later be resurrected by the job runner.


  class NewsletterJob < Struct.new(:text, :emails)
    def perform
      emails.each { |e| NewsletterMailer.deliver_text_to_email(text, e) }
    end    
  end  
  
  Delayed::Job.enqueue NewsletterJob.new('lorem ipsum...', Customers.find(:all).collect(&:email))
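
The YAML round trip described above can be tried outside Rails. The following is a sketch under stated assumptions: EchoJob is a hypothetical job that returns strings instead of sending mail, and the dump and load calls only stand in for what the library does with the handler column.

```ruby
require 'yaml'

# Hypothetical job that records its work instead of sending mail.
class EchoJob
  attr_reader :text, :recipients

  def initialize(text, recipients)
    @text = text
    @recipients = recipients
  end

  def perform
    recipients.map { |r| "#{text} -> #{r}" }
  end
end

# Serialize the job, much as the handler column would store it.
handler = YAML.dump(EchoJob.new('hello', ['a@example.com', 'b@example.com']))

# Newer Psych versions refuse arbitrary classes in YAML.load, so use
# unsafe_load where it exists. Only do this with YAML you trust.
loader = YAML.respond_to?(:unsafe_load) ? :unsafe_load : :load
restored = YAML.public_send(loader, handler)

puts restored.perform
```

Because the whole object is stored, it is usually wise to keep job objects small and to pass identifiers or plain values rather than large object graphs.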

There is also a second way to get jobs in the queue: send_later.


  BatchImporter.new(Shop.find(1)).send_later(:import_massive_csv, massive_csv)

This will simply create a Delayed::PerformableMethod job in the jobs table, which serializes all the parameters you pass to it. ActiveRecord objects get special treatment:
they are stored as their text representation and loaded fresh from the database when the job is actually run later.
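
The idea behind a performable-method job can be sketched in plain Ruby. This is a simplified, hypothetical stand-in and not the library's implementation: DeferredCall, enqueue_later and QUEUE are illustrative names, the queue lives in memory instead of the delayed_jobs table, and the special handling of ActiveRecord objects is left out.

```ruby
# Simplified, hypothetical stand-in for a performable-method job: it
# remembers a receiver, a method name and arguments, and calls them later.
DeferredCall = Struct.new(:object, :method_name, :args) do
  def perform
    object.public_send(method_name, *args)
  end
end

# Illustrative in-memory queue; the real library writes to delayed_jobs.
QUEUE = []

def enqueue_later(object, method_name, *args)
  QUEUE << DeferredCall.new(object, method_name, args)
end

enqueue_later('lorem ipsum', :upcase)
enqueue_later([3, 1, 2], :sort)

# A worker would take jobs off the queue and call perform on each.
results = QUEUE.map(&:perform)
p results
```

The wrapper responds to perform like any other job, which is why send_later needs no separate execution path in the worker.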

Running the jobs

You can invoke rake jobs:work which will start working off jobs. You can cancel the rake task with CTRL-C.

You can also run jobs by writing a simple script/job_runner and invoking it externally:


  #!/usr/bin/env ruby
  require File.dirname(__FILE__) + '/../config/environment'
  
  Delayed::Worker.new.start  

Workers can run on any computer, as long as they have access to the database and their clocks are in sync. You can even
run multiple workers per computer, but you must give each one a unique name:


  3.times do |n|
    worker = Delayed::Worker.new
    worker.name = 'worker-' + n.to_s
    worker.start
  end	

Keep in mind that each worker will check the database at least every 5 seconds.

Note: The rake task will exit if the database has any network connectivity problems.

Cleaning up

You can invoke rake jobs:clear to delete all jobs in the queue.

Changes

  • 1.7.0: Added failed_at column which can optionally be set after a certain number of failed job attempts. By default, failed jobs are destroyed after about a month.
  • 1.6.0: Renamed locked_until to locked_at. We now store when we start a given job instead of how long it will be locked by the worker. This allows us to get a reading on how long a job took to execute.
  • 1.5.0: Job runners can now be run in parallel. Two new database columns are needed: locked_until and locked_by. This allows us to use pessimistic locking instead of relying on row level locks. This enables us to run as many worker processes as we need to speed up queue processing.
  • 1.2.0: Added #send_later to Object for simpler job creation
  • 1.0.0: Initial release
