documentcloud/cloud-crowd

Stars: 851
Rank: 53,558 (Top 2 %)
Language: Ruby
License: MIT License
Created about 15 years ago
Updated almost 2 years ago

Parallel Processing for the Rest of Us

=                                                                               
           _  _                                                                
          ( `   )_                                                             
         (    )    `)                                                          
       (_   (_ .  _) _)                                                        
                                      _                                        
                                     (  )                                      
      _ .                         ( `  ) . )                                   
    (  _ )_                      (_, _(  ,_)_)                                 
  (_  _(_ ,)                                                                   
                                                                               
           _  _               ___ _             _  ___                   _     
          ( `   )_           / __| |___ _  _ __| |/ __|_ _ _____ __ ____| |    
         (    )    `)       | (__| / _ \ || / _` | (__| '_/ _ \ V  V / _` |    
       (_   (_ .  _) _)      \___|_\___/\_,_\__,_|\___|_| \___/\_/\_/\__,_|    
                                                                               
                                                     _                         
                                                    (  )                       
                  _, _ .                         ( `  ) . )                    
                 ( (  _ )_                      (_, _(  ,_)_)                  
               (_(_  _(_ ,)                                                    
                                                                               
                                                                               
                                                                               
  ~ CloudCrowd ~

    * Parallel processing for the rest of us
    * Write your scripts in Ruby
    * Works with Amazon EC2 and S3
    * split -> process -> merge
    * As easy as `gem install cloud-crowd`

    Well-suited for:
    
    * Generating or resizing images.
    * Encoding video.
    * Running text extraction or OCR on PDFs.
    * Migrating a large file set or database.
    * Web scraping.
    
    
  ~ Documentation ~
  
    Wiki: https://github.com/documentcloud/cloud-crowd/wiki
    Rdoc: http://www.rubydoc.info/github/documentcloud/cloud-crowd
  
  
  ~ Getting started ~
  
    # Install the gem.
    
      >> sudo gem install cloud-crowd
    
    # Install the CloudCrowd configuration files to a location of your choosing.
    
      >> crowd install ~/config/cloud-crowd
    
    # Now, you can use the full complement of `crowd` commands from inside of
    # this configuration directory. To see the available commands:
    
      >> crowd --help
    
    # Edit the configuration files to your satisfaction, add AWS credentials, 
    # and then load the CloudCrowd schema into your configured database.
    
      >> cd ~/config/cloud-crowd
      >> mate config.yml
      >> mate database.yml
      >> [create the database you just configured...]
      >> crowd load_schema
    
    # Write your actions, and install them into the 'actions' subdirectory.
    # CloudCrowd comes with a few default actions as an example.
    
    # To launch the central server (make sure that you include its location
    # in config.yml):
    
      >> crowd server
    
    # The configuration folder also includes 'config.ru', which can be used by
    # any Rack-compliant webserver to run your central server.
    
    # Then, to launch a node of workers:
    
      >> crowd node
    
    # To spin up remote nodes, install the 'cloud-crowd' gem and copy over
    # your configuration directory. Run `crowd node`, and the remote machines
    # will register with the central server, becoming available for processing.
    
    # At this point you can visit your Operations Center at localhost:9173 to 
    # view all of your nodes, ready for action.
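
The split -> process -> merge flow mentioned above can be sketched in plain Ruby. This is only an illustration of the pattern, not CloudCrowd's action API: the class name WordCountSketch, its method signatures, and the use of local threads in place of worker nodes are all assumptions made for the example. For the real base class and helpers, see the wiki linked under Documentation.

```ruby
# Plain-Ruby illustration of split -> process -> merge.
# Nothing here uses the cloud-crowd gem; WordCountSketch and its
# method names are hypothetical.
class WordCountSketch
  def initialize(text, chunk_size = 2)
    @text = text
    @chunk_size = chunk_size
  end

  # split: break the input into independent work units (groups of lines).
  def split
    @text.lines.each_slice(@chunk_size).map(&:join)
  end

  # process: count the words in a single work unit. Runs once per unit.
  def process(chunk)
    chunk.split.each_with_object(Hash.new(0)) do |word, counts|
      counts[word.downcase] += 1
    end
  end

  # merge: combine the per-unit counts into one final result.
  def merge(partials)
    partials.each_with_object(Hash.new(0)) do |partial, total|
      partial.each { |word, n| total[word] += n }
    end
  end

  # Local threads stand in for the worker nodes in this sketch.
  def run
    partials = split.map { |chunk| Thread.new { process(chunk) } }.map(&:value)
    merge(partials)
  end
end

text = "the cloud\nthe crowd\nThe cloud crowd\n"
counts = WordCountSketch.new(text).run
puts counts.inspect
```

Each work unit is processed without knowledge of the others, which is what lets the middle step be farmed out to many machines; only split and merge see the whole job.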

visualsearch

A Rich Search Box for Real Data

jammit

Industrial Strength Asset Packaging for Rails

docsplit

Break Apart Documents into Images, Text, Pages and PDFs

underscore-contrib

The brass buckles on Underscore's utility belt

documentcloud

The DocumentCloud platform

pixel-ping

A Minimalist Pixel Tracker for Node.js

closure-compiler

A Ruby Wrapper for the Google Closure Compiler

pdfshaver

Shave pages off of PDFs as images

pdfium

A mirror of PDFium (primary source https://pdfium.googlesource.com/pdfium/ )

documentcloud-pages

Responsively embed DocumentCloud pages.

documentcloud-notes

Responsively embed DocumentCloud notes.

pdftailor

Stitch and Unstitch PDFs.

documentcloud-vagrant

Vagrant scripts for a DocumentCloud development environment.

dc-search-embed

DocumentCloud's SearchEmbed Backbone application.

help_center

Help documentation for DocumentCloud.

documentcloud.github.com

Redirect for DocumentCloud Open Source

pdfium-deb

Configuration and scripts for building a Debian package for PDFium

crowd-spotter

Statistics on CloudCrowd performance