There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Brandyn White <[email protected]> Andrew Miller <[email protected]> Source https://github.com/bwhite/hadoopy/ Issues https://github.com/bwhite/hadoopy/issues Docs http://bwhite.github.com/hadoopy/ IRC: #hadoopy @ freenode.net Requirements python development headers (python-dev), build tools (build-essential) Optional cython (>=.13) (without this it falls back to the pregenerated .c files) Features - oozie support - Automated job parallelization 'auto-oozie' available in the hadoopy_flow project (maintained out of branch) - typedbytes support (very fast) - Local execution of unmodified MapReduce job with launch_local - Read/write sequence files of TypedBytes directly to HDFS from python (readtb, writetb) - Works on OS X - Allows printing to stdout and stderr in Hadoop tasks without causing problems (uses the 'pipe hopping' technique, both are available in the task's stderr) - critical path is in Cython - works on clusters without any extra installation, Python, or any Python libraries (uses Pyinstaller that is included in this source tree) - Simple HDFS access (readtb and ls) inside Python, even inside running jobs - Unit test interface - Reporting using status and counters (and print statements! no need to be scared of them in Hadoopy) - Supports design patterns in the Lin/Dyer book (http://www.umiacs.umd.edu/~jimmylin/book.html) Limitations - Hadoop Local currently unsupported due to a bug in Hadoop's handling of the distributed cache in this mode. Use psuedo-distributed instead for now. (#40) Used in - A Case for Query by Image and Text Content: Searching Computer Help using Screenshots and Keywords (to appear in WWW'11) - Web-Scale Computer Vision using MapReduce for Multimedia Data Mining (at KDD'10) - Vitrieve: Visual Search engine - Picarus: Hadoop computer vision toolbox Ubuntu Install (others are similar) sudo apt-get install python-dev build-essential sudo python setup.py install
hadoop_vision
Example code for "Web-Scale Computer Vision using MapReduce for Multimedia Data Mining"picarus
Computer vision in the cloud: CV + ML + Hadoop + HBase + REST.imfeat
Image Feature Descriptorsclassipy
A collection of classifiers with a standardized interface. Has a HTTP server interface that allows any language to access.dfs
This is a distributed FUSE filesystem I wrote for a class. It supports capability based authentication, public key handshake, symmetric session encryption, extent server, and log server (similar to GFS)kinectfs
ZeroMQ based project for using Pub-Sub for the Kinect. Dumps can be mounted and accessed using a FUSE filesystem.vision_data
distpy
Python distance metricsimage_server
Simple image server for visualization on headless boxesfpga-image-registration
hadoopy_hbase
Library that adds hbase support to Hadoopyimseg
.emacs.d
My emacs stuffhadoopy_tutorial
hadoop_clustering
opennpy
OpenNI python wrapper with a libfreenect-esque interfacecrawlers
vision_results
Collection of simple result visualizations for vision tasks, readily hackable for your own use.image_search
hadoop_log
Hadoop Jobtracker webserver scraperjewel-thief
dv_tp_integration
dv_bench
pyram
Python parameter selection libraryhadoopy_helper
Useful tools that compliment hadoopytexas_pete
hadoopy-goodies
Extra tools and helper scripts using the Hadoopy libraryimpoint
keyframe
openeyes
hadoopy_flow
Hadoopy monkey patch library to added parallel job execution automaticallyhadoopy-picnic
Hadoopy-based collage maker (still under development, watch this space over the next week)viderator
graphical_models
Binary CRF experimentswhite-knight
A background subtraction, tracking, and classification program written in C++ and Python. It was bothering me that good code was going stale in my backups for about a year, so I decided to work on it in my spare time.hadoopy_utils
dv_hadoop_tests
annotation
Python tools to annotate imagespython_examples
Example python taskspywxopengl-fun
kernels
pythonrc
vidfeat
interactive_learning
mturk_vision
Mechanical turk vision scriptscamera_geometry
opencv-examples
puppet_config
Puppet config for Hadoop and Picaruspython_templates
Boiler plate python headers, etcpkgtest
A basic distutils package that shows a possible way to use ctypes modulesrest-examples
REST client/server examplescoq-confuse
picarus_takeout
upload_server
Simple server to allow uploading files into a local directorypatch_classifier
.ipython
pycassa_server
HTML display server for Cassandra using pycassameme_hunter
nn_bench
gmaps_annotations
project_status
filter_fun
Workspace for image filter designhadoopy_rt
data_sources
Common interface for column oriented read-only data sources (used in a few of my projects)Love Open Source and this site? Check out how you can help us