• Stars
    star
    104
  • Rank 328,651 (Top 7 %)
  • Language
  • Created over 10 years ago
  • Updated over 8 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A virtual Hadoop cluster running CDH5

A working virtual Hadoop cluster

With these files you can setup and provision a locally running, virtual Hadoop cluster in real distributed fashion for trying out Hadoop and related technologies. It runs the latest Cloudera Hadoop distribution: CDH5. It also allows you to practise the use of Cloudera Manager for installing the Hadoop stack. If you're looking for a fully automated install, without user intervention, look elsewhere. I specifically made this with the goal of creating an environment ideally suited for Cloudera Manager to do its job. This gives you the freedom to actually install the services you want, and change the configuration how you see fit.

This README describes how to get the cluster with Cloudera Manager up and running. For more detailed instructions on how to install the whole Hadoop stack on that, you can use this guide.

Specs

The cluster conists of 4 nodes:

  • Master node with 4GB of RAM (Running the NameNode, Hue, ResourceManager etc. after installing the Hadoop services)
  • 3 slaves with 2GB of RAM each (Running DataNodes)

As you can see, you'll need at least 10GB of free RAM to run this. If you have less, you can try to remove one machine from the Vagrantfile. This will lead to worse performance though!

Usage

Depending on the hardware of your computer, installation will probably take between 15 and 25 minutes.

First install VirtualBox and Vagrant.

Install the Vagrant Hostmanager plugin

$ vagrant plugin install vagrant-hostmanager

Clone this repository.

$ git clone https://github.com/DandyDev/virtual-hadoop-cluster.git

Provision the bare cluster. It will ask you to enter your password, so it can modify your /etc/hosts file for easy access in your browser. It uses the Vagrant Hostmanager plugin to do this.

$ cd virtual-hadoop-cluster
$ vagrant up

Go to the Cloudera Manager web console and follow the installation instructions. For more detailed instructions on how to do that, you can use this guide.

Done! Have fun with your Hadoop cluster.

More Repositories

1

slack-machine

A simple, yet powerful and extendable Slack bot
Python
752
star
2

pelican-bootstrap3

Bootstrap 3 theme for Pelican
331
star
3

graphite-stack-ansible-vagrant

Provision a complete Graphite, StatsD & Grafana install using Ansible and (optionally) Vagrant
Shell
63
star
4

statserver

This is a complete statistics & monitoring solution provisioned by Ansible
53
star
5

sentry-ansible-vagrant

Provision a complete Sentry install using Ansible and (optionally) Vagrant
Shell
47
star
6

gapi-php

Updated Gapi class for Google Analytics
PHP
21
star
7

flume-plugins

Some extensions to Flume to help with collecting logs and storing as Avro.
Java
17
star
8

flood

Python search APIs to various Torrent trackers
Python
11
star
9

dandydev.net

My personal site
HTML
10
star
10

dotfiles

My personal dotfiles
Shell
10
star
11

sentry-vagrant

Vagrant/Puppet provisioning for Sentry
Ruby
9
star
12

daan.fyi

MDX
8
star
13

secure-dev-proxy

Setting Up a Secure Development Proxy
Shell
7
star
14

fabdocker

Docker deployments with Fabric
Python
5
star
15

ploy

Go
2
star
16

pythonserver-ansible

Ansible Playbook to setup a Python server
1
star
17

sgannotate

JavaScript
1
star
18

gitsentry

Watch git repositories for changes to specific paths and get notified on Slack
Python
1
star
19

autoplaylistpoetry

Generate Spotify playlists from a piece of text
Python
1
star
20

JackAMovie

JackAMovie
CSS
1
star
21

pyalgorithms

Basic algorithms in Python
Python
1
star
22

RegDomain

Extract domain/hostname from URL in PHP
PHP
1
star
23

flybot

The MakeItFly Slack bot
CoffeeScript
1
star
24

docker-presentation

JavaScript
1
star
25

makeitfly_hubot

CoffeeScript
1
star
26

excel2note

Excel2Note creates Evernote notes from an Excel sheet
Python
1
star