hadoop

Install Hadoop MapReduce Next Generation.

Module Stats

16,228 downloads

10,525 latest version

1.5 quality score

Support the Puppet Community by contributing to this module

You are welcome to contribute to this module by suggesting new features, currency updates, or fixes. Every contribution is valuable to help ensure that the module remains compatible with the latest Puppet versions and continues to meet community needs. Complete the following steps:

Review the module’s contribution guidelines and any licenses. Ensure that your planned contribution aligns with the author’s standards and any legal requirements.
Fork the repository on GitHub, make changes on a branch of your fork, and submit a pull request. The pull request must clearly document your proposed change.

For questions about updating the module, contact the module’s author.

Version information

released May 28th 2014

Start using this module

Installation method

Add this module to your Puppetfile:

mod 'viirya-hadoop', '0.0.18'

Learn more about managing modules with a Puppetfile

Add this module to your Bolt project:

bolt module add viirya-hadoop

Learn more about using this module with an existing project

Manually install this module globally with Puppet module tool:

puppet module install viirya-hadoop --version 0.0.18

Tags: hadoop, yarn

Documentation

viirya/hadoop — version 0.0.18 May 28th 2014

Puppet module for deploying Hadoop MapReduce Next Generation on cluster

This module deploys Hadoop MapReduce Next Generation on a cluster of machines. It is tested on Apache Hadoop 2.2.0 under puppet agent/master environment. It is based on bcarpio/hadoop 0.0.3.

Usage:

Install this module on your puppet master node by:

sudo puppet install viirya-hadoop.

In site.pp, define:

node 'your hadoop slave nodes' {
    include java
    include hadoop::cluster::slave
}

node 'your hadoop master node' {
    include java
    include hadoop::cluster::master
}

For pseudo-distributed mode of Hadoop, put following codes in a .pp file (e.g. hadoop.pp).

include java
include hadoop::cluster::pseudomode

Then,

sudo puppet apply hadoop.pp

Hadoop Distribution:

This puppet module will automatically download Apache Hadoop distribution from one pre-defined Apache Mirror site. If you are like to use faster mirror site, please modify the URL in init.pp.

Parameters:

Some parameters are able to modify in params.pp. You should modify the parameters such as 'master', 'resourcemanager', 'dfs_slaves' and 'yarn_slaves' to reflect your hadoop cluster settings.

SSH keys:

Remember to generate your ssh keys and put the keys in files/ssh/.

Note: Since master deployment will run hadoop scripts to launch hadoop services on slaves nodes, please deploy hadoop slaves first. When all slaves are deployed, then deploy master node.

Kerberos:

Since 0.0.12 version, this puppet module supports Kerberos-secured Hadoop cluster. Kerberos should be installed first on your cluster. To generate necessary principles and keytab files for Hadoop serviers:

require hadoop::cluster::kerberos

Copyright (C) 2013 Liang-Chi Hsieh

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

hadoop

Contributions Requested

Support the Puppet Community by contributing to this module

Version information

Start using this module

Documentation

Dependencies