Running Oozie and HIHO

In this post, we will see how we can create HIHO workflows with Oozie. We ran Oozie on a Ubuntu environment with Java 1.6. Our cluster was running Apache Hadoop-0.20.2. Installing oozie-3.0.0 Oozie has an optional requirement on the ExtJS library to enable Oozie webconsole 1. Download oozie from https://github.com/downloads/yahoo/oozie/oozie-3.0.0-distro.tar.gz Download ExtJS 2.2 library from [...]

read more

Accessing HDFS over FTP

The Hadoop Distributed File System provides different interfaces so that clients can interact with it. Besides the HDFS shell, the file system exposes itself through WebDAV, Thrift, FTP and FUSE. In this post, we access HDFS over FTP. We have used Hadoop 0.20.2. 1. Download the hdfs-over-ftp tar from https://issues.apache.org/jira/secure/attachment/12409518/hdfs-over-ftp-0.20.0.tar.gz 2. Untar hdfs-over-ftp-0.20.0.tar.gz. 3. We [...]

read more

Getting started with Puppet

Puppet is an automated administrative engine for Unix-like systems to distribute and manage configurations of all clients using centralized specification from a master machine. Based on client-server model, It helps to perform administrative tasks such as adding users, installing packages, and updating server configurations on all clients machine simultaneously. This is a great way to [...]

read more