Docker containers for building Hadoop Clusters
This repository hosts the code for building Docker containers for Hadoop services. With it, you can build Docker images for any Hadoop version and assemble Hadoop clusters from those images.
The repository also includes the code for running a single-node Hadoop setup. With some additional configuration, multi-node Hadoop clusters can be set up as well, but that is covered in a separate project, hadoop-on-k8s.
- Check out the code.
- Update the Hadoop version number in the Dockerfile; the current version is 3.1.2. A hypothetical excerpt is shown below.
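For illustration, the version line in the Dockerfile might look like the following hypothetical excerpt; the actual variable name and download step in this repository may differ.
# Change this value to build an image for a different Hadoop release
ENV HADOOP_VERSION=3.1.2
# Download the matching release from the Apache archive and unpack it
RUN curl -fsSL https://archive.apache.org/dist/hadoop/common/hadoop-${HADOOP_VERSION}/hadoop-${HADOOP_VERSION}.tar.gz | tar -xz -C /opt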
- Execute the following docker build command to build the Docker image for Hadoop:
docker build . -t the-docker-image-name:version
For example:
docker build . -t eiswar/hadoop:3.1.2
- After building the image, update the image name in the docker-compose file (a sketch follows the volume-mount step below).
- Create directories on the Docker host to store the HDFS NameNode and DataNode data.
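For example, with illustrative host paths (any location with enough disk space works):
mkdir -p /data/hdfs/namenode /data/hdfs/datanode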
- Update the volume mounts in the docker-compose file to point at those directories.
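The relevant parts of the compose file might then look like this sketch; the service name and container paths are assumptions, so match them to the actual docker-compose file in this repository:
services:
  hadoop:
    image: eiswar/hadoop:3.1.2                 # the image built above
    volumes:
      - /data/hdfs/namenode:/hadoop/dfs/name   # NameNode data from the previous step
      - /data/hdfs/datanode:/hadoop/dfs/data   # DataNode data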
- Start the Hadoop services with the following docker-compose command:
docker-compose up
Once docker-compose finishes, the single-node Hadoop cluster will be up and running on Docker. It can be accessed using the IP address specified in the docker-compose file.
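To verify that the services came up, something like the following can be used; the container name here is illustrative:
docker-compose ps
docker exec -it hadoop hdfs dfsadmin -report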
After updating the /etc/hosts file with the IP address and hostname of the Hadoop node, the ResourceManager UI and the NameNode file browser UI can be accessed from a browser.
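For example, assuming the compose file assigns the address 172.18.0.2 and the hostname hadoop-master (both illustrative), the /etc/hosts entry and the default Hadoop 3.x web UI ports would be:
172.18.0.2   hadoop-master
http://hadoop-master:8088    # YARN ResourceManager UI
http://hadoop-master:9870    # HDFS NameNode web UI and file browser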
- Automatic resource allocation for MapReduce applications.
- Automatic setup of the Fair Scheduler (see the sketch after this list).
- Easy integration with Kubernetes: options for establishing SSH trust between the Hadoop master and worker pods in a Kubernetes environment are included.
- Enabling HDFS Federation
- Enabling NameNode High Availability
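For the Fair Scheduler item above, a minimal sketch of the relevant yarn-site.xml setting, assuming the startup scripts configure it this way (the actual scripts in this repository may differ):
<!-- Switch YARN from the default Capacity Scheduler to the Fair Scheduler -->
<property>
  <name>yarn.resourcemanager.scheduler.class</name>
  <value>org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler</value>
</property>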