
Scripts to build a Docker image with Hadoop running in pseudo-distributed mode

parrot-stream/docker-hadoop


hadoop


Description

This image runs Cloudera CDH Hadoop in pseudo-distributed mode on a CentOS 7 Linux distribution.

The latest tag of this image is built with the latest available release of CDH on CentOS 7.

You can pull it with:

docker pull parrotstream/hadoop

Images based on other Apache Hadoop releases are also available, under tags of the following form:

docker pull parrotstream/hadoop:[hadoop-release]-[cdh-release]

For example, if you want Apache Hadoop release 2.6.0 on CDH 5.11.1 you can pull the image with:

docker pull parrotstream/hadoop:2.6.0-cdh5.11.1
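The tag pattern above can be assembled in a small shell helper. This is only a minimal sketch: the function name `hadoop_image_ref` is hypothetical and not part of the repository, and it assumes the CDH part is passed exactly as it appears in the tag (for example `cdh5.11.1`).

```shell
#!/bin/sh
# Hypothetical helper (not shipped with the repository): build an image
# reference following the [hadoop-release]-[cdh-release] tag pattern.
#   $1 = Hadoop release, e.g. 2.6.0
#   $2 = CDH release as written in the tag, e.g. cdh5.11.1
hadoop_image_ref() {
    printf 'parrotstream/hadoop:%s-%s\n' "$1" "$2"
}
```

With Docker installed, the result can be passed straight to the pull command, for example `docker pull "$(hadoop_image_ref 2.6.0 cdh5.11.1)"`.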

Run with Docker Compose:

docker-compose -p parrot up

Once started, you'll be able to read the list of all the Hadoop web UI URLs:

Hadoop Web UIs           URL
Hadoop Name Node         http://localhost:50070
Hadoop Data Node         http://localhost:50075
YARN Node Manager        http://localhost:8042
YARN Resource Manager    http://localhost:8088
YARN Timeline History    http://localhost:8188
MapReduce Job History    http://localhost:19888/jobhistory
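To confirm that the web UIs listed above are reachable, a quick check with `curl` may help. The sketch below is an assumption-laden illustration, not a script from the repository: the function names are hypothetical, and it assumes `curl` is installed and that the ports are published on `localhost` as shown in the table.

```shell
#!/bin/sh
# Hypothetical helper: print the web UI URLs copied from the table above.
hadoop_ui_urls() {
    cat <<'EOF'
http://localhost:50070
http://localhost:50075
http://localhost:8042
http://localhost:8088
http://localhost:8188
http://localhost:19888/jobhistory
EOF
}

# Hypothetical helper: print the HTTP status code curl reports for each URL.
# Each request is limited to 5 seconds so an unreachable UI does not hang.
check_hadoop_uis() {
    hadoop_ui_urls | while read -r url; do
        status=$(curl -s -o /dev/null -m 5 -w '%{http_code}' "$url")
        printf '%s %s\n' "$status" "$url"
    done
}
```

Calling `check_hadoop_uis` once the container is up prints one line per UI. A status that is not in the 2xx or 3xx range suggests that the service is not yet ready or that the port is not mapped.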

While the Hadoop Docker container is running, you can always get the list of URLs with the script:

print-urls.sh

which is included in the GitHub source repository.

Available tags:
