The Hadoop ecosystem is the leading open-source platform for distributed storage and processing of "big data". The Hadoop platform is available at CERN as a central service provided by the IT department.
These tutorials, organized by the IT Hadoop service, aim to introduce the main concepts of Hadoop technology in a practical way. They are targeted at those who would like to start using the service for distributed parallel data processing.
Attendees will have access to a test Hadoop system where they can perform hands-on exercises. Instructions will be provided by the speakers. To facilitate the preparation of the test environment, please register if you plan to attend.
[Open the tutorial repository in SWAN](https://cern.ch/swanserver/cgi-bin/go?projurl=https://github.com/prasanthkothuri/hadoop-tutorials-2016.git)