http://www.filipyoo.com/handle-200-GB-of-data-with-AWS-EC2-hadoop-cluster/
Storing the 200 GB NYC taxi dataset and deploying a Cloudera Hadoop cluster to visualize it.
http://www.filipyoo.com/plot-visualization-Hadoop-large-dataset-with-python-datashader/