commoncrawl-elasticsearch Java app that takes data from the Common Crawl public dataset on AWS and places into ElasticSearch