Bookmarks tagged [bigdata]
https://github.com/onurakpolat/awesome-bigdata#readme
A curated list of awesome big data frameworks, resources and other awesomeness. - onurakpolat/awesome-bigdata
- tags: awesome-list, bigdata
- source code
https://github.com/awesomedata/awesome-public-datasets#readme
A topic-centric list of high-quality open datasets. - awesomedata/awesome-public-datasets
- tags: awesome-list, bigdata, datasets
- source code
https://github.com/youngwookim/awesome-hadoop#readme
A curated list of amazingly awesome Hadoop and Hadoop ecosystem resources - youngwookim/awesome-hadoop
- tags: awesome-list, bigdata, hadoop
- source code
https://github.com/igorbarinov/awesome-data-engineering#readme
A curated list of data engineering tools for software developers - igorbarinov/awesome-data-engineering
https://github.com/manuzhang/awesome-streaming#readme
A curated list of awesome streaming frameworks, applications, etc. - manuzhang/awesome-streaming
- tags: awesome-list, bigdata, streaming
- source code
https://github.com/awesome-spark/awesome-spark#readme
A curated list of awesome Apache Spark packages and resources. - awesome-spark/awesome-spark
- tags: awesome-list, bigdata, apache-spark
- source code
https://www.splunk.com/pdfs/ebooks/the-essential-guide-to-machine-data.pdf
Whatever you call it, machine data is one of the most underused and undervalued assets of any organization. And, unfortunately, it’s usually kept for some minimum amount of time before being tossed ou...
http://dataminingguide.books.yourtion.com
https://github.com/linyiqun/DataMiningAlgorithm
https://github.com/Flowerowl/Big-Data-Resources
https://code.csdn.net/CODE_Translation/spark_matei_phd
https://aiyanbo.gitbooks.io/spark-programming-guide-zh-cn/content/