Spark-Odyssey

A journey in to the world of Machine Learning algorithms using Apache Spark.

Prerequisites

Build and Demo process

Clone the Repo

git clone https://github.com/ramuramaiah/spark-odyssey.git

Build

./gradlew clean jar

What the demo does?

The commands to run the Spark jobs are available in the batch files For e.g. To run the collocation algorithm, the colloc.bat has the following entry

colloc.bat

%SPARK_HOME%/bin/spark-submit ^
    --class spark.odyssey.colloc.Driver ^
    --jars file:///C:/mahout/lib/mahout-math-0.13.0.jar ^
    --master local ^
    --deploy-mode client ^
    --driver-memory 4g ^
    --executor-memory 2g ^
    --executor-cores 1 ^
    --queue colloc ^
    build/libs/spark-odyssey.jar ^
    --algo g_2 ^
	-s 1 ^
    "./build/resources/main/input_events.csv" ^
    "./output"

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
gradle/wrapper		gradle/wrapper
spark-cassandra		spark-cassandra
src/main		src/main
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
build.gradle		build.gradle
colloc.bat		colloc.bat
cooccur.bat		cooccur.bat
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat
setEnv.bat		setEnv.bat
settings.gradle		settings.gradle

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spark-Odyssey

Prerequisites

Build and Demo process

Clone the Repo

Build

What the demo does?

Libraries Included

Useful Links

Issues or Suggestions

About

Releases

Packages

Languages

ramuramaiah/Spark-Odyssey

Folders and files

Latest commit

History

Repository files navigation

Spark-Odyssey

Prerequisites

Build and Demo process

Clone the Repo

Build

What the demo does?

Libraries Included

Useful Links

Issues or Suggestions

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages