Library that adds useful error handling and non-serializable object management capabilities to Apache Spark applications.
Note: This work expands on a previous effort by @nerdammer.
Note: Must use a Java version supported by Apache Spark (8..11 at this time) for running the tests
- To generate a package that can be referenced from other projects:
sbt "+package"
This will cross-build for Scala 2.13.x and 2.12.x; find the result intarget/scala-2.13/
(or similar) folder.
libraryDependencies += "org.hathitrust.htrc" %% "spark-utils" % VERSION
<dependency>
<groupId>org.hathitrust.htrc</groupId>
<artifactId>spark-utils_2.13</artifactId>
<version>VERSION</version>
</dependency>