htrc/HTRC-Tools-SparkUtils

HTRC-Tools-SparkUtils

Library that adds useful error handling and non-serializable object management capabilities to Apache Spark applications.

Note: This work expands on a previous effort by @nerdammer.

Build

Note: Running the tests requires a Java version supported by Apache Spark (8 through 11 at this time).

  • To generate a package that can be referenced from other projects:
    sbt "+package"
    This cross-builds for Scala 2.13.x and 2.12.x; the resulting packages are written to the target/scala-2.13/ folder (or the equivalent folder for the other Scala version).
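
For context, the "+" prefix tells sbt to run the command once for every Scala version listed in the crossScalaVersions setting. The fragment below is only a sketch of what such a setting might look like; the exact patch versions are assumptions and are not taken from this project's build definition.

```scala
// build.sbt (illustrative sketch, not this project's actual build file)

// Assumption: example patch releases for the 2.13.x and 2.12.x lines.
crossScalaVersions := Seq("2.13.12", "2.12.18")

// The version used when a task is run without the "+" prefix.
scalaVersion := crossScalaVersions.value.head

// `sbt "+package"` then runs `package` once per entry above, writing each
// result to its own folder: target/scala-2.13/ and target/scala-2.12/.
```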

Usage

SBT

libraryDependencies += "org.hathitrust.htrc" %% "spark-utils" % VERSION
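
The "%%" operator appends the Scala binary version of the consuming project to the artifact name, which is why the Maven coordinates use an explicit suffix such as spark-utils_2.13. A minimal sketch of a consuming build.sbt follows; the scalaVersion value and the sparkUtilsVersion name are illustrative assumptions, and VERSION remains a placeholder for an actual release.

```scala
// build.sbt of a consuming project (illustrative sketch)

// Assumption: any 2.13.x or 2.12.x release for which the library is published.
scalaVersion := "2.13.12"

// Hypothetical helper name; replace "VERSION" with a published release.
val sparkUtilsVersion = "VERSION"

// With a 2.13.x scalaVersion, %% resolves the artifact spark-utils_2.13.
libraryDependencies += "org.hathitrust.htrc" %% "spark-utils" % sparkUtilsVersion

// Explicit equivalent of the line above, using a single % and the full name:
// libraryDependencies += "org.hathitrust.htrc" % "spark-utils_2.13" % sparkUtilsVersion
```

Using "%%" keeps the dependency correct if the project later switches between Scala 2.13 and 2.12, since the suffix follows scalaVersion automatically.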

Maven

<dependency>
    <groupId>org.hathitrust.htrc</groupId>
    <artifactId>spark-utils_2.13</artifactId>
    <version>VERSION</version>
</dependency>
