Skip to content

Latest commit

 

History

History
46 lines (29 loc) · 1.13 KB

README.md

File metadata and controls

46 lines (29 loc) · 1.13 KB

Databricks DCS Toolkit

A set of useful primatives for making more advanced DCS containers: See: https://docs.databricks.com/clusters/custom-containers.html and: https://github.com/databricks/containers

Docker Layer Architecture

Foundation: minimal_container conda_python Standard R Rapids: Rapids Container Monitoring: Ganglia

Basic instructions


make build-base

make build-rapids

make build-ganglia

make push-rapids

Breakdown of images

build-base builds the base image, adds conda and installs some of the standard libs that we need to make notebooks work in databricks. Note that we may still be missing some required Python or Spark packages that normally are in a DBR release.

build-rapids builds a function RAPID.ai Python container with the current (Feb 2021) release of RAPIDs.ai

This has been lightly tested on DBR 10.3 and DBR 10.4 LTS.

Known Missing Features:

R - has issues - related to not triggering init script? ssh Repos feature in DB Web Terminal

Still need to add back some more libs for deeplearning etc

  • need to check if it is in the Rapids installs