A set of useful primatives for making more advanced DCS containers: See: https://docs.databricks.com/clusters/custom-containers.html and: https://github.com/databricks/containers
Foundation: minimal_container conda_python Standard R Rapids: Rapids Container Monitoring: Ganglia
make build-base
make build-rapids
make build-ganglia
make push-rapids
build-base
builds the base image, adds conda and installs some of the standard libs that we need to make notebooks work in databricks. Note that we may still be missing some required Python or Spark packages that normally are in a DBR release.
build-rapids
builds a function RAPID.ai Python container with the current (Feb 2021) release of RAPIDs.ai
This has been lightly tested on DBR 10.3 and DBR 10.4 LTS.
Known Missing Features:
R - has issues - related to not triggering init script? ssh Repos feature in DB Web Terminal
Still need to add back some more libs for deeplearning etc
- need to check if it is in the Rapids installs