
anmolbhatia05/brane-data-science-workflow


Introduction

This BraneScript implements the data processing pipeline for the Kaggle Titanic competition using the Brane framework.

This script makes use of two separate packages: brane_compute and brane_visualize.

This script runs on a Brane instance as defined here.

To understand the function calls in the script, read the READMEs of the brane_compute and brane_visualize packages.

Automated running of the BraneScript (DSL)

We have set up an automated build for this BraneScript using GitHub Actions. It does the following:

  1. Imports the compute and visualize packages from their GitHub repositories and builds them
  2. Runs the workflow for the Titanic challenge as described in titanic_workflow.bs

The triggers are push and pull_request, but you can also run it manually. Refer to the image below to see how (click on "Run workflow" for CI).
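
As a rough illustration of the setup described above, a GitHub Actions workflow with these triggers might look like the sketch below. The file name, job name, runner, and the placeholder Brane setup step are assumptions for illustration and are not copied from this repository; the `brane` commands are the ones listed in this README. The `workflow_dispatch` trigger is what enables a manual run from the Actions tab.

```yaml
# Hypothetical .github/workflows/ci.yml; names and runner are assumptions.
name: CI

on:
  push:
  pull_request:
  workflow_dispatch:   # allows the workflow to be started manually

jobs:
  titanic-workflow:    # hypothetical job name
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4

      # Placeholder: install the Brane CLI and bring up a Brane instance.
      # The actual steps depend on the Brane version and are not shown here.
      - name: Set up Brane
        run: echo "install and deploy Brane here"

      # Step 1 of the list above: import and build both packages.
      - name: Import packages
        run: |
          brane import web-services-and-cloud-based-systems/brane-compute-package
          brane import web-services-and-cloud-based-systems/brane-visualise-package

      # Step 2 of the list above: run the Titanic workflow.
      - name: Run workflow
        run: brane run titanic_workflow.bs --data /data
```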

Running this BraneScript locally or on Kubernetes (K8s)

We assume that you have the Brane CLI installed and a Brane instance installed and deployed. Run the following commands in sequence:

  1. brane import web-services-and-cloud-based-systems/brane-compute-package
  2. brane import web-services-and-cloud-based-systems/brane-visualise-package
  3. brane run titanic_workflow.bs --data /data