Feature improve wind turbine #74

Open: wants to merge 7 commits into `develop`
81 changes: 55 additions & 26 deletions demos/wind-turbine/README.md
# Wind Turbine (Spark - Livy - Sparkmagic)

In this demonstration, you use Spark to explore a dataset and train a Gradient-Boosted Tree (GBT) regressor that
leverages various features, such as wind speed and direction, to estimate the power output of a wind turbine.
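To give a sense of the kind of model the notebook builds, here is a minimal, standalone PySpark sketch of fitting a GBT regressor. The column names (`wind_speed`, `wind_direction`, `power_output`) and the CSV path are placeholders rather than the tutorial's actual schema, and in the actual tutorial the code runs remotely on the Spark cluster via Livy and Sparkmagic:

```python
# Minimal sketch (not the tutorial code): fit a GBT regressor on a few
# hypothetical SCADA columns; adjust names and paths to the real dataset.
from pyspark.sql import SparkSession
from pyspark.ml.feature import VectorAssembler
from pyspark.ml.regression import GBTRegressor

spark = SparkSession.builder.appName("wind-turbine-gbt").getOrCreate()

# Hypothetical schema: wind_speed, wind_direction, power_output
df = spark.read.csv("turbine_scada.csv", header=True, inferSchema=True)

# Assemble the assumed feature columns into a single vector column.
assembler = VectorAssembler(
    inputCols=["wind_speed", "wind_direction"],
    outputCol="features",
)
train_df = assembler.transform(df)

# Train the regressor and inspect a few predictions.
gbt = GBTRegressor(featuresCol="features", labelCol="power_output", maxIter=50)
model = gbt.fit(train_df)
model.transform(train_df).select("power_output", "prediction").show(5)
```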

![wind-farm](images/wind-farm.jpg)

Wind turbines hold tremendous potential as a sustainable source of energy, capable of supplying a substantial portion
of the world's power needs. However, the inherent unpredictability of power generation poses a challenge when it comes
to optimizing this process.

Fortunately, you have a powerful tool at your disposal: Machine Learning (ML). By leveraging advanced algorithms and data
analysis, you can develop models that accurately predict the power production of wind turbines. This enables you to
optimize the power generation process and overcome the challenges associated with its ingrained variability.

1. [What You'll Need](#what-youll-need)
1. [Procedure](#procedure)
1. [How it Works](#how-it-works)
1. [References](#references)

## What You'll Need

For this tutorial, ensure you have:

- Access to an HPE Ezmeral Unified Analytics cluster.

## Procedure

To complete this tutorial, follow the steps below:

1. Log in to your Ezmeral Unified Analytics (EzUA) cluster using your credentials.
1. Create a new Notebook server using the `jupyter-data-science` image. Request at least `4Gi` of memory for the
Notebook server.
1. Connect to the Notebook server and clone the repository locally.
1. Navigate to the tutorial's directory (`ezua-tutorials/demos/wind-turbine`).
1. Launch a new terminal window and create a new conda environment using the specified `environment.yaml` file:

```bash
conda env create -f environment.yaml
```

1. Add the new conda environment as an ipykernel:

```bash
python -m ipykernel install --user --name=wind-turbine
```

1. Refresh your browser tab to access the updated environment.
1. Launch the `wind-turbine.ipynb` notebook file and follow the instructions. Make sure to select the `wind-turbine`
environment kernel.

## How it Works

In this tutorial, you use Livy and Sparkmagic to remotely execute Python code in a Spark cluster. Livy is an open-source
REST service that enables remote and interactive analytics on Apache Spark clusters. It provides a way to interact with
Spark clusters programmatically using a REST API, allowing you to submit Spark jobs, run interactive queries, and manage
Spark sessions.
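For a rough picture of what Livy does under the hood, the sketch below (not part of the tutorial) creates an interactive PySpark session and submits a single statement over Livy's REST API. The Livy URL is a placeholder; on EzUA the endpoint is preconfigured and Sparkmagic issues these calls for you:

```python
# Hypothetical illustration of the Livy REST calls Sparkmagic makes for you.
import time
import requests

LIVY_URL = "http://livy.example.com:8998"  # placeholder endpoint

# 1. Create an interactive PySpark session.
session = requests.post(f"{LIVY_URL}/sessions", json={"kind": "pyspark"}).json()
session_url = f"{LIVY_URL}/sessions/{session['id']}"

# 2. Wait until the session is idle (ready to accept statements).
while requests.get(session_url).json()["state"] != "idle":
    time.sleep(5)

# 3. Submit a statement and poll until its result is available.
stmt = requests.post(
    f"{session_url}/statements",
    json={"code": "spark.range(10).count()"},
).json()
stmt_url = f"{session_url}/statements/{stmt['id']}"
while (result := requests.get(stmt_url).json())["state"] != "available":
    time.sleep(2)
print(result["output"])

# 4. Clean up the session when finished.
requests.delete(session_url)
```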

To communicate with Livy and manage your sessions, you use Sparkmagic, an open-source tool that provides a Jupyter kernel
extension. Sparkmagic integrates with Livy to provide the underlying communication layer between the Jupyter kernel and
the Spark cluster.
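For reference, a hypothetical sketch of the Sparkmagic workflow from a plain IPython kernel is shown below. The Livy URL and session name are placeholders, the exact `%spark add` flags depend on your deployment's authentication, and on EzUA the notebook image may already wire this up; each magic normally goes in its own notebook cell:

```python
# Hypothetical Sparkmagic workflow; each magic below lives in its own cell.

# --- cell 1: load the Sparkmagic IPython extension ---
%load_ext sparkmagic.magics

# --- cell 2: attach a PySpark session backed by Livy (placeholder URL) ---
%spark add -s wind-turbine -l python -u http://livy.example.com:8998

# --- cell 3: code in a %%spark cell executes remotely on the Spark cluster ---
%%spark
df = spark.range(100)
print(df.count())

# --- cell 4: list and clean up managed sessions when finished ---
%spark info
%spark cleanup
```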

## References

1. [Spark: Unified engine for large-scale data analytics](https://spark.apache.org/)
1. [Livy: A REST Service for Apache Spark](https://livy.apache.org/)
1. [Sparkmagic: Jupyter magics and kernels for working with remote Spark clusters](https://github.com/jupyter-incubator/sparkmagic)
1. [Wind Turbine Scada Dataset](https://www.kaggle.com/datasets/berkerisen/wind-turbine-scada-dataset/data)