This tutorial explores how to leverage Feast [1] for generating training data and enhancing online Machine Learning (ML) model inference. In this use-case, your goal is to train a ride-sharing driver satisfaction prediction model using a training dataset built using Feast.
For this tutorial, ensure you have:
- Access to an HPE Ezmeral Unified Analytics (EzUA) cluster.
To complete the tutorial follow the steps below:
- Login to your EzUA cluster, using your credentials.
- Create a new Notebook server using the
marketplace.us1.greenlake-hpe.com/ezmeral/ezkf/jupyter:v1.3.0-e658264
image. Request at least4Gi
of memory for the Notebook server. - Connect to the Notebook server, launch a new terminal window, and clone the repository locally.
- Navigate to the tutorial's directory (
ezua-tutorials/tutorials/feast
). - Install the dependencies:
pip install -r requirements.txt
- Refresh your browser tab to access the updated environment.
- Launch the
ride-sharing.ipynb
Notebook and execute the code cells.
Feast is an open-source platform designed to manage, store, and serve ML features to production models. At its core, Feast decouples the feature engineering process from model training and serving. Engineers and data scientists define and compute features, then store them in the Feast feature store. Once in the store, these features can be retrieved for both training ML models and serving them in real-time or batch predictions.
The central feature store ensures consistency in feature values and computation logic between training and serving, reducing the possibility of training-serving skew. Additionally, Feast handles data backfills, versioning, and monitoring, simplifying many of the operational complexities associated with deploying ML models at scale.