Skip to content

Latest commit

 

History

History
19 lines (12 loc) · 954 Bytes

File metadata and controls

19 lines (12 loc) · 954 Bytes

Prepare packages and dataset for pyspark

For simplicity export the location to these jars. All examples assume the packages and dataset will be placed in the /opt/xgboost directory:

Download the jars

Download the RAPIDS Accelerator for Apache Spark plugin jar

Build XGBoost Python Examples

Following this guide, you can get samples.zip and main.py and copy them to /opt/xgboost

Download dataset

You need to copy the dataset to /opt/xgboost. Use the following links to download the data.

  1. Mortgage dataset
  2. Taxi dataset
  3. Agaricus dataset