This tool enables you to download Amazon user reviews by category from the extensive McAuley-Lab/Amazon-Reviews-2023 dataset. Designed for handling very large datasets, it is ideal for projects involving data analysis, research, or advanced Python workflows.
For the complete results of Python Pandas vs Python Ibis, just visit Peak Python for the 9 page report.
This program was developed as part of the course "Rethinking Python Pandas: 5x-100x Speed & Maximum Flexibility with Python Ibis" by Peak Python.
Key Features:
- Download product category-specific reviews.
- Defaults to a maximum of 10M rows of CSV output. Change the function defaults if you want to export all rows.
- Ensure you have enough disk space for large datasets.
- Python 3.8+ is recommended for optimal performance.
- Create Download Directories:
- Prepare directories where the data will be saved.
- Update Configuration:
- Modify the
core_path
variable in the script to point to your directories.
- Modify the
Run the program to initiate the download. Each category may result in a large dataset, so adjust your resources accordingly.
Feel free to reach out for support or to share feedback about your experience with this program! 🚀