In this repository, I have developed a model that detects the flair of posts from the India subreddit using machine learning algorithms.
Structure
- Reddit_data_collection Notebook contains all the code that was used to fetch data from Reddit using the PRAW API and merge it with the dataset downloaded from Kaggle. The combined data was then cleaned and saved to a CSV file for data analysis and for building the machine learning model.
- Exploratory Data Analysis Notebook contains all the code that was used to analyse and visualize the data.
- Preprocessing file contains all the code required to preprocess and clean the data.
- Flair-Detector Notebook contains the code used to train various machine learning models and compare their accuracy on different features.
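
As a rough illustration of the kind of cleaning the Preprocessing file performs, here is a minimal sketch using only the Python standard library. It is not the code from this repository: the function name `clean_text`, the tiny `STOPWORDS` set, and the specific steps (lowercasing, URL removal, punctuation stripping, stopword filtering) are assumptions chosen for the example.

```python
import re
import string

# Hypothetical, deliberately small stopword list for illustration only;
# a real pipeline would likely use a fuller list.
STOPWORDS = {"a", "an", "the", "is", "in", "of", "to", "and", "on", "for"}

# Matches http(s) links and bare www. links.
URL_RE = re.compile(r"https?://\S+|www\.\S+")

# Maps every ASCII punctuation character to a space.
PUNCT_TABLE = str.maketrans({ch: " " for ch in string.punctuation})


def clean_text(text: str) -> str:
    """Lowercase a post title or body, drop URLs, punctuation and stopwords."""
    text = text.lower()
    text = URL_RE.sub(" ", text)          # remove links before stripping punctuation
    text = text.translate(PUNCT_TABLE)    # replace punctuation with spaces
    tokens = [tok for tok in text.split() if tok not in STOPWORDS]
    return " ".join(tokens)


if __name__ == "__main__":
    print(clean_text("The Budget of 2020 is HERE! https://example.com/x"))
```

The cleaned strings could then be passed to a text vectorizer before training the models in the Flair-Detector Notebook.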