This repository contains notebooks for performing Exploratory Data Analysis (EDA) and data preprocessing. The aim is to provide a comprehensive set of tools and techniques to clean, transform and analyze datasets, making them ready for further analysis or machine learning modeling.
- Data Cleaning: Handling missing values, data inconsistencies and outliers.
- Data Transformation: Encoding categorical variables, feature scaling and creating new features.
- Exploratory Data Analysis: Visualizations, Insights extraction and data summary.
- Data Integration: Combining data from multiple sources.
- Pandas: Data manipulation and analysis
- Numpy: Numerical operations.
- Scikit-learn: Feature engineering and preprocessing.
- Matplotlib and seaborn: Data visualization.