Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Solved issue #4, removing outliers. #40

Closed
wants to merge 1 commit into from

Conversation

yaaghnetha09
Copy link

  • Solved issue Data Preprocessing: Removing Outliers from the Dataset #4
  • Tried to check the outliers by taking a threshold value and comparing the columns with it and filter the data accordingly.
  • The code calculates the z-scores using NumPy's abs, mean, and std functions, and then filters the data based on the threshold.

Screenshot 2024-03-17 140435

  • Used scatter plot is used to visualize the wine prices on y-axis and num_reviews on x-axis, and outliers are removed based on a threshold.

Screenshot 2024-03-17 141202
Screenshot 2024-03-17 140458

@yaaghnetha09 yaaghnetha09 changed the title Removed outliers using Z score method in two ways. Solved issue#4 Mar 17, 2024
@yaaghnetha09 yaaghnetha09 changed the title Solved issue#4 Solved issue #4, removing outliers. Mar 17, 2024
@yaaghnetha09
Copy link
Author

@darshbaxi, please review it, if there is any bug, please let me know.

@darshbaxi
Copy link
Contributor

The code looks good :)

@darshbaxi darshbaxi closed this Mar 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants