GitHub - pvsnp9/Feature-Selection: Techniques to select features for Machine learning projects.

Feature Selection Techniques

Feature selection is the process of reducing the number of input variables when developing a predictive model. It is desirable to reduce the number of input variables to both reduce the computational cost of modeling and, in many cases, to improve the performance of the model. Statistical-based feature selection methods involve evaluating the relationship between each input variable and the target variable using statistics and selecting those input variables that have the strongest relationship with the target variable. These methods can be fast and effective, although the choice of statistical measures depends on the data type of both the input and output variables.

There are two main types of Feature Selection:

Supervised Selection:

Use the target variable (e.g. remove irrelevant variables). it is further categorized into three main categories.

- Filter Method
Select subsets of features based on their relationship with the target.
    - Basics: (Constant, Quasi-Constant, Duplicated)
    - Correlation 
    - Statistical measures: (Information Gain, Fisher Score, Univariate method, mutual information)

- Wrapper method:
Search subsets of features that perform according to a predictive model.
    - Step forward Selection 
    - Step Backward Selection 
    - Exhaustive Search 
    - Feature shuffling 

- Embedded Methods:
    - LASSO
    - Decision tree derived importance 
    - Regression Coefficients 

- Hybrid Methods:
    - Recursive Feature Elimination

Unsupervised Learning

Do not use the target variable (e.g. remove redundant variables). Unsupervised feature selection techniques ignore the target variable, such as methods that remove redundant variables using correlation or features that have few values or low variance(i.e. data cleaning).

Detail Listing

Basic Selection Methods
1. Removing Constant Features
2. Removing Quasi-Constant Features
3. Removing Duplicated Features
Correlation Feature Selection
1. Removing Correlated Features
2. Basic Selection Methods + Correlation - Pipeline
Filter Methods: Univariate Statistical Methods
1. Mutual Information
2. Chi-square distribution
3. Anova
4. Basic Selection Methods + Statistical Methods - Pipeline
Filter Methods: Other Methods and Metrics
1. Univariate roc-auc, mse, etc
2. Method used in a KDD competition - 2009
Wrapper Methods
1. Step Forward Feature Selection
2. Step Backward Feature Selection
3. Exhaustive Feature Selection
Embedded Methods: Linear Model Coefficients
1. Logistic Regression Coefficients
2. Linear Regression Coefficients
3. Effect of Regularization on Coefficients
4. Basic Selection Methods + Correlation + Embedded - Pipeline
Embedded Methods: Lasso
1. Lasso
2. Basic Selection Methods + Correlation + Lasso - Pipeline
Embedded Methods: Tree Importance
1. Random Forest derived Feature Importance
2. Tree importance + Recursive Feature Elimination
3. Basic Selection Methods + Correlation + Tree importance - Pipeline
Hybrid Feature Selection Methods
1. Feature Shuffling
2. Recursive Feature Elimination
3. Recursive Feature Addition

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Embedded method - Lasso (L1) reg		Embedded method - Lasso (L1) reg
Embedded methods - Regression		Embedded methods - Regression
Embedded methods - Trees		Embedded methods - Trees
Filter method - Basic		Filter method - Basic
Filter method - Correlation		Filter method - Correlation
Filter method - statistical measures		Filter method - statistical measures
Metric based filtering		Metric based filtering
datasets		datasets
wrapper method		wrapper method
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Feature Selection Techniques

Detail Listing

About

Releases

Packages

Languages

pvsnp9/Feature-Selection

Folders and files

Latest commit

History

Repository files navigation

Feature Selection Techniques

Detail Listing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages