Skip to content

Rrschch-6/hushHush-Recruiter

Repository files navigation

Hush Hush-Recruiter

image

Hush Hush recruiter is an automated recruitment tool written in Python 3 that automates the entire analysis process of selecting potential candidate for Doodle firm. The secretive process automatically sents email to an candidate if he is selected for a potential role at Doodle.

Recruiter wants to ensure that

  1. The algorithm to hire a candidate cannot be deterministic
  2. The data points of potential candidates can be picked over the data source from internet.
  3. The interface to provide the coding solutions should be invalidated after a specified period

Table of contents

Overview

Through API we try to connect with each datasources which will provide the details on activities availble on the activities with their application. All the datasources with normalised score will be populated to database which is common for all the source . From the source we implement our selection algorithm which will list down the potentional candidates and their position.

we pick each of the candidates and sent them an email based which will have all the link for the doodle challenge.

Data sources

we are picking Github,Stackoverflow,Kaggle and Twitter as our main source. image

API integration

Each of the datasource folder is created and the script is included.

Normalising the score

Three methods we have used to normalise the score :

  1. X(norm)= X-min(x)/max(x)- min(x)

  2. we used harmonicc mean for some attributes and arthimetic mean of all the harmonic mean

  3. we used scaler funcations

Information on database

We have used SQLlite database for this project ,name :HushHush.db available in Application-->Data (folder)

Selection Algorithims

To avoid people hacking the algorithm, we are selecting on the basis of percentiles instead of absolute scores of candidates. STEP 1: Check normalized score of candidates in all datasets except Twitter.

STEP 2: Candidates will be selected as per the percentiles below (in any dataset).

STEP 3: For remaining candidates (candidates who are below 70percentile in all datasets).

STEP 4: Sort Candidates by the sum of Weighted Score in Descending order.

STEP 5: Candidates will be selected as per the below percentile hierarchy.

image

Classification Algorithm

After implementing our main algorithm, for the further evaluation for new entries we decided to use a machine learning algorithm based on the labels we got from our main algorithm and our initial data. As SVM,NN, LogReg or other models that are trained with gradient descent, are doing very well when the variables are normalized or rescaled we tried these three algorithms with different parameters. Based on our data Neural Network had the best accuracy for us. Multi-layer Perceptron is sensitive to feature scaling, so it is highly recommended to rescaled data with mean 0 and variance 1 Multi-layer Perceptron (MLP) is a supervised learning algorithm. MLP algorithm trains using some form of gradient descent and the gradients are calculated using Backpropagation with no activation function in the output layer

Email Notification

image

Doodle challenge

https://docs.google.com/forms/d/e/1FAIpQLScEOEKdgnWweYQLzpxiwOILcMvEi_dIuNPhbD6qvf0nUTMtVw/viewform?usp=sf_link

Application Design

image

Project structure

image

Front end image

Credits

This is a group project of Big data programming-1(semester -1) for the master program in big data and business analytics. Developers are Sasha (@Rrschch-6), Nissy (@sasidn), Garima (@Garima27dec) ,Omer(@atesch93) and Fatemeh (@karampanah927)

About

Big Data Programming Project - hushHush Recruiter

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •