The objective is to build a large database of data interview questions that can be used to develop a model that can suggest key questions that prospective candidates may want to focus on based on the content of the job advertisement, resume, company and/or key focus areas. This database can help focus time and effort in the interview preparation process. Key stages to building the project:
- Initial research to identify key resources that can be used to develop the database
- Building of the database via explicit import of data files, web scraping and collation
- Creating a set of tagged questions that can serve as the ground truth in modeling activities
- Developing suggested answers to key questions.
- Exploratory data analysis that looks at the distribution of data questions relative to data type jobs
- Building a classification model using various NLP techniques
- Developing a Web API that people that are preparing for interviews can interact with to easily access the suggested list of questions and help manage their learning process.
- [DONE] https://www.springboard.com/blog/data-science-interview-questions/
- [DONE] https://www.edureka.co/blog/interview-questions/data-science-interview-questions/
- [DONE] https://github.com/kojino/120-Data-Science-Interview-Questions
- [ONLY FIRST 5 PAGES] https://www.glassdoor.com/Interview/data-scientist-interview-questions-SRCH_KO0,14.htm
- [DONE] https://www.dezyre.com/article/100-data-science-interview-questions-and-answers-general-for-2018/184
- [DONE] https://www.kdnuggets.com/2016/02/21-data-science-interview-questions-answers.html
- [DONE] http://nitin-panwar.github.io/Top-100-Data-science-interview-questions/?utm_campaign=News&utm_medium=Community&utm_source=DataCamp.com
- [DONE] https://www.analyticsvidhya.com/blog/2018/06/comprehensive-data-science-machine-learning-interview-guide/
- [DONE] https://medium.com/acing-ai/top-data-science-interview-questions-answers-part-2-20f8c458056d
- [DONE] https://tekslate.com/data-science-interview-questions/
- https://www.amazon.com/Questions-Crack-Data-Science-Interview-ebook/dp/B06XKVBFZ8
- https://www.sanfoundry.com/1000-data-science-questions-answers/
- [DONE] https://www.udacity.com/course/data-science-interview-prep--ud944
TODO
A classifier that categorizes Data Science questions into the following:
- Communication
- Data Analysis
- Predictive Modeling
- Probability
- Product Metrics
- Programming
- Statistical Inference
Note: The classifier could assign multiple tags to a given data science question.
TODO