This is a set of notebooks intended to give a quick introductions into some methods for building and examining models that could be useful for materials design.
The first notebook classical-ml
introduces a number of methods for fitting some features
to data on the band gap of materials. The final model that we come to is based strongly on
Data-Driven Discovery of Photoactive Quaternary Oxides Using First-Principles Machine Learning
The second notebook shapley_values_gbtree
introduces the application of TreeExplainer
to examine how the features of the model contribute to the outcomes. And to help with understanding the predictions that are made.
data
- contains all the data needed to train the modelsmodels
- contains a pre-trained decision tree, if you want to skip straight to tutorial 2notebooks
- has the two notebooksenvironment.yml
- contains the conda environment that these notebooks were developed in