Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ENH] Neat and automated transfer learning with OPTIMADE API for auto-adjusted problem-specific ML model generation on the fly #16

Merged
merged 50 commits into from
Apr 4, 2024

Conversation

amkrajewski
Copy link
Contributor

As the title says, this new addition to the core pySIPFENN functionalities connects it to OPTIMADE API to enable rapid adjustment of the models to any specific dataset described by an OPTIMADE query (or multiple queries). Most of the functions are neatly hidden behind high-level API and default values should work well for datasets between 100-10,000 datapoints.

You can now simply:

from pysipfenn import Calculator, OPTIMADEAdjuster
c = Calculator(autoLoad=False)
c.loadModels("SIPFENN_Krajewski2022_NN30")
ma = OPTIMADEAdjuster(c, "SIPFENN_Krajewski2022_NN30",  useClearML=True, device='mps') # MPS is for Apple M1 GPU

ma.fetchAndFeturize(
    'elements HAS "Hf" AND elements HAS "Mo" AND NOT elements HAS ANY "O","C","F","Cl","S"',
    parallelWorkers=4)
ma.adjust()

ma.plotStarting() # See the starting performance
ma.plotAdjusted() # See the adjusted performance

or to perform a hyperparameter search, replace the ma.adjust() with:

ma.matrixHyperParameterSearch()
ma.adjust(learningRate=0.0001, optimizer='AdamW', weightDecay=1e-05, epochs=37)

All model usage works as before with the Calculator class. Modifying or exporting it for later is through specific classes in the modelExporters submodule.

…; meant mostly for tuning to smaller datasets
…; meant mostly for tuning to smaller datasets
… a parameter with documentation in `OPTIMADEAdjuster`
… now collecting names, target data, and featurizing the obtained structures
…adjustment, to display if a datapoint has been shown to the model at adjustment. Displayed when plotting unadjusted and adjusted models.
…sed right now but important for future methods including crystALL
@amkrajewski
Copy link
Contributor Author

Notes:

  1. It is feature-complete.
  2. I'm still working on the testing suite.
  3. @rdamaral will add a neat tutorial at a future date.

Copy link

codecov bot commented Mar 30, 2024

Codecov Report

Attention: Patch coverage is 89.20188% with 46 lines in your changes are missing coverage. Please review.

Project coverage is 93.58%. Comparing base (74a31dd) to head (4c12c21).
Report is 4 commits behind head on main.

Files Patch % Lines
pysipfenn/core/modelAdjusters.py 87.15% 46 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main      #16      +/-   ##
==========================================
- Coverage   94.84%   93.58%   -1.27%     
==========================================
  Files          17       19       +2     
  Lines        1999     2432     +433     
==========================================
+ Hits         1896     2276     +380     
- Misses        103      156      +53     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@amkrajewski
Copy link
Contributor Author

Hi @jwsiegel2510 and @rdamaral Everything is complete and the tests are passing. It's ready to be reviewed!

@amkrajewski
Copy link
Contributor Author

Hi @jwsiegel2510 and @rdamaral, I was hoping to pull it later today to align with the manuscript posting on arXiv.

@ricardonpa
Copy link
Contributor

ricardonpa commented Apr 3, 2024

Hi Adam,

I've reviewed the documentation, tested the main functions, and they are working well. I also did not encounter any issues when installing this branch version in a new conda environment (Python 3.10).

Just a couple of comments:

  • Running the hyperparameter search step takes a long time on a cpu, so I couldn't finish testing. As discussed, please consider changing the default epochs to a lower value.
  • When testing OPTIMADE providers:

OQMD returned TypeError: can only concatenate str (not "int") to str
JARVIS returned ValidationError: 1 validation error for StructureResource.
Aflow returned Error: Provider ...: ('Connection aborted.', ... ))
PS: Both OQMD and Jarvis were run using the targetPath values mentioned in the documentation. Other than these, MP and Alexandria were also tested and did not raise any error.

  • I also tried providing the wrong targetPath for a provider and it raises: 'ValueError: not enough values to unpack (expected 4, got 0)'. Do you think targetPath could be fetched automatically from each provider's endpoint when defyining OPTIMADEAdjuster? I'm considering this because even the current default values may eventually break, for instance, if MP decides to change their endpoint (again). Another alternative would be to have targetPath set to () rather than MP's formation energy path.

…mproved description of the parameter to make it clear higher number is desired if GPU is present
…y the base URL to use and point to custom provider or a specific sub-database of a provider
@amkrajewski
Copy link
Contributor Author

amkrajewski commented Apr 3, 2024

Hi @rdamaral ! Thanks for the insightful comments :)

  1. The default number of epochs for fine-tuning was reduced to 20, with documentation discussing this and mentioning that on a GPU (even a laptop one) 100 may be preferred.
  2. The OQMD server is down, and JARVIS seems to have issues filtering. I will ask about that at the developer meeting tomorrow.
  3. I've added a bunch of assertions that should catch unexpected user inputs and display useful messages on what went wrong. The property data paths are provider-specific and cannot be inferred a prior.

@amkrajewski
Copy link
Contributor Author

I also added a new functionality that allows you to override provider and use a custom endpoint. E.g.

ma = pysipfenn.OPTIMADEAdjuster(
    c,
    model="SIPFENN_Krajewski2022_NN30",
    endpointOverride=["https://alexandria.icams.rub.de/pbesol"],
    targetPath=['attributes', '_alexandria_formation_energy_per_atom']
)

ma.fetchAndFeturize(
    'elements HAS "Hf" AND elements HAS "Mo" AND elements HAS "Zr"',
    parallelWorkers=2
)

@amkrajewski amkrajewski merged commit 0d60ef6 into main Apr 4, 2024
13 of 15 checks passed
@ricardonpa
Copy link
Contributor

Nice. The endpointOverride input is very interesting from the user’s perspective, especially in the event of changes or new additions to OPTIMADE. 👍

Copy link
Collaborator

@jwsiegel2510 jwsiegel2510 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, it seems reasonable to me to assume that each task will take the same amount of time since we are simply training the same network with different hyper parameters.

Copy link
Collaborator

@jwsiegel2510 jwsiegel2510 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me

Copy link
Collaborator

@jwsiegel2510 jwsiegel2510 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me, I guess the previous two commits were meant to be one.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants