124 add strain information linked with the species csv #126
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This is a draft PR looking at how to add strain information by adding a
StrainModel
andStrain
class in thespecies.py
file. I also had to add a new filter function to thegenerator.py
file to allow for filtering a dataframe to get back only the unique rows -- this is needed to make the sets of Species unique, even though multiple strains might share a species.The intention is to use this new feature to separately validate
subject.json
models to ensure that the species matches the species listed in the strain, when the strain is set.We can't merge this PR until registry information is added for strains or we remove the registry fields from
StrainModel
.