simulation tags are not converted to int/float as appropriate #1658

ckirkman-IDM · 2021-11-05T21:23:26Z

dtk-tools used to do this, and is critical for all calibration analyzers, otherwise their sample index values are string-sorted (0, 1, 10, 11, 12, 2, 3, 4, 5, 6, 7, 8, 9) instead of int sorted (0, 1, 2, 3, 4, 5, ...). For calibration, this could lead to inappropriate matching between analyzer scores and the parameter sample it is supposed to be linked to.

I have diagnosed this as a non-feature parity issue between idmtools <-> dtk-tools that does not depend on pyComps or pandas.

We will need to decide if numerical conversion of tag values is up to the user (in analyzers, etc) or is reliably done by idmtools, or have calibra convert just-this-tag, as every ported dtk-tools -> idmtools calibration analyzer will be exposed to this error.

Discussion and details from teams:

ckirkman-IDM · 2021-11-05T21:24:42Z

@shchen-idmod @menriquez-IDM @devclinton @ZDu-IDM

ckirkman-IDM · 2021-11-05T21:29:45Z

Is there a way to make the issuelabeler bot less ... AGGRESSIVE in adding labels over and over again?

shchen-idmod · 2021-11-05T21:47:14Z

To give the more history context, we used to fix Prashanth and Jillian's calibration with 2 lines of code at right before return result in analyzer's reduce function to first convert result index(which is sample_index tag) to int then sort by index. But this solution basically needs each user to remember this 2 lines of code. other wise, calibration result will be screwed up.
If we can change in idmtools or some where not ask user to do it, it would be idea solution.

devclinton · 2023-08-25T18:56:12Z

I think maybe we could add a function the the IEntity(has the tags object on it) to make it easier for users to easily convert the columns in tags to specific datatypes. My first instinct is to imitate libraries that users are familiar with, so looking at Dataframe constructor as inspiration(see https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html), we could add something like

def convert_tag_types(dtypes):
       .......
       
# and example
a = Simulation()
a.convert_tag_types([("seed", int), ("s", float)])
# or
a.convert_tag_types({"seed": int, "s": float})

We could then enable automatic conversion with a new option in the iplatform later possible on fetch as a post load option?

issuelabeler bot added Analyzers COMPS Core labels Nov 5, 2021

ckirkman-IDM added bug Something isn't working and removed COMPS labels Nov 5, 2021

issuelabeler bot added COMPS Discuss Discuss with group labels Nov 5, 2021

ckirkman-IDM removed the COMPS label Nov 5, 2021

devclinton assigned emilydriano Jul 27, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

simulation tags are not converted to int/float as appropriate #1658

simulation tags are not converted to int/float as appropriate #1658

ckirkman-IDM commented Nov 5, 2021 •

edited

Loading

ckirkman-IDM commented Nov 5, 2021

ckirkman-IDM commented Nov 5, 2021

shchen-idmod commented Nov 5, 2021

devclinton commented Aug 25, 2023

simulation tags are not converted to int/float as appropriate #1658

simulation tags are not converted to int/float as appropriate #1658

Comments

ckirkman-IDM commented Nov 5, 2021 • edited Loading

ckirkman-IDM commented Nov 5, 2021

ckirkman-IDM commented Nov 5, 2021

shchen-idmod commented Nov 5, 2021

devclinton commented Aug 25, 2023

ckirkman-IDM commented Nov 5, 2021 •

edited

Loading