
Add option for getCyclopsProfileLogLikelihood #75

Closed
azimov opened this issue Jun 14, 2024 · 6 comments


azimov commented Jun 14, 2024

In long-running jobs we are finding that Cyclops getCyclopsProfileLogLikelihood runs for many hours and often produces the warning:

WARN Cyclops getCyclopsProfileLogLikelihood Coefficient drift detected. Resetting Cyclops object and recomputing all likelihood values computed so far.

It seems that this will repeat an almost identical process of failing to find the minimum from an initial starting position exactly 10 times. In the tasks we're executing we don't really have the luxury of tweaking other parameters to allow this to converge. Instead, it would be good to make this limit configurable so it can be set lower (e.g. something like Cyclops.LogLikelihood.MaxRetry).

@msuchard (Member):

@azimov -- how's this (fa3433d)?

@schuemie (Member):

Just to be clear: these aren't retries; it is recomputing the likelihood at prior points, but not re-doing the positioning of those points. In other words, it will eventually get there, and the result is not guaranteed to be non-informative just because we detected coefficient drift.

In general, in OHDSI we tend to mostly value getting the 'right' answer, while the amount of compute it takes to get there is of secondary concern. @azimov seems to suggest that there is a point where we don't want to spend more compute to get the right answer, and that this is that point. I don't share that opinion.

Don't get me wrong: I think computing profiles for SCCS takes way too much time, and I would like a more efficient solution. But until we have that, I don't think the solution is: don't compute profiles when doing so takes more than some arbitrary threshold.


schuemie commented Jul 1, 2024

@msuchard: Could you change the code you committed so the default value is maxRetries = Inf? That way, at least the code will by default behave the way it did before. It also means I don't have to change CohortMethod and SelfControlledCaseSeries now to set maxRetries = Inf.

If we are going to add this argument, could we also give it a more accurate name, like maxCorrections?


msuchard commented Jul 1, 2024

hmmm ... i do not believe i changed the default behavior, but my apologies if i did:

7cda607 (old line 919)

am going to avoid making any more changes until @schuemie and @azimov come to an agreement on best behavior.

@schuemie (Member):

Ah, I had forgotten I already set a maxResets inside the function. I'll push a change where the new function argument is renamed to maxResets. I think @azimov has agreed to let me make the final call.
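With the argument renamed to maxResets, a caller who wants the old never-give-up behavior could pass Inf explicitly. The sketch below is purely illustrative: apart from maxResets, the argument names and values shown are assumptions, not the actual Cyclops signature, which should be checked against the package documentation.

```r
library(Cyclops)

# Illustrative sketch only -- all arguments except maxResets are hypothetical.
# cyclopsFit is assumed to be a previously fitted Cyclops model object, and
# "exposureOfInterest" a covariate name in that model.
ll <- getCyclopsProfileLogLikelihood(
  object = cyclopsFit,
  parm = "exposureOfInterest",
  bounds = c(log(0.1), log(10)),  # hypothetical profiling bounds on the log scale
  maxResets = Inf                 # Inf: keep resetting on coefficient drift, as before
)
```

Setting maxResets = Inf preserves the pre-existing behavior discussed above, while a finite value caps how many times the object is reset and the likelihood values recomputed.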


azimov commented Jul 16, 2024

Yes @msuchard -- I defer to Martijn on this. It seems a single edge case caused my issue, and we should probably work on improving precision to resolve that.
