Hash fuzzed input to detect duplicates? #11

mgold · 2018-08-04T21:36:13Z

When generating random values from a fuzzer, there is no guarantee that each one will be unique. You may ask for 100 cases but get less. It may be possible you get much less.

One solution is to hash each input, store the hashes, and reject inputs with a duplicate hash. We'd need to fail the test after some number of failed attempts to create distinct inputs, perhaps max 20 (2*numberOfRequestedRuns).

Since we'd want a designated union type tag for this failure condition, it makes sense to do this while we're doing a major revision.

Is there any interest in exploring this idea?

The text was updated successfully, but these errors were encountered:

drathier · 2018-08-05T16:25:57Z

There definitely is, but I'd like to pair this with knowing roughly how many possible values a fuzzer can produce. No reason to keep generating booleans to try to find a third value. Also, we probably want to generate more values if we're fuzzing a huge thing, like a Dict (Int, Int) (List String).

Janiczek · 2022-07-27T13:27:14Z

No reason to keep generating booleans to try to find a third value.

The status quo is that you still keep generating booleans, and run the toExpectation function to boot.

I think we could do this optimization separately (retry generating if we've already tested an input for a test -- skipping some toExpectation calls), and then there is the separate issue of generating all values exhaustively if the fuzzer allows it. I'll create an issue for that one as I and @gampleman have some thoughts around it already :)

Edit: #188

mgold added the Design Question Needs design discussion label Aug 4, 2018

mgold added the fuzzers Concerns randomness or simplifiers label Aug 24, 2018

Janiczek linked a pull request Oct 14, 2022 that will close this issue

Skip values (moreso, RandomRuns) we have already tested #207

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hash fuzzed input to detect duplicates? #11

Hash fuzzed input to detect duplicates? #11

mgold commented Aug 4, 2018

drathier commented Aug 5, 2018

Janiczek commented Jul 27, 2022 •

edited

Loading

Hash fuzzed input to detect duplicates? #11

Hash fuzzed input to detect duplicates? #11

Comments

mgold commented Aug 4, 2018

drathier commented Aug 5, 2018

Janiczek commented Jul 27, 2022 • edited Loading

Janiczek commented Jul 27, 2022 •

edited

Loading