
Proof of concept: allow Debug.log in fuzz tests #156

Open
wants to merge 1 commit into master
Conversation

drathier
Collaborator

Proof of concept! Allow Debug.log to be used effectively in fuzz tests.

Idea: pass an extra record argument { log : String -> a -> a } to every fuzz test function. This log function is a no-op until we've found the simplest failing test case. Then we run that simplest test case once more with a record that contains the real Debug.log function. Since Debug.log inherently works through side effects, we can't easily capture its stdout output and re-print it later.

Fuzz tests:

import Expect
import Fuzz exposing (Fuzzer, intRange)
import Test exposing (Test, describe, fuzz, fuzz2, fuzz3)


all : Test
all =
    describe "Meta Debug.log"
        [ fuzz fuzzer "debug log for 1-arg fuzzer" <|
            \meta a ->
                meta.log (Debug.toString { desc = "fuzz1-meta-log", a = a }) <|
                    (a |> Expect.notEqual 5)
        , fuzz2 fuzzer fuzzer "debug log for 2-arg fuzzer" <|
            \meta a b ->
                meta.log (Debug.toString { desc = "fuzz2-meta-log", a = a, b = b }) <|
                    (( a, b ) |> Expect.notEqual ( 2, 5 ))
        , fuzz3 fuzzer fuzzer fuzzer "debug log for 3-arg fuzzer" <|
            \meta a b c ->
                meta.log (Debug.toString { desc = "fuzz3-meta-log", a = a, b = b, c = c }) <|
                    (( a, b, c ) |> Expect.notEqual ( 2, 3, 5 ))
        ]


fuzzer : Fuzzer Int
fuzzer =
    intRange 2 5

Output:

{ a = 5, desc = "fuzz1-meta-log" }: Fail { description = "Expect.notEqual", given = Nothing, reason = Equality "5" "5" }
{ a = 2, b = 5, desc = "fuzz2-meta-log" }: Fail { description = "Expect.notEqual", given = Nothing, reason = Equality "(2,5)" "(2,5)" }
{ a = 2, b = 3, c = 5, desc = "fuzz3-meta-log" }: Fail { description = "Expect.notEqual", given = Nothing, reason = Equality "(2,3,5)" "(2,3,5)" }


↓ Meta Debug.log
✗ debug log for 1-arg fuzzer
Given 5

     
    5
    ╵
    │ |> Expect.notEqual
    ╷
    5
     


↓ Meta Debug.log
✗ debug log for 2-arg fuzzer
Given (2,5)

         
    (2,5)
    ╵
    │ |> Expect.notEqual
    ╷
    (2,5)
         


↓ Meta Debug.log
✗ debug log for 3-arg fuzzer
Given (2,3,5)

           
    (2,3,5)
    ╵
    │ |> Expect.notEqual
    ╷
    (2,3,5)
           


TEST RUN FAILED

Passed: 258
Failed: 3

…prints when testing the final shrunken value is actually printed to stdout
@@ -128,6 +144,10 @@ findSimplestFailure (Rose failingValue branches) getExpectation oldExpectation =

        -- base case: we cannot simplify any more
        Nothing ->
            let
                _ =
                    getExpectation { log = Debug.log } failingValue
@harrysarson
Collaborator
harrysarson commented Oct 18, 2020
Would we want to let the test runner hook into this in some way, so that it can provide a custom logging function? I guess this would require the function to return a Cmd (for port logging), so it may not be possible.
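
For illustration (hypothetical, not part of this PR), a runner-provided, port-based logger would have an effectful shape along these lines:

port module Runner exposing (log)

-- Hypothetical port-based logger. It returns a Cmd, which only the
-- Elm runtime can execute; a pure test function (a -> Expectation)
-- has no way to return that Cmd, which is the problem noted above.
port log : String -> Cmd msg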

@harrysarson added the labels "fuzzers" (Concerns randomness or simplifiers) and "major release blocker" (Issue to be resolved before next major version bump) on Oct 18, 2020
@harrysarson
Collaborator

This is really cool!

@avh4
Collaborator

avh4 commented Oct 18, 2020

imo this puts an extremely high burden on the user... it seems like if they wanted to log something in their code, they would have to thread the meta.log function all the way down to wherever they actually want to log something. Or am I missing something about what this does or where it's headed? I wonder if there's a way to change the test runner to do this without changing the public API and putting the burden on the user to learn a new way of logging for fuzz tests.

@harrysarson
Collaborator

I guess the alternative is a Test.Log function. The kernel code backing this function would then need to read some global state to know what to do (log or not log).
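
Something like this, perhaps (a sketch only; the kernel module and function names are made up):

module Test.Log exposing (log)

import Elm.Kernel.Test

-- Same shape as Debug.log; the hypothetical kernel implementation
-- would consult runner-owned global state to decide whether to
-- actually record the message or silently pass the value through.
log : String -> a -> a
log =
    Elm.Kernel.Test.log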

@harrysarson
Collaborator

harrysarson commented Oct 19, 2020

The JavaScript test runner ava uses a logging function passed to each test when it is run (like we do in this PR), so there is precedent for @drathier's approach.

@avh4
Collaborator

avh4 commented Oct 19, 2020

Is there a way to make it work without the user having to do anything special? elm-test controls the execution environment of each test, so I'd expect there'd be a way for it to set up Debug.log to do what it wants for each test run?

Wasn't there also some discussion about being able to capture the console logs for each test so that any output can be put next to the correct error message? If that were done, then wouldn't this basically be solved as well, as the console output for any individual run could be discarded at the runner's (in this case the fuzz test module's) discretion?

@drathier
Collaborator Author

Yeah, this is the pure FP approach, but we do have kernel code available, so we could instead have a collection mechanism for this, where the test runner does the actual collection or replaces the JS implementation of Debug.log. I prefer a non-pure approach, because it just works.

@drathier
Collaborator Author

How about this then? We try to gather up log prints using help from kernel code and the test runner.

  1. The test runner overwrites the Debug.log implementation in the compiled JS code, instead storing the logs in an array for later.
  2. We expose one kernel function that destructively clears that log array and returns the log entries that were in it (see the sketch after this list).
  3. This package can then gather up the logs per failed test case and present them nicely to the user.
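
A minimal Elm-side sketch of steps 2 and 3, assuming a hypothetical kernel binding (all names made up):

import Elm.Kernel.Test
import Expect exposing (Expectation)

-- Hypothetical kernel binding (step 2): destructively empties the
-- log array and returns the entries it held.
flushLogs : () -> List ( String, String )
flushLogs =
    Elm.Kernel.Test.flushLogs

-- Step 3: run the simplest failing input once more, then drain the
-- logs it produced so they can be attached to the failure report.
runAndCollectLogs : (a -> Expectation) -> a -> ( Expectation, List ( String, String ) )
runAndCollectLogs getExpectation simplestInput =
    let
        expectation =
            getExpectation simplestInput
    in
    ( expectation, flushLogs () )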

@harrysarson
Collaborator

Could we have an internal API like

capture : (a -> b) -> { output : b, logs : List ( String, String ) }

function _Test_capture(func) {
    // no nested captures: logs must not already be diverted
    assert(__Debug_divertLogsTo === null);
    __Debug_divertLogsTo = [];
    var output = func();
    var logs = __Debug_divertLogsTo;
    __Debug_divertLogsTo = null;
    return {
        __$output: output,
        __$logs: logs
    };
}

Although this API would require a patch to elm/core:Elm.Kernel.Debug to work.
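
If something like that landed, re-running the shrunken input could look roughly like this on the Elm side (hypothetical usage of the proposed capture function above):

import Expect exposing (Expectation)

-- Evaluate the simplest failing input one final time, collecting
-- whatever it logged alongside the resulting expectation.
evaluateSimplest : (a -> Expectation) -> a -> { output : Expectation, logs : List ( String, String ) }
evaluateSimplest getExpectation simplestInput =
    capture (\() -> getExpectation simplestInput)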


@drathier
Collaborator Author

That sounds good, yes. We should probably disable Debug.log outside of this function, just passing through values without printing or logging them, for performance reasons.

@harrysarson
Collaborator

We should probably disable Debug.log outside of this function, just passing through values without printing or logging them,

If someone is logging outside of a test (I guess the only way to do this is to have log statements in code that runs on load), then I think we should not do anything special with those logs.

@drathier
Collaborator Author

drathier commented Oct 23, 2020

When generating the log output for fuzz tests we'd do this:

  1. run a fuzz test throwing away logs
  2. on error, simplify the error, throwing away logs
  3. run the simplest input through the fuzz test function one last time, this time collecting the Debug.log output into a list
  4. fetch that list of logs and put it into the test failure data structure, for later use in error reporting

Steps 1 and 2 would generate a ton of unneeded log output and performance would be really bad. I'm expecting people to add log statements to their tests and just leave them in, expecting them to have zero performance impact on successful tests. That's at least how I would want to treat them.

@harrysarson
Collaborator

harrysarson commented Oct 23, 2020

Would what you are suggesting work as the following two APIs?

suppressLogs : (a -> b) -> b
captureLogs : (a -> b) -> { output : b, logs : List ( String, String ) }
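
As a sketch of how those two would map onto the four steps above (hypothetical names throughout):

import Expect exposing (Expectation)

-- Steps 1 and 2: candidate runs and shrinking happen with logging
-- disabled, so leftover log statements cost (almost) nothing.
runQuietly : (a -> Expectation) -> a -> Expectation
runQuietly getExpectation input =
    suppressLogs (\() -> getExpectation input)

-- Steps 3 and 4: one final run of the simplest input, collecting
-- its logs for the failure report.
runForReport : (a -> Expectation) -> a -> { output : Expectation, logs : List ( String, String ) }
runForReport getExpectation simplestInput =
    captureLogs (\() -> getExpectation simplestInput)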

@drathier
Collaborator Author

It could, yes

@gampleman
Contributor

To solve the problem of people leaving logging in tests forever, you could mark tests containing logging as pending rather than passed.

@harrysarson
Collaborator

Is it a problem if people leave logging in tests forever?

@drathier
Collaborator Author

drathier commented Nov 9, 2020

Why not let them leave the logging in?

@gampleman
Contributor

What's the point of logging that exists just to get discarded?

@harrysarson
Collaborator

What is the point of a test that doesn't fail?

I guess the idea is that you can put logging in your tests and then debugging a test failure becomes much easier because the failure message comes with log messages.

@mpizenberg
Contributor

Just to chip in on the discussion: in elm-test-rs, capturing calls to Debug.log is possible by capturing all stdout of a worker into a variable that gets flushed to the supervisor via the worker thread's parentPort.

https://github.com/mpizenberg/elm-test-rs/blob/b8192ac947837792c9da49b3eb3d841ddbabd297/templates/node_runner.js#L7-L9

It works well for unit tests, but isn't very useful with the huge amount of logging in fuzz tests. Having a variant of the log function that is deactivated except for the final shrunken run would be awesome.
