-
Notifications
You must be signed in to change notification settings - Fork 236
Collecting .smt2 files from F*'s regression suite
This make
invocation shown below will cause F* to verify a bunch of files, emitting a separate .smt2 file for each verification condition it produces.
I usually first remove all .smt2 files from the FStar working tree and then start the regression run.
make -C src utest -k 12 OTHERFLAGS='--z3refresh --log_queries' # regenerate all of them
The --z3refresh
option causes F* to create a separate .smt2 file for each problem.
The --log_queries
causes it to actually write out the .smt2 files to disk.
The -k 12
says to use 12 cores; tune it to the number of cores you actually want to use.
The generated files are named like
./ulib/queries-FStar.Int32-55.smt2
./ulib/queries-FStar.Int32-56.smt2
./ulib/queries-FStar.Int32-57.smt2
./ulib/queries-FStar.Int32-58.smt2
./ulib/queries-FStar.Int32-59.smt2
./ulib/queries-FStar.Int32-6.smt2
./ulib/queries-FStar.Int32-60.smt2
./ulib/queries-FStar.Int32-61.smt2
./ulib/queries-FStar.Int32-62.smt2
./ulib/queries-FStar.Int32-63.smt2
./ulib/queries-FStar.Int32-64.smt2
./ulib/queries-FStar.Int32-65.smt2
./ulib/queries-FStar.Int32-66.smt2
./ulib/queries-FStar.Int32-67.smt2
...
i.e., they have the form queries-<moduleName>-<queryNumber>.smt2
The --hint_info
option when additionally passed in OTHERFLAGS
above, causes F* to write on stdout some information about each query, hint replay etc.
For example, you will see output of the form:
(.\FStar.Integers.fst(275,50-275,53)@queries-FStar.Integers-98.smt2)
Query (FStar.Integers.f_uint_8, 5) succeeded (with hint) in 74 milliseconds with fuel 1 and ifuel 1
This mentions the F* source file and position that produced the query; the .smt2 file that corresponds to it; the query identifier (FStar.Integers.f_uint_8, 5
, in this case), and info about whether it succeeded while using a hint or not, the time it took and the fuel parameters that were used.
Here is a snippet of queries that failed hint replay.
$ grep -B 1 "failed (with hint)" dump | head -10
(.\FStar.BitVector.fst(74,0-74,34)@queries-FStar.BitVector-42.smt2)
Query (FStar.BitVector.lemma_xor_bounded, 5) failed (with hint) in 44 milliseconds with fuel 2 and ifuel 1
--
(.\FStar.SquashProperties.fst(48,0-53,94)@queries-FStar.SquashProperties-4.smt2)
Query (FStar.SquashProperties.excluded_middle_squash, 1) failed (with hint) in 37 milliseconds with fuel 2 and ifuel 1
--
(.\FStar.HyperHeap.fst(211,0-211,25)@queries-FStar.HyperHeap-84.smt2)
Query (FStar.HyperHeap.lemma_modset, 4) failed (with hint) in 54 milliseconds with fuel 2 and ifuel 1
--
(.\FStar.HyperHeap.fst(217,0-217,44)@queries-FStar.HyperHeap-88.smt2)
Each query that fails with a hint is immediately reattempted by F* without a hint.
So, if you see, e.g.,
(.\FStar.HyperHeap.fst(211,0-211,25)@queries-FStar.HyperHeap-84.smt2)
Query (FStar.HyperHeap.lemma_modset, 4) failed (with hint) in 54 milliseconds with fuel 2 and ifuel 1
Then queries-FStar.HyperHeap-85.smt2
contains the same query re-attempted without a hint. For example:
$ grep -A 3 'FStar.HyperHeap-84.smt2' dump
(.\FStar.HyperHeap.fst(211,0-211,25)@queries-FStar.HyperHeap-84.smt2)
Query (FStar.HyperHeap.lemma_modset, 4) failed (with hint) in 54 milliseconds with fuel 2 and ifuel 1
(.\FStar.HyperHeap.fst(211,0-211,25)@queries-FStar.HyperHeap-85.smt2)
Query (FStar.HyperHeap.lemma_modset, 4) succeeded in 547 milliseconds with fuel 2 and ifuel 1
It would be nice to have a more systematic way of correlating hints and no-hints versions of the same queries. Although, it should be possible to write some simple scripts to do this correlation based on the above information.