Leakage traces #431

OwenConoly · 2024-08-25T06:40:56Z

This PR will add leakage traces to the source and intermediate semantics, augment the compiler-correctness statement and proof to talk about leakage traces, and add some examples involving leakage traces.

OwenConoly · 2024-08-25T06:46:45Z

For now, this branch only has some edits to Semantics.v. Before I add anything else, I wanted to ask whether I should be making these edits (i.e., adding leakage traces) in Semantics.v, MetricSemantics.v, or both? Also, why are Semantics.v and MetricSemantics.v separate, and are there plans to merge them eventually?

bedrock2/src/bedrock2/Semantics.v

andres-erbsen

Semantics.v, MetricSemantics.v, or both?

IMO the best option is to create additional semantics files. We want to have only one compiler proof, so that would use LeakageMetricSemantics. Some program proofs may be done with that one as well, but the examples for our current manuscript probably shouldn't be, so there we'd probably want want LeakageSemantics as well. In an unverified software project, this kind of duplication would be asking for inconsistency bugs, but I hope it won't be too hard to state and prove the appropriate relationships between these variants.

bedrock2/src/bedrock2/Semantics.v

OwenConoly · 2024-08-26T15:46:55Z

I hope it won't be too hard to state and prove the appropriate relationships between these variants.

Proving the appropriate relationship between Semantics.v and LeakageSemantics.v will only be trivial if we modify Semantics.v so that exec takes the pick_sp parameter. If LeakageSemantics exec takes pick_sp as a parameter but Semantics exec stays as is, then proving any sort of equivalence between them is equally as nontrivial as that semantics-equivalence statement we put in the paper, the one that I wrote a ~3000-line proof of.

So, should I modify Semantics and MetricSemantics execs to take pick_sp as a parameter, and then prove the appropriate relationships between Semantics vs. MetricLeakageSemantics, MetricSemantics vs. MetricLeakageSemantics, and LeakageSemantics vs MetricLeakageSemantics?

OwenConoly · 2024-08-26T15:55:21Z

However, the Semantics-exec-implies-LeakageSemantics-exec direction is trivial even if LeakageSemantics takes pick_sp as a parameter and Semantics doesn't. So if that direction of the equivalence is all we care about, then potentially I could just leave Semantics and MetricSemantics alone.

andres-erbsen · 2024-08-26T16:20:53Z

Ah, indeed. I was only thinking of the easy direction, Semantics implies LeakageSemantics. I don't think we should add pick_sp to current semantics. @samuelgruetter do you think it would be acceptable to carry both copies in the repo, but only the half of the equivalence proof that allows program proofs without leakage to be used with a compiler proof that preserves everything they say about leakage?

However, the other direction of this relationship might be a good opportunity to drill down on the difficulty in the equivalence proof and perhaps simplify it. We don't have to do this, but I'm thinking that if you could state the desired equivalence with a minimized version of semantics that is just complicated enough to illustrate the challenge (without abstract traces, predictors, or even function calls and etc), then I would give it a try. "How hard can pushing a quantifier into an inductive be" 🚀😂🙄

samuelgruetter · 2024-08-26T16:30:21Z

Ah, indeed. I was only thinking of the easy direction, Semantics implies LeakageSemantics. I don't think we should add pick_sp to current semantics. @samuelgruetter do you think it would be acceptable to carry both copies in the repo, but only the half of the equivalence proof that allows program proofs without leakage to be used with a compiler proof that preserves everything they say about leakage?

Sounds good!

However, the other direction of this relationship might be a good opportunity to drill down on the difficulty in the equivalence proof and perhaps simplify it. We don't have to do this, but I'm thinking that if you could state the desired equivalence with a minimized version of semantics that is just complicated enough to illustrate the challenge (without abstract traces, predictors, or even function calls and etc), then I would give it a try "How hard can pushing a quantifier into an inductive be" 🚀😂🙄

This kind of proof would be interesting as a minimal standalone file that might accompany a paper as "supplemental material", where the focus is really to make it as simple as possible, but without simplifying away the core difficulty. (And if at some point, we actually need this direction of the equivalence for real, we can again scale it up to the whole language).

OwenConoly · 2024-08-26T17:22:51Z

"How hard can pushing a quantifier into an inductive be"

This could be nice, yeah! To make it as academic as possible (and remove the coqutil/bedrock2 dependency and get a standalone file) I'd probably make the following simplifications to the equivalence proof I currently have:

no memory, only locals
no leakage traces or metrics either
locals are nats instead of words (in addition to removing a dependency, this makes me happy because proofs seem less trivial when they're written about Turing-complete languages)
no ifs, since you can write them with whiles anyway
instead of stack allocation, the quantifier is just an RNG construct
no function calls

I'd probably keep IO calls though, since the nondeterminism there seems to add an interesting dimension.

OwenConoly · 2024-08-26T17:56:15Z

Oh, bedrock2 is Turing-complete! I forgot that you can store and retrieve as much information as you want in the IO trace (with an appropriate choice of ext_spec). That's kinda cool.

I had thought that it was just a finite-state machine, so in principle, you could prove some absolute upper bound (depending only on word size and source program) on the size of an exec proof tree (and then this sort of equivalence theorem becomes trivial).

andres-erbsen · 2024-08-26T18:06:51Z

👍 You can store a turing-machine tape in a nat, so I think all options here are turing-complete. And I'd even remove I/O at first, we can add it back later if we want.

OwenConoly · 2024-08-27T15:11:23Z

The existence of (Metric)LeakageSemantics.v implies the existence of:

(Metric)LeakageLoops.v, (Metric)LeakageProgramLogic.v, (Metric)LeakageWeakestPreconditionProperties.v, and (Metric)LeakageWeakestPrecondition.v,

right?

andres-erbsen · 2024-08-27T17:47:14Z

Yes. At some point I may try to share more code, but for new these are just copied I think.

samuelgruetter · 2024-08-27T17:57:50Z

The existence of (Metric)LeakageSemantics.v implies the existence of:

(Metric)LeakageLoops.v, (Metric)LeakageProgramLogic.v, (Metric)LeakageWeakestPreconditionProperties.v, and (Metric)LeakageWeakestPrecondition.v,

right?

Will we ever need a program logic that does both metrics and leakage at the same time? If not, the needed files should probably just be

MetricLeakageSemantics.v (because in the compiler, we want both at the same time)
(non-metric) LeakageLoops.v, LeakageProgramLogic.v, LeakageWeakestPreconditionProperties.v, and LeakageWeakestPrecondition.v

andres-erbsen · 2024-08-27T18:08:15Z

I don't have specific code planned where this would be needed, but I imagine basic library functions like memset could end up being used by callers that need either kind of spec. I'm fine with procrastinating on building support for this, though.

OwenConoly · 2024-08-27T18:49:54Z

Ok. I can just do the MetricLeakage files while I'm at it, if they'll be needed eventually anyway. (Probably I'll end up making some errors in MetricLeakageProgramLogic, since there won't be any test cases for it.)

Unrelatedly, since I am adding the separate files for Leakage stuff instead of changing what was already there, my changes to the source language shouldn't break the compiler proof. So it seems natural to split this PR into two: in this PR I can just change the source language and add some examples, and then I can make a separate PR for compiler proof things. Does that sound good?

andres-erbsen · 2024-08-27T21:13:33Z

Separate PRs sounds good to me. (There is some possibility that we will want to review them together, though.)

OwenConoly · 2024-09-30T23:39:18Z

compiler/src/compiler/CustomFix.v

+Check Fix_eq.
+Definition type_of_Fix_eq :=
+  forall (A : Type) (R : A -> A -> Prop) (Rwf : well_founded R) 
+    (P : A -> Type) (F : forall x : A, (forall y : A, R y x -> P y) -> P x),
+    (forall (x : A) (f g : forall y : A, R y x -> P y),
+        (forall (y : A) (p : R y x), f y p = g y p) -> F x f = F x g) ->
+    forall x : A, Fix Rwf P F x = F x (fun (y : A) (_ : R y x) => Fix Rwf P F y).
+
+Check Fix_eq'.
+Definition type_of_Fix_eq' :=
+  forall (A : Type) (R : A -> A -> Prop) (Rwf : well_founded R) 
+    (P : A -> Type) (F : forall x : A, (forall y : A, R y x -> P y) -> P x),
+    (forall (x : A) (f g : forall y : A, R y x -> P y),
+        (forall (y : A) (p1 p2 : R y x), f y p1 = g y p2) -> F x f = F x g) ->
+    forall x : A, Fix Rwf P F x = F x (fun (y : A) (_ : R y x) => Fix Rwf P F y).
+
+Goal type_of_Fix_eq' -> type_of_Fix_eq.
+  cbv [type_of_Fix_eq type_of_Fix_eq']. auto. Qed.


Fix_eq was apparently useless for proving the fixpoint equation for some functions that I defined for the compiler proof, so I proved and used this Fix_eq' instead. I'm unsure whether Fix_eq is genuinely not applicable here; it's probable that I'm just not using Fix/Fix_eq as intended. If Fix_eq is really not helpful here, then I wonder why they don't have Fix_eq' in the standard library.

I think that, a few months ago, I had a good grasp on why Fix_eq is not helpful for the function I defined and why Fix_eq' is needed instead. But my current level of understanding is just "if I apply Fix_eq instead of Fix_eq', then the proof gets stuck".

@andres-erbsen, you've directly used Fix for general recursion, right? Has this issue come up before?

I have run into this before, and I think I did get it working with stdlib Fix_eq every time. Unfortunately I do not recall how, but perhaps the trick was about clearing the recursive call before the abstract proof that R decreases. However, I remember spending at least a few minutes on it and feeling annoyed that I had to, so feel free to keep the more flexible lemma and perhaps even submit it to stdlib.

OwenConoly · 2024-10-02T22:57:29Z

Unfortunately, my attempt at writing my compiler-proof leakage-transformation functions without CPS did not work out nicely, for two reasons:

It's convenient to be able to do some error-handling-ish logic, where hitting a certain thing in the recursion causes the function to exit immediately. This is kind of annoying to do without CPS: the solution I found was to carry around a flag saying 'have we hit an error yet'. This isn't a big problem though, and on its own probably wouldn't make using CPS worth it.
The bigger issue is that, because we have the deterministic stack allocation addresses, our inductive hypothesis for the compiler phase proofs needs to say that the high-level pick_sp function is of a particular form. With the function in CPS, there's a natural way to say what this form is, and it's easy to prove given the recursive structure of the continuation-passing function. With the function not in CPS, it is not so convenient to say what it should be. Basically, the flexibility lost is that in CPS, we can just state that the high-level pick_sp function should look like the leakage transformation function, but it doesn't matter what the continuation is.

Amusingly, this second issue is a non-issue if we work with nondeterministic stackalloc addresses. This made me assume it wouldn't be an issue with the deterministic stackalloc addresses, but apparently I was wrong.

I probably didn't explain any of that well enough to be understandable; I could elaborate if any of it is interesting. But anyway, since I have to use CPS now, I have a need to write a non-structurally-recursive function which takes a function as an argument. I think there are three options for how to prove the fixpoint equation for such a function:

Use a functional extensionality axiom.
Prove some analogue of Fix_eq that's parameterized over arbitrary equivalence relations instead of just equality.
Do what is being done in fiat/src/Common/Wf1.v. I don't know exactly what is going on there, but it looks like it solves this problem?

I already did option (2), so I'm inclined to just stick with that unless there are compelling reasons for doing something else. I imagine there are no good reasons for doing (1), but I'm not sure about (3).

compiler/src/compiler/CustomFix.v

JasonGross · 2024-10-03T00:37:09Z

3. Do what is being done in fiat/src/Common/Wf1.v. I don't know exactly what is going on there, but it looks like it solves this problem?

I already did option (2), so I'm inclined to just stick with that unless there are compelling reasons for doing something else. I imagine there are no good reasons for doing (1), but I'm not sure about (3).

I'm not sure. Wf1 is a complicated general wrapper around the simple trick that if you know how many arguments your recursive function takes, you can demand pointwise extensionality rather than equality, and the induction goes through. It sounds like what you did is somewhat more general than this (though note that you maybe want to parameterize over a PER instead of an equivalence, if you want to avoid funext?)

… eexists)

fancy new notations!)

…ly broken

…oo much

because don't have any leakagesemantics -> semantics lemmas

OwenConoly · 2025-01-09T20:52:26Z

bedrock2/src/bedrock2Examples/stackalloc.v

+  Lemma word_to_bytes' (a : word) :
+    exists l, length l = (Z.to_nat ((width + 7) / 8)) /\
+                a = word.of_Z (LittleEndianList.le_combine l).


Thoughts about where this belongs? coqutil/Word/Properties.v, perhaps?

Actually, coqutil/Word/LittleEndianList.v seems better.

Are you sure you want a lemma with an exists litke this at all? Probably a lemma that uses a specific value of l would do, and maybe it exists already.

Yeah, word_to_bytes' is a corollary of word_to_bytes (right above it), which has a specific value of l. I just thought it might be nice to abstract away the unnecessary details, since (so far as I know) this lemma would mainly be useful for stack deallocation, where we wouldn't care what the bytes actually are.

I don't think a lemma like this existed before I proved it. I just conducted an unsuccessful three-minute search, and also this comment makes it sound like there are no word_to_bytes lemmas.

OwenConoly · 2025-01-10T21:57:36Z

compiler/src/compiler/Pipeline.v

+    Definition remove_n_r {X : Type} (n : nat) (l : list X) :=
+      rev (skipn n (rev l)).
+
+    Lemma remove_n_r_spec {X : Type} (l1 l2 : list X) :
+      remove_n_r (length l2) (l1 ++ l2) = l1.
+    Proof.
+      cbv [remove_n_r]. rewrite rev_app_distr. rewrite List.skipn_app_r.
+      - apply rev_involutive.
+      - rewrite length_rev. reflexivity.
+    Qed.


This probably belongs somewhere other than Pipeline.v. Shall I put it in coqutil?

OwenConoly commented Aug 25, 2024

View reviewed changes

bedrock2/src/bedrock2/Semantics.v Outdated Show resolved Hide resolved

OwenConoly commented Aug 25, 2024

View reviewed changes

bedrock2/src/bedrock2/Semantics.v Outdated Show resolved Hide resolved

andres-erbsen reviewed Aug 26, 2024

View reviewed changes

bedrock2/src/bedrock2/Semantics.v Outdated Show resolved Hide resolved

bedrock2/src/bedrock2/Semantics.v Outdated Show resolved Hide resolved

OwenConoly force-pushed the leakage_traces branch from bc1fcb2 to cff166c Compare August 27, 2024 15:08

OwenConoly force-pushed the leakage_traces branch from 55105b1 to 989d21b Compare September 6, 2024 03:22

OwenConoly force-pushed the leakage_traces branch 5 times, most recently from ec43d73 to b0dbeb7 Compare September 30, 2024 21:05

OwenConoly commented Sep 30, 2024

View reviewed changes

OwenConoly force-pushed the leakage_traces branch 2 times, most recently from d4d014e to 9fe2d00 Compare October 1, 2024 03:50

JasonGross reviewed Oct 3, 2024

View reviewed changes

compiler/src/compiler/CustomFix.v Outdated Show resolved Hide resolved

OwenConoly added 24 commits January 9, 2025 14:57

make some fixpoints local

bb9ab6c

make ct.v use FE310CSemantics instead of BasicC32Semantics

9de1beb

revert BasicC32Semantics to have no leakage traces

5cb3ee0

make leakage straightline work way better

0f3c7b7

fix straightline in a different way (still using letexists instead of…

b266837

… eexists)

improve leakage straightline.

073d181

fnspec notations

8f74745

add fnspec_ex! notations for specifying constant-time

dc7a6be

work on bedrock2Examples involving leakage (including using the

9b5bab5

fancy new notations!)

make fnspec! exists notation more intuitive

2dc013b

fix kyberslash proof

bbfe067

work on swap and stackalloc

f6526fe

trying to fix function calls and things... swap_swap proof mysterious…

4e9c3a9

…ly broken

fix mysterious problem with swap_swap---special_intro was unfolding t…

5a8a974

…oo much

change proofs to deal with "enter" in straightline working properly

2d249fb

fix enter better this time

fac42da

work on top level loop

9bb6d06

fix event loop

34373d4

add leakage stuff to memory_mapped_ext_calls_compiler.v

463eef2

make "make" work in compiler subdirectory

d4fca34

fix KamiRiscvStep (it broke because leakage stuff was added to run1)

5f974ea

make end2endpipeline go through... can't get end2endlightbulb to work,

46f2f3b

because don't have any leakagesemantics -> semantics lemmas

add non-leakage ext_specs to FE310CSemantics and memory_mapped_ext_spec

bbac468

make end2end compile

7d40f6c

OwenConoly force-pushed the leakage_traces branch from bead77f to b2d1b5c Compare January 9, 2025 19:57

OwenConoly commented Jan 9, 2025

View reviewed changes

OwenConoly commented Jan 10, 2025

View reviewed changes

OwenConoly force-pushed the leakage_traces branch from 246377c to 30cc501 Compare January 10, 2025 22:43

various cleanup: typos, whitespace, comments, etc

8f1f5ee

OwenConoly force-pushed the leakage_traces branch from 30cc501 to 8f1f5ee Compare January 10, 2025 23:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Leakage traces #431

Leakage traces #431

OwenConoly commented Aug 25, 2024

OwenConoly commented Aug 25, 2024 •

edited

Loading

andres-erbsen left a comment •

edited

Loading

OwenConoly commented Aug 26, 2024

OwenConoly commented Aug 26, 2024

andres-erbsen commented Aug 26, 2024 •

edited

Loading

samuelgruetter commented Aug 26, 2024

OwenConoly commented Aug 26, 2024

OwenConoly commented Aug 26, 2024 •

edited

Loading

andres-erbsen commented Aug 26, 2024

OwenConoly commented Aug 27, 2024

andres-erbsen commented Aug 27, 2024

samuelgruetter commented Aug 27, 2024

andres-erbsen commented Aug 27, 2024

OwenConoly commented Aug 27, 2024 •

edited

Loading

andres-erbsen commented Aug 27, 2024

OwenConoly Sep 30, 2024

andres-erbsen Oct 1, 2024

OwenConoly commented Oct 2, 2024

JasonGross commented Oct 3, 2024

OwenConoly Jan 9, 2025

OwenConoly Jan 9, 2025

andres-erbsen Jan 10, 2025

OwenConoly Jan 10, 2025

OwenConoly Jan 10, 2025

Leakage traces #431

Are you sure you want to change the base?

Leakage traces #431

Conversation

OwenConoly commented Aug 25, 2024

OwenConoly commented Aug 25, 2024 • edited Loading

andres-erbsen left a comment • edited Loading

Choose a reason for hiding this comment

OwenConoly commented Aug 26, 2024

OwenConoly commented Aug 26, 2024

andres-erbsen commented Aug 26, 2024 • edited Loading

samuelgruetter commented Aug 26, 2024

OwenConoly commented Aug 26, 2024

OwenConoly commented Aug 26, 2024 • edited Loading

andres-erbsen commented Aug 26, 2024

OwenConoly commented Aug 27, 2024

andres-erbsen commented Aug 27, 2024

samuelgruetter commented Aug 27, 2024

andres-erbsen commented Aug 27, 2024

OwenConoly commented Aug 27, 2024 • edited Loading

andres-erbsen commented Aug 27, 2024

OwenConoly Sep 30, 2024

Choose a reason for hiding this comment

andres-erbsen Oct 1, 2024

Choose a reason for hiding this comment

OwenConoly commented Oct 2, 2024

JasonGross commented Oct 3, 2024

OwenConoly Jan 9, 2025

Choose a reason for hiding this comment

OwenConoly Jan 9, 2025

Choose a reason for hiding this comment

andres-erbsen Jan 10, 2025

Choose a reason for hiding this comment

OwenConoly Jan 10, 2025

Choose a reason for hiding this comment

OwenConoly Jan 10, 2025

Choose a reason for hiding this comment

OwenConoly commented Aug 25, 2024 •

edited

Loading

andres-erbsen left a comment •

edited

Loading

andres-erbsen commented Aug 26, 2024 •

edited

Loading

OwenConoly commented Aug 26, 2024 •

edited

Loading

OwenConoly commented Aug 27, 2024 •

edited

Loading