Provide recommendation to counter xz utils style attack #560
Conversation
The malicious attack on xz utils slipped through many defenses because the "source" package included pre-generated malicious code. This meant that review of the source code (e.g., as seen by git) couldn't find the problem. This proposes a best practice to counter it. The text is longer than I'd like, but it's hard to make it short, and this was a worrying attack, so I think it's reasonable to say this. We'll probably need to renumber this proposal if we also add the proposed text to counter attacks like polyfill.io: #559 ... but I think that's okay!

Signed-off-by: David A. Wheeler <[email protected]>
+1
+1
Thinking about this a bit more. Neither published artifacts nor source repositories are completely safe from supply chain attacks, but are we confident that the surface area for risk is absolutely less with source repos? And for all ecosystems and package registries? If the xz attack had been via the source rather than the artifact, would we be having a different conversation?

One potential protection that (at least some) package registries give you is that usually, once an artifact is published, it cannot be tampered with. For example, if I But if I

Notably, if the project was compromised, a rogue maintainer could poison the source repository, and in this example, version

Therefore, there are scenarios in which relying on the published package is safer.

There is also something to be said for non-malicious situations, where maintainers are correcting tags or simply making mistakes. For example, I have deleted erroneous tags in projects before. As an example, some projects use a tag format like

Clearly there is not a perfect answer. The question is: do we have a recommendation that we can confidently say is the better option? I'm not convinced at this point that there is, and I think there is some nuance depending on which ecosystems and registries we are talking about.
Indeed, in any language with a package repository, git is for development, not consumption. I continue to think the only true solution here is to use the git repo, replicate the build process, and measure how close the result is to the published, immutable artifact, post-publish.
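That post-publish comparison could be sketched roughly as follows. This is a minimal sketch, not anyone's actual tooling: it assumes you already have the rebuilt tree and the unpacked published artifact side by side, and a real check would also have to normalize timestamps, permissions, and other sources of build nondeterminism before hashing.

```python
import hashlib
from pathlib import Path

def tree_digest(root: str) -> str:
    """Hash every file's relative path and contents, in sorted order,
    so two directory trees with identical contents get identical digests."""
    h = hashlib.sha256()
    base = Path(root)
    for path in sorted(p for p in base.rglob("*") if p.is_file()):
        h.update(str(path.relative_to(base)).encode())
        h.update(path.read_bytes())
    return h.hexdigest()

def matches_published(built_dir: str, published_dir: str) -> bool:
    """True if the locally rebuilt tree is byte-identical to the
    unpacked published artifact (hypothetical directory names)."""
    return tree_digest(built_dir) == tree_digest(published_dir)
```

Anyone can run such a comparison, including for historical releases, which is what makes it attractive as an independent, replicable audit step.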
```mermaid
flowchart
    0[Install tarball from registry]
    1[Rejoice]
    a{Concerned about compromised tarballs?}
    b(Build from source)
    c{Concerned about compromised source?}
    0 --> a
    a --> |YES| b
    b --> c --> |YES| 0
    a --> |NO| 1
    c --> |NO| 1
```
Indeed, because then you get to rely on the immutable tarball, and you get to verify its contents against the repo independently, without any unpaid work from maintainers, in a way that anyone can audit and replicate, including for historical packages whose maintainers are no longer around.
using this suggestion, if I want version `1.2.3` of a package, I have to:
- determine if a `1.2.3` branch or tag exists in the source repo, and get the name of that branch/tag (it may or may not be `1.2.3` exactly)
- if the branch/tag exists, I must trust that it matches the released artifact's substantive content
- if I can't trust that it matches, then what do I do?
- if I can't find a matching branch/tag, or don't trust it, or don't want to use it, then what do I do?

There is also no guarantee that the published artifact at version `1.2.3` matches what the repo says via branch or tag is `1.2.3`. The published artifact is not associated with a commit id or anything (and even if it were, that commit id may not exist anymore).
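The ambiguity in that first step can be made concrete: a consumer tool has to guess among several common tag spellings, and may find none. The candidate formats below are a heuristic I made up for illustration; there is no standard governing how projects name release tags.

```python
from typing import Optional

# Common spellings projects use for release tags; there is no
# standard, so any such list is an assumption, not a rule.
CANDIDATE_FORMATS = ["{v}", "v{v}", "release-{v}", "{name}-{v}"]

def find_matching_tag(version: str, name: str, repo_tags: list[str]) -> Optional[str]:
    """Return the first repo tag that looks like it names this
    version, or None if no candidate spelling matches."""
    for fmt in CANDIDATE_FORMATS:
        candidate = fmt.format(v=version, name=name)
        if candidate in repo_tags:
            return candidate
    return None
```

For example, `find_matching_tag("1.2.3", "libfoo", ["v1.2.3", "v1.0.0"])` finds `v1.2.3`, but a repo whose tags stopped at an older release, or use some other convention entirely, yields `None`, which is exactly the "then what do I do?" case above.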
If source repos are clean and well-maintained, then maybe this works. But they are, quite often, far from clean and well-maintained. Often tags and branches are missing entirely, or the maintainers stopped pushing tags seven years ago, or the repo tags list completely different versions than the package registry: e.g., the registry has only 1.0, 1.2, and 1.4, while the repo has only 1.1 and 1.3. I have actually seen this in production packages with many downloads.
And this gets even murkier when we get into vulnerability management. If I can't properly associate what I'm using with an exact version, then I can't accurately know what CVEs I am impacted by.
There are also other downsides to consuming the source repo directly that are orthogonal to supply chain attack risk.
This recommendation seems to boil down to "don't use package registries/artifactories; don't use published artifacts; use only source code repositories". I think it relies too much on assumptions that we cannot make about source repos. The source repo is the canonical truth, but the published artifact is what is meant to be consumed. For these reasons, and the fact that using published artifacts actually offers some protection that the source repo cannot offer, I find it difficult to recommend this as currently written.
The xz attack inserted malicious source into the source package (the tarball of source code) that was not in the repository. If the attack had been in the repository, it would have been immediately visible to everyone who checked the repository, and its history would have been visible too, since git (and any other VCS) tracks changes. But since it was hidden in a tarball, it wasn't obvious: it's historically expected behavior that generated files in the tarball aren't in the VCS, and since they are often different when regenerated somewhere else, there's no easy way to check them. I'll try to rewrite this to make it clearer.
Signed-off-by: David A. Wheeler <[email protected]>
No, not at all. The word "package" has multiple meanings: source packages aren't built packages. A package registry like npm usually deals only with built packages. In the system package world, however, there's often a distinction between a source package (a package of source code) and a built package. The built packages are built from the source packages, because there are often many builders. However, it's been conventional to take what's in the VCS and add stuff into the source package... which is what enabled the xz utils attack. If the source package is only a copy of the VCS, then it's easy to verify that they're the same. If there are mysterious additions, it's impractical to notice the malicious ones.
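The "easy to verify" claim can be illustrated: if the source package is supposed to be a copy (or subset) of the VCS snapshot, then any tarball member outside the tracked-file list is worth investigating. This sketch assumes you supply the tracked-file set yourself (in practice it would come from something like `git ls-tree -r`); the file names in the usage note are illustrative, though `build-to-host.m4` really was the tarball-only carrier in the xz attack.

```python
import tarfile

def unexpected_members(tar_path: str, vcs_tracked: set[str]) -> list[str]:
    """List regular files in a source tarball that are not in the
    VCS-tracked file set -- e.g. a pre-generated script that exists
    only in the release tarball."""
    extras = []
    with tarfile.open(tar_path) as tar:
        for member in tar.getmembers():
            if not member.isfile():
                continue
            # Release tarballs usually prefix members with "name-version/";
            # strip that prefix before comparing against repo paths.
            path = member.name.split("/", 1)[-1]
            if path not in vcs_tracked:
                extras.append(path)
    return sorted(extras)
```

Given a tarball containing `pkg-1.0/src/main.c` and `pkg-1.0/build-to-host.m4`, with only `src/main.c` tracked in the VCS, this flags `build-to-host.m4` as an unexplained addition. An empty result is the "source package is only a copy of the VCS" case, which is trivially auditable.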
Signed-off-by: David A. Wheeler <[email protected]>
Firstly I'd like to say a recommendation along these lines would be nice so +1 🙂
I spotted that a merge conflict header was still included.
Also I wrote some suggestions and thoughts on the topic. Apologies for it being quite wordy. I hope it can trigger some interesting discussion 🙂
RIP: it wasn't posted 😢 I'll try to re-write it
Co-authored-by: j-k <[email protected]> Signed-off-by: David A. Wheeler <[email protected]>
I believe all the issues listed above have been addressed. By starting with "If a source code (unbuilt) package is released...", it makes it clear that we are not talking about built packages. The "if" also makes it clear that there might not be such a thing, which makes sense, because not every ecosystem has this distinction. The merge conflict has also been fixed. Ready to merge?
The source package should be a copy or subset of the VCS materials. Signed-off-by: David A. Wheeler <[email protected]>
LGTM 🚀 😄
Co-authored-by: Jordan Harband <[email protected]> Signed-off-by: David A. Wheeler <[email protected]>
I think recommending not bundling the autotools is by design made to include and ship