Debian must ship reproducible packages

(lists.debian.org)

366 points · robalni · 15d ago · 166 comments

uecker15d ago

This is a huge achievement for Debian and the free software world.

It took a while, though, until this was understood. In 2007, when I pointed out on debian-devel that this was needed, I was still told what a huge waste of time it would be. And indeed it took a huge amount of work by many people to get there, but it is well worth it.

PunchyHamster15d ago

There was no bug or attack on Debian since 2007 that reproducible packages would prevent.

"Well worth it" is not correct. And it just ups the contribution barrier to Debian higher. I have already heard a lot of people complain that contributing to Debian is hard, and while in the past I defended it with "they need all the checks and balances to make sure packages play nicely with each other", this is just a step that makes it hard for no reason and little benefit.

savolai15d ago

“If you are wondering why we are doing this at all, then hopefully the Reproducible Builds website will explain why this is useful.”

https://reproducible-builds.org/

Could you perhaps respond to the argumentation here?

dvogel15d ago

(Not OP, but...) I still fail to see the current value in confirming that a reproducing builder also included the same compromised dependency that I did when I built it. I understand that reproducible builds are guarding against dynamic attacks within build infrastructure. However I just don't see those happening. Compromised source dependencies are a 100x more common problem.

PunchyHamster15d ago

I know why they are useful. I am arguing they are a waste of time for the effort involved.

Forcing devs to use hardware keys to sign commits/CI requests would be an actual security improvement, thwarting many supply chain attacks that only worked because the attacker got hold of developer credentials. Hardware keys at least have the option to make some operations require physically pressing the key, so there is a chance the developer will notice.

But thanks to reproducible builds, at least someone can... validate that the binary code of a vulnerable package can be reproduced. Very fucking useful.

I am not saying it is useless. I am saying it is one of the highest-hanging fruits on the security tree.

azkalam15d ago

Reproducible builds reduce the need for trusted parties.

Have many organizations produce the binaries independently and post the artifacts.

Once n of m parties agree on the artifact hash, take that as the trusted build.

If every party reaches a different hash then we cannot build consensus.
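A minimal sketch of the n-of-m idea above, assuming each party simply reports a hex digest of the artifact it built. The function name `trusted_hash`, the builder names, and the shortened digests are hypothetical, and the sketch assumes n is more than half of m so that at most one digest can reach the threshold.

```python
from collections import Counter
from typing import Optional


def trusted_hash(reported: dict[str, str], n: int) -> Optional[str]:
    """Return the artifact hash reported by at least n builders, else None."""
    if not reported:
        return None
    # Pick the digest with the most votes and check it against the threshold.
    digest, votes = Counter(reported.values()).most_common(1)[0]
    return digest if votes >= n else None


# Hypothetical builder names and shortened digests, for illustration only.
reports = {
    "builder-a": "9f2c",
    "builder-b": "9f2c",
    "builder-c": "9f2c",
    "builder-d": "41e0",
}
print(trusted_hash(reports, n=3))  # three of four builders agree
print(trusted_hash(reports, n=4))  # no quorum, so nothing is trusted
```

If the build is not reproducible, every party reports a different digest and no threshold above 1 can ever be met, which is the failure case described above.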

sgc15d ago

To move away from organizational dependence, there should be an installable project for debian where I can dedicate some configurable small percentage of my compute when idle to reproducibly building debian components to make a robust verification system, starting with the most critical code.

Obviously, it would be a ton of work to make such a system resistant to gaming by malicious actors (see GNU Guix for useful efforts), but it would provide valuable diversity in architecture and (political or other) control.

It would be even cooler if we could have independent projects that could run on various distros and OS, and build packages for any of them. Having packages for bsd verified on linux and vice-versa with statistical logging (this code has been verified x times on y OSes) would be reassuring.

MomsAVoxell15d ago

Reproducible builds are applicable not only to respond to ‘attacks’, a subject you seem to be bikeshedding, but also for other reasons too.

Anyone having to maintain a code base or a distributed fleet of devices will gain from this decision, immensely, as their operational periods come and go.

Reproducible builds are about longevity as much as they are about security.

Please don’t make bold claims about ‘no reason and little benefit’ while demonstrating ignorance of this hard fact: reproducible builds should have been the norm, in computing, from the get-go.

bluGill15d ago

Longevity is harmed though. Your certs need to expire, and in a few years we think that your toolchain will not be downloadable.

PunchyHamster15d ago

> Anyone having to maintain a code base or a distributed fleet of devices will gain from this decision, immensely, as their operational periods come and go.

Just baking in the build ID and commit is enough. What do you think reproducible builds add?

> Please don’t make bold claims about ‘no reason and little benefit’ while demonstrating ignorance of this hard fact: reproducible builds should have been the norm, in computing, from the get-go.

So far not a single person in the thread has given me a concrete example (as in: existing project, existing problem, no other solution can solve it). They just claim it's better based on their feelings. Come on, be the first one.

benregenspan15d ago

Is the "Jia Tan" XZ Utils compromise not a good example? That relied on code snuck into a release that was not in source.

(It was caught before being promoted into a stable Debian release, yes, but this sort of relied on a happy accident, too close for comfort)

chuckadams15d ago

The xz hack was still reproducible, because it was included in the distribution archive which did not match the upstream source -- even then, it was so obfuscated it likely would have gone unnoticed, but nevertheless it only lived in the uploaded tarball and not in the repo. Reproducibility is a good thing, but the next step is build provenance.

Still, lots of good non-security benefits to reproducible builds too.

jcranmer15d ago

The xz utils compromise is a very good example... of why reproducible packages don't actually solve anything security-wise!

The backdoor relied first on a difference between building a package in a packaging environment versus building the package on your own. And also, it relied on the very common practice of checking in unreviewable artifacts into the source tree (e.g., the configure script, malicious binary test artifacts).

Reproducible builds guarantee that two people can follow the same instructions and get the same, bit-identical outcome. They do nothing to guarantee that those instructions have not been compromised, and all of the great packaging security failures of my lifetime that I can think of have relied on those instructions being compromised (e.g., xz utils, Debian OpenSSL keygen issues).

eptcyka15d ago

It makes shipping backdoors a whole lot harder, yes.

PunchyHamster15d ago

Unless someone spins up an entirely separate infrastructure dedicated just to verifying Debian packages, it doesn't.

zaphirplane15d ago

Hmm, it prevents Trojan binaries, which are a small subset of backdoors IMHO.

Defense in depth obviously is a good thing.

aborsy15d ago

There was perhaps no detected bug or attack. There have most likely been bugs or attacks that reproducible builds would have prevented.

PunchyHamster15d ago

And you base it on what exactly? It's "just" making sure the build process is always ordered.

If anything it will make the attacker's job easier, as the Ubuntu package will have the same files structured exactly the same way as the Debian one.

CyberDildonics15d ago

> There have most likely been bugs or attacks that reproducible builds would have prevented.

Like what exactly?

deknos15d ago

"mimimimi".

Those people do not care about quality in open source at all. For long-lived software this is very important.

Of course, all those JavaScript and Kubernetes packages, which will be irrelevant again in a few years, might complain, but let them complain.

ckastner15d ago

> There was no bug or attack on Debian since 2007 that reproducible packages would prevent.

I'm reading this as a suggestion that the reproducible builds effort was an ineffective deterrent.

However, note that your observation could also be explained by the opposite: the reproducible builds effort was an effective deterrent, so nobody bothered with attempts.

> And it just ups the contribution barrier to Debian higher

Until yesterday, the package just got flagged in the tracker, and you could either ignore it, or fix it yourself, or the kind people behind the reproducible builds effort supplied a patch themselves.

Now, you can no longer ignore it. But fixes are often trivial. Use a (stable) timestamp provided by the build, seed RNGs with some constant (instead of eg: time), etc. These are best practices anyway.
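A minimal sketch of the two fixes named above, assuming the build tooling exports the `SOURCE_DATE_EPOCH` environment variable (the convention specified by the Reproducible Builds project). The function names and the banner format are hypothetical, chosen only for illustration.

```python
import os
import random
import time
from datetime import datetime, timezone


def build_timestamp() -> int:
    """Prefer SOURCE_DATE_EPOCH over the wall clock when the build sets it."""
    value = os.environ.get("SOURCE_DATE_EPOCH")
    return int(value) if value is not None else int(time.time())


def build_banner(seed: int = 0) -> str:
    """Produce a banner whose date and 'random' id are stable across builds."""
    rng = random.Random(seed)  # constant seed instead of seeding from the time
    stamp = datetime.fromtimestamp(build_timestamp(), tz=timezone.utc)
    return f"built {stamp:%Y-%m-%d} id={rng.getrandbits(32):08x}"


# Normally the build tooling sets this; it is set here only for the demo.
os.environ["SOURCE_DATE_EPOCH"] = "1700000000"
print(build_banner() == build_banner())  # True: same inputs, same output
```

Falling back to the wall clock keeps ordinary local builds working while letting a packaging environment pin the value.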

PunchyHamster15d ago

> However, note that your observation could also be explained by the opposite: the reproducible builds effort was an effective deterrent, so nobody bothered with attempts.

There was no attack that reproducible builds would help protect from before 2007 either.

> Until yesterday, the package just got flagged in the tracker, and you could either ignore it, or fix it yourself, or the kind people behind the reproducible builds effort supplied a patch themselves.

> Now, you can no longer ignore it. But fixes are often trivial. Use a (stable) timestamp provided by the build, seed RNGs with some constant (instead of eg: time), etc.

That's the entirety of the problem. App developers don't want to be package experts or build experts.

> These are best practices anyway.

They are not. They are best practices if you want reproducible builds. They are an entirely useless waste of time if you don't care.

Atotalnoob15d ago

That’s a big logical fallacy; I’m not sure that’s what you want to go with.

rurban13d ago

So these are broken on amd64. Debian arm64/forky rebuilderd stats https://reproduce.debian.net/arm64/stats/forky/

Most failed to reproduce because of NT_GNU_BUILD_ID, the others because of some other bits. Mostly timestamps or hashes, I assume.

perlgeek15d ago

https://wiki.debian.org/ReproducibleBuilds has some more info; some of it is outdated, but it also has a chart showing how many packages are built in the CI, and how many of those build reproducibly.

(Orange = FTBR = "failed to build reproducibly")

I'm not good at reading numbers from charts, but I'd guess it's a few percent (4-5ish?).

bpavuk15d ago

all I get is this:

> Forbidden

> <p>You are not allowed to access this!</p>

(yes, with HTML tags on display) :)

EDIT: I also found a "I Challenge Thee" page in history. did I just get blocked by antibot measures? why???

unleaded15d ago

Do you have JavaScript disabled? They put one of those anti-scraper things on it.

bpavuk15d ago

Nope, it's enabled. I can pass Cloudflare, reCaptcha, whatever Microsoft is doing, and Anubis, but Debian caught me off-guard.

jaypatelani15d ago

Good thing. NetBSD has had fully reproducible builds since 2017. https://blog.netbsd.org/tnf/entry/netbsd_fully_reproducible_...

idoubtit15d ago

As pointed out in your link, NetBSD achieved this with some help from Debian. If I understand correctly, it's not that NetBSD tried harder, it's that their problem was easier: fewer packages, which change less (they still use CVS, "stability" is an understatement!).

BTW, most Debian packages have reproducible builds. Those that do not (I'd say 5%) are shown in orange in the graph there: https://wiki.debian.org/ReproducibleBuilds

Zopieux15d ago

A great milestone, congrats Debian on taking a stance and holding high standards for yourself, especially in the current era.

jgneff15d ago

I'm so happy to see this change. I got involved with reproducible builds in 2021 after reading in horror about the SolarWinds attack. [1]

I think Magnus Ihse Bursie said it best while working on reproducible builds of OpenJDK: "If you were to ask me, the fact that compilers and build tools ever started to produce non-deterministic output has been a bug from day one." [2]

[1] https://www.linux.com/news/preventing-supply-chain-attacks-l...

[2] https://github.com/openjdk/jdk/pull/9152#issue-1270543997

micw15d ago

I wonder why this is a thing nowadays. I use yocto for embedded devices and it was almost a no-brainer to implement reproducible builds. I can also easily enable Debian package management, so everything is already available.

MomsAVoxell15d ago

What do you mean why is it a thing nowadays?

Reproducible builds are an essential method in industrial computing - Debian isn’t at the forefront of this, it is merely adopting industry wide techniques also applied to other operating systems in use in long-term and safety-related applications.

Certainly, a lot of the hard work of the Yocto and Debian developers is already in your hands.

What is interesting is that this is now being applied in a more forward-focused policy by the Debian developers, that it will now be the norm rather than an option…

dezgeg15d ago

Did you actively verify that your builds were bit-reproducible?

tofflos15d ago

amd64 forky

reproduced: 97.02% good: 17586 bad: 511 fail: 30 unknown: 0

This, statistics for other architectures, and the reasons for unreproducibility can be found at https://reproduce.debian.net.
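The percentage can be checked from the counts quoted above, assuming (as the numbers suggest) that the denominator is good + bad + fail + unknown:

```python
# Counts quoted above for amd64 forky.
good, bad, fail, unknown = 17586, 511, 30, 0

total = good + bad + fail + unknown
print(total)                          # 18127
print(f"{100 * good / total:.2f}%")   # 97.02%
```

That leaves 541 packages (bad plus fail), just under 3%, that did not reproduce on that architecture.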

suprjami15d ago

I am always surprised Debian are leading this and not the commercial vendors. You'd think big organisations paying for RHEL and Ubuntu would be beating down the door for verifiable binaries.

tremon15d ago

If a competitor can prove that their packages are bit-for-bit identical to what a big organization is shipping, that allows the competitor to benefit from the security assurances of the big org. This is great for software freedom, not so great for wannabe monopolists.

jorams15d ago

Reproducible builds exist to reduce the need for trust, while commercial vendors are in the business of selling trust.

pixel_popping15d ago

Forbidden

You don't have permission to access this resource. Apache Server at lists.debian.org Port 443

ameliaquining15d ago

I can see it just fine; maybe an overzealous firewall thinks you're a bot? At any rate, the Wayback Machine has it: https://web.archive.org/web/20260510074120/https://lists.deb...

baranul15d ago

Unfortunately, many of these "protections" don't know what is a bot or a human. Many clueless websites are often just blocking huge swaths of legitimate readers and customers.

pixel_popping15d ago

Why would you block access to a static page, even for bots? What's the point? I'm not a bot; this is a very typical non-privacy setup (Firefox, Linux, VPN) for personal usage.

It does work with my privacy/scraping setup (residential proxy, spoofed fingerprints, Qubes and so on), great job Debian.

TacticalCoder15d ago

What people really don't understand about reproducible builds is that they're not a guarantee that there's no backdoor.

They're a guarantee that if there's a backdoor, it's reproducible 100% of the time.

This is a godsend for white hats fighting the good fight.

And, as a side note, it's strongarming vs the bad guys: "Would be too bad if we could reproduce your shiny exploit 100% of the time wouldn't it!?".

Note that we should go further (though it's a bit orthogonal to reproducible builds): builds of the final binary/package should happen only after discarding all files not necessary for the final build (like all test cases and all test assets). The build should literally happen in an environment that has gotten rid of those (after, of course, having verified in another environment that all test cases succeed). If I'm not mistaken, getting rid of test assets would have stopped Jia Tan's XZ backdoor attempt dead in its tracks, for example, because IIRC binary data that was part of the backdoor was hidden in an asset only used by test cases.

P.S.: as a bonus they also make it possible to detect bit flips (I'm not saying there aren't other ways to detect bit flips: what I'm saying is that if you have deterministic builds anyway and something doesn't reproduce correctly due to a flipped bit, it's going to be noticed).

casey214d ago

This fights against "opensource-washing" which is the practice of large companies claiming to release open source code, but the compile takes so long (as well as being overly-convoluted) that most people and many distros can't afford to maintain the package.

It feels like AI and traditional software are converging in complexity.

kkyktkrkekk15d ago

“Optimize the code for 5 seconds”, as many compilers, including VC++ on Windows, did, was probably one of the dumbest things ever invented. It meant that the binaries became more optimized when building on faster computers.

inglor_cz15d ago

Has anyone fought Microsoft Visual Studio successfully to produce reproducible builds of C++ programs? From what I have heard, it is one of the worst contexts to do it.

Dwedit15d ago

It's that RICH header that you need to exclude. I just tested my copy of MSVC 2019, and `/emittoolversioninfo:no` will exclude the RICH header from the binary. Supposedly also works in MSVC 2022.

The build timestamps in the PE header and export table are also a problem.
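To illustrate the PE header timestamp mentioned above, here is a small sketch that reads the `TimeDateStamp` field of the COFF file header, which sits 8 bytes after the `PE\0\0` signature located via `e_lfanew` at offset 0x3C. The helper `fake_pe` is hypothetical: it builds only the first bytes of a header for the demo, not a loadable binary, and the sketch does not cover the export table timestamp.

```python
import struct


def pe_timestamp(image: bytes) -> int:
    """Return the COFF TimeDateStamp field of a PE image."""
    if image[:2] != b"MZ":
        raise ValueError("not an MZ/PE image")
    (pe_offset,) = struct.unpack_from("<I", image, 0x3C)  # e_lfanew
    if image[pe_offset:pe_offset + 4] != b"PE\0\0":
        raise ValueError("PE signature not found")
    # After the signature: Machine (2 bytes), NumberOfSections (2), TimeDateStamp (4).
    (stamp,) = struct.unpack_from("<I", image, pe_offset + 8)
    return stamp


def fake_pe(stamp: int) -> bytes:
    """Build a tiny synthetic header (hypothetical demo data, not a real binary)."""
    dos = b"MZ" + bytes(0x3A) + struct.pack("<I", 0x40)  # e_lfanew points at 0x40
    coff = b"PE\0\0" + struct.pack("<HHI", 0x8664, 1, stamp)
    return dos + coff


first = fake_pe(1700000000)
second = fake_pe(1700000060)  # same "build", one minute later
print(first == second)                              # False
print(pe_timestamp(first), pe_timestamp(second))    # 1700000000 1700000060
```

Two otherwise identical builds differ in exactly this field unless the toolchain is told to write a fixed value.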

azkalam15d ago

Probably the easiest way is to use Bazel and leverage the effort that has gone in there.

einpoklum15d ago

Well, you can't build MSVS yourself, reproducibly or otherwise, so this is a less appealing endeavor I would think.

rurban15d ago

... and most of this work is done by other distros and maintainers. Starting with binutils

shevy-java15d ago

A small step for debian,

giant leap for mankind.

stingraycharles15d ago

As someone who recently spent a lot of time on making a large C++ program entirely reproducible on 4 different OSes, one cannot overstate just how many tiny details matter here.

amelius15d ago

That's cool but I'm honestly a bit disappointed in how apt refused to embrace/support both the container and AI/GPU aspects of computing. Are we going to see some changes there?

yjftsjthsd-h15d ago

Those seem like unrelated things? I can imagine ways for apt to integrate with containers, but what would it possibly do for AI or GPU other than delivering packages like it already does?

Arrowmaster15d ago

What exactly are you talking about? Those don't seem related.

Hendrikto15d ago

Why the fuck does that site break the back button? DO NOT do that.

em-bee15d ago

since there is no other way to reach you please allow me to use this off topic message to let you know that there is a response to your comments on the gnupg discussion from two weeks ago.

einpoklum15d ago

Debian must ship packages without the hard dependence on systemd.

charcircuit15d ago

So much time has been wasted on reproducible builds which could have been better spent on securing more important parts of Debian. Practically minor changes like a build timestamp being different are not an issue.

Hendrikto15d ago

It allows verifying that the binaries actually match the source, which is extremely valuable.

charcircuit15d ago

Bit for bit matching is not required for that.

Hendrikto14d ago

It makes it much simpler and more robust though. Also, it allows for content addressing a la Nix, among other benefits.

farfatched15d ago

Yes, making sure build timestamps are reproducible isn't a security win.

What is a win is that two independent parties can run the same build, and get the same binaries.

This is important because it removes trust from builders: anyone can verify their output.

It just so happens that unimportant things like build versions impede that.

charcircuit15d ago

Anyone can verify the actual code in the binary matches even if some bytes within the binary file itself are different. The verification routine doesn't have to be a basic bit for bit equality test.
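A sketch of what such a non-bit-for-bit check could look like, using tar archives as a stand-in for a package format: two archives that differ only in file modification times compare unequal as bytes, but equal once only member names and content digests are considered. The helper names are hypothetical, and a real check would also have to decide which other metadata (modes, owners, ordering) it is safe to ignore, which is exactly the part that has to be trusted.

```python
import hashlib
import io
import tarfile


def make_tar(files: dict[str, bytes], mtime: int) -> bytes:
    """Create an in-memory tar archive with every member stamped with mtime."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for name in sorted(files):
            info = tarfile.TarInfo(name)
            info.size = len(files[name])
            info.mtime = mtime
            tar.addfile(info, io.BytesIO(files[name]))
    return buf.getvalue()


def content_digests(archive: bytes) -> dict[str, str]:
    """Map each regular file in the archive to the SHA-256 of its contents."""
    digests = {}
    with tarfile.open(fileobj=io.BytesIO(archive), mode="r") as tar:
        for member in tar:
            if member.isfile():
                data = tar.extractfile(member).read()
                digests[member.name] = hashlib.sha256(data).hexdigest()
    return digests


files = {"usr/bin/tool": b"\x7fELF...", "usr/share/doc/README": b"hello\n"}
build_a = make_tar(files, mtime=1700000000)
build_b = make_tar(files, mtime=1700000060)

print(build_a == build_b)                                    # False
print(content_digests(build_a) == content_digests(build_b))  # True
```

The trade-off is that every ignored field is a place where the comparison tool itself must be audited, whereas a plain hash comparison needs no such judgment.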

farfatched14d ago

For sure.

This has been the status quo in Debian for a while now. You can build, and use diffoscope to audit the differences.

It's a stronger security property to have bit-for-bit reproducibility, and it looks like Debian are ready to commit to it.

deknos15d ago

you are free to provide patches instead of bitching.

charcircuit15d ago

And Debian is able to offer me a few million dollars yearly to help fix their security situation.

deknos14d ago

the idea that debian has a few million dollars to spare creates the assumption, that even if they would have... you would either not know how to fix issues, or not worth it.

kkfx15d ago

Debian, like any other legacy distro, must become declarative, because the '80s model of manual deploy and the absurd pain of D/I and Preseed must end.

kakwa_15d ago

In the end, Nix is just a thin veneer on this stuff.

Given how many quick & dirty sed patches or exec commands I've seen in the few Nix packages/modules I've read, I would not exactly bet my life on it being completely idempotent & reproducible.

kkfx15d ago

it's the best option after IllumOS (OpenSolaris) IPS integrated with ZFS. Far less powerful not imposing zfs (only well supported for root, swap, encryption etc), so not integrated in the package system and bootloader management (BEs, Boot Environments).

It's not reproducible bit by bit, it fetches the current version of anything, but it's still easy enough to reproduce, stable enough and complete enough, while classic distros need a fresh install every major release or else face issues and keep a system in an unknown state for a long time until it explodes.

farfatched15d ago

I've been 100% on NixOS for many years, but it's Debian that really drove this project.

They're still a pragmatic choice for many use cases.

suprjami15d ago

bootcrew have bootc Containerfiles for Debian, Ubuntu, Arch, and openSUSE:

https://github.com/bootcrew/mono

blueflow15d ago

zero improvement on end-user experience. does not solve supply chain issues, debian package will reproducibly contain the malware from upstream.

quantummagic15d ago

> zero improvement on end-user experience.

Maybe not by itself, but it does allow for the ecosystem to be audited, in a way that ultimately benefits the end-user. It really is an important part of a healthy supply chain.

miohtama15d ago

I would call less North Korean hacks a massive benefit for end users

testdelacc115d ago

While taking no stance on your statement, I think “fewer” works better in this context than “less”.

rlpb15d ago

Debian has had a better "software supply chain" posture than any other player in the ecosystem since before the turn of the century. While we all face the risk of malware from upstream, Debian is the least at risk of being affected by it. See for example the stream of issues from npm et al. None of it has affected Debian.

alkindiffie15d ago

> for example the stream of issues from npm et al.

Curious, what distros were affected by npm supply chain attacks?

throw_a_grenade15d ago

It's npm that's affected, therefore it's not even considered when choosing a language/ecosystem for writing distro tools. You'll find no sane distro writing its package manager in JavaScript, precisely to avoid this joke of a supply chain.

hiAndrewQuinn15d ago

This is some of the best news I've heard recently when it comes to figuring out how to produce high quality Software Bills of Materials for the upcoming EU Cyber Resilience Act, for what it's worth. Reproducible packages are actually worth a great deal when you are selling products with digital elements. Much easier to scan through, audit, etc. with confidence.

iveqy15d ago

It does not solve all supply chain issues, but it does solve some supply chain issues.

Not being able to see whether the source code shipped is the same as was used for creating the binary is scary.

murderfs15d ago

Has there been a single publicly known attack that would have been prevented by this?

atoav15d ago

If you find yourself holding opinions of the kind: "If it can't be made perfect, it shouldn't be changed at all?" you may want to consider that most things that work well today were incrementally improved.

Reproducible builds do not solve all issues, as you rightly observed, but they can be a stepping stone (or even a precondition) for further measures.

mschuster9115d ago

That's not what reproducible builds aim to prevent, and no one claims that. When upstream pushes bad code, that's on upstream.

The thing reproducible builds aim to prevent is Debian, or individual developers and system administrators with access rights to binary uploads and signing keys, being forced by attackers to sign and upload binary packages, be these attackers governments (with or without court orders) or criminal organizations.

As of now, say if I were an administrator of Debian's CI infrastructure, technically there would be nothing preventing me from running an "extra" job on the CI infrastructure building a package for openssh with a knock-knock backdoor, properly signing it and uploading it to the repository. For someone to spot the attack and differentiate it, they'd have to notice that there is a package in the repository that has no corresponding build logs or has issues otherwise.

But with reproducible builds, anyone can set up infrastructure to rebuild Debian packages from source automatically and if there is a mismatch with what is on Debian's repository, raise alarm bells.
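The comparison step of such a rebuilder can be sketched as follows, assuming the rebuild itself has already happened and the repository publishes a SHA-256 digest for the package. The function names and the file name are hypothetical, and the demo writes a placeholder file instead of a real package.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large packages need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(rebuilt: Path, published_sha256: str) -> bool:
    """True if the locally rebuilt package is bit-identical to the published one."""
    return sha256_file(rebuilt) == published_sha256.lower()


with tempfile.TemporaryDirectory() as tmp:
    rebuilt = Path(tmp) / "example_1.0_amd64.deb"  # hypothetical file name
    rebuilt.write_bytes(b"pretend package contents")
    published = hashlib.sha256(b"pretend package contents").hexdigest()
    print(matches_published(rebuilt, published))   # True
    print(matches_published(rebuilt, "0" * 64))    # False: time to raise the alarm
```

A mismatch does not say who is at fault, only that the published binary is not what the source and build instructions produce, which is the signal an independent rebuilder is looking for.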

ownagefool15d ago

Reproducible builds show that, within a specific configuration, the code produced the binary, regardless of who signed or published it.

Indeed, this could mitigate an attacker replacing the binary with something that's not produced from the code, but it does not mitigate the tool chain or code itself containing the exploit, creating a malicious binary.

shevy-java15d ago

Well - reproducible also means code guarantee. It may not improve an end-user experience directly, but you get an extra quality control step, as guarantee, here. I think reproducibility is great. If we can achieve that, it should be achieved. See also NixOS; it can guarantee that snapshot xyz works, not just for one user, but ALL users. I see it as hopping from guarantee to guarantee. That's actually a good thing in the long run. Just think differently here.

j / k navigate · click thread line to collapse

166 comments

uecker15d ago

This is a huge achievement for Debian and the free software world.

PunchyHamster15d ago

There was no bug or attack on Debian since 2007 that reproducible packages would prevent.

savolai15d ago

” If you are wondering why we are doing this at all, then hopefully the Reproducible Builds website will explain why this is useful.”

https://reproducible-builds.org/

Could you perhaps respond to the argumentation here?

dvogel15d ago

3 more replies

PunchyHamster15d ago

I know why they are useful. I am arguing they are waste of time for effort involved.

But thanks to reproducible builds, at least someone can... validate that the binary code of vulnerable package can be reproduced. Very fucking useful.

I am not saying it is useless. I am saying it is one of highest hanging fruits on security tree.

2 more replies

azkalam15d ago

Reproducible builds reduce the need for trusted parties.

Have many organizations produce the binaries independently and post the arifacts.

Once n of m parties agree on the arifact hash, take that as the trusted build.

If every party reaches a different hash then we cannot build consensus.

sgc15d ago

1 more reply

MomsAVoxell15d ago

Reproducible builds are applicable not only to respond to ‘attacks’, a subject you seem to be bikeshedding, but also for other reasons too.

Anyone having to maintain a code base or a distributed fleet of devices will gain from this decision, immensely, as their operational periods come and go.

Reproducible builds are about longevity as much as they are about security.

Please don’t make bold claims about ‘no reason and little benefit’ while demonstrating ignorance of this hard fact: reproducible builds should have been the norm, in computing, from the get-go.

bluGill15d ago

I longevity is harmed though. Your certs need to expire in a few years we think that your toolchain will not be downloadable.

2 more replies

PunchyHamster15d ago

> Anyone having to maintain a code base or a distributed fleet of devices will gain from this decision, immensely, as their operational periods come and go.

Just baking in build ID and commit is enough. What you think reproducible builds add ?

benregenspan15d ago

Is the "Jia Tan" XZ Utils compromise not a good example? That relied on code snuck into a release that was not in source.

(It was caught before being promoted into a stable Debian release, yes, but this sort of relied on a happy accident, too close for comfort)

chuckadams15d ago

Still, lots of good non-security benefits to reproducible builds too.

jcranmer15d ago

The xz utils compromise is a very good example... of why reproducible packages doesn't actually solve anything security-wise!

2 more replies

eptcyka15d ago

It makes shipping backdoors a whole lot harder, yes.

PunchyHamster15d ago

Unless someone spins entirely separate infrastructure dedicated just for verifying Debian packages, it doesn't.

zaphirplane15d ago

Hmm, it prevents Trojan binaries which is a small subset of backdoor IMHO.

Defense in depth obviously is a good thing

aborsy15d ago

There was perhaps no detected bug or attack. There have most likely been bugs or attacks that reproducible builds would have prevented.

PunchyHamster15d ago

And you base it on what exactly ? It's "just" making sure the build process is always ordered.

If anything it will make attacker's job easier, as Ubuntu package will have same files structured exactly same way as Debian one.

1 more reply

CyberDildonics15d ago

There have most likely been bugs or attacks that reproducible builds would have prevented.

Like what exactly?

deknos15d ago

"mimimimi".

Those people do not care about quality in opensource at all. For longliving software this is very important.

Of course, all those javascript and kubernetes packages which are irrelevant in a few years again, might complain, but let them complain.

ckastner15d ago

> There was no bug or attack on Debian since 2007 that reproducible packages would prevent.

I'm reading this as a suggestion that the reproducible builds effort was an ineffective deterrent.

However, note that your observation could also be explained by the opposite: the reproducible builds effort was an effective deterrent, so nobody bothered with attempts.

> And it just ups the the contribution barrier to Debian higher

Until yesterday, the package just got flagged in the tracker, and you could either ignore it, or fix it yourself, or the kind people behind the reproducible builds effort supplied a patch themselves.

PunchyHamster15d ago

> However, note that your observation could also be explained by the opposite: the reproducible builds effort was an effective deterrent, so nobody bothered with attempts.

There was no attack that reproducible builds would help protect from before 2007 either.

> Now, you can no longer ignore it. But fixes are often trivial. Use a (stable) timestamp provided by the build, seed RNGs with some constant (instead of eg: time), etc.

that's the entirety of the problem. App developers don't want to be package experts or build experts.

> These are best practices anyway.

They are not. They are best practices if you want reproducible builds. They are entirely useless waste of time if you don't care.

2 more replies

Atotalnoob15d ago

That’s a big logical fallacy, I’m not sure if that’s what you want to go with

rurban13d ago

So these are broken on amd64. Debian arm64/forky rebuilderd stats https://reproduce.debian.net/arm64/stats/forky/

Most with failed to reproduce: NT_GNU_BUILD_ID. The others on some other bits. Mostly timestamps or hashes I assume

perlgeek15d ago

https://wiki.debian.org/ReproducibleBuilds has some more infos; some is outdated, but it also has a chart showing how many packages are built in the CI, and how many of those are reproducible builds.

(Orange = FTBR = "failed to build reproducibly")

I'm not good at reading numbers from charts, but I'd guess it's a few percent (4-5ish?).

bpavuk15d ago

all I get is this:

> Forbidden

> <p>You are not allowed to access this!</p>

(yes, with HTML tags on display) :)

EDIT: I also found a "I Challenge Thee" page in history. did I just get blocked by antibot measures? why???

unleaded15d ago

Do you have JavaScript disabled? They put one of those anti-scraper things on it.

bpavuk15d ago

nope, it's enabled. I can pass Cloudflare, reCaptcha, whatever Microsoft is doing, and Anubis, but Debian caught me off-guard

jaypatelani15d ago

Good thing. NetBSD has had fully reproducible builds since 2017. https://blog.netbsd.org/tnf/entry/netbsd_fully_reproducible_...

idoubtit15d ago

BTW, most Debian packages have reproducible builds. Those which do not (I'd say 5%) are shown in orange in the graph there: https://wiki.debian.org/ReproducibleBuilds

Zopieux15d ago

A great milestone, congrats Debian on taking a stance and holding high standards for yourself, especially in the current era.

jgneff15d ago

I'm so happy to see this change. I got involved with reproducible builds in 2021 after reading in horror about the SolarWinds attack. [1]

[1] https://www.linux.com/news/preventing-supply-chain-attacks-l...

[2] https://github.com/openjdk/jdk/pull/9152#issue-1270543997

micw15d ago

MomsAVoxell15d ago

What do you mean why is it a thing nowadays?

Certainly, a lot of the hard work of the Yocto and Debian developers is already in your hands.

What is interesting is that this is now being applied in a more forward-focused policy by the Debian developers, that it will now be the norm rather than an option…

dezgeg15d ago

Did you actively verify that your builds were bit-reproducible?

tofflos15d ago

amd64 forky

reproduced: 97.02% good: 17586 bad: 511 fail: 30 unknown: 0

This, statistics for other architectures, and the reasons for unreproducibility can be found at https://reproduce.debian.net.
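
As a quick check of the arithmetic, here is a minimal sketch, assuming the percentage is simply "good" divided by the sum of all listed categories (the page may define it differently, although this formula matches the quoted figure). The function name is hypothetical.

```python
def reproduced_percent(good, bad, fail, unknown=0):
    """Share of packages that reproduced, as a percentage of all listed."""
    total = good + bad + fail + unknown
    return 100.0 * good / total


# Figures quoted above for amd64 forky.
print(f"{reproduced_percent(17586, 511, 30):.2f}%")  # prints 97.02%
```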

suprjami15d ago

I am always surprised Debian are leading this and not the commercial vendors. You'd think big organisations paying for RHEL and Ubuntu would be beating down the door for verifiable binaries.

tremon15d ago

jorams15d ago

Reproducible builds exist to reduce the need for trust, while commercial vendors are in the business of selling trust.

pixel_popping15d ago

Forbidden

You don't have permission to access this resource. Apache Server at lists.debian.org Port 443

ameliaquining15d ago

I can see it just fine; maybe an overzealous firewall thinks you're a bot? At any rate, the Wayback Machine has it: https://web.archive.org/web/20260510074120/https://lists.deb...

baranul15d ago

Unfortunately, many of these "protections" don't know what is a bot or a human. Many clueless websites are often just blocking huge swaths of legitimate readers and customers.

pixel_popping15d ago

Why would you block access to a static page, even Bots, what's the point? I'm not a bot, very typical non-privacy setup (Firefox, Linux, VPN) for personal usage.

It does work with my privacy/scraping setup (residential proxy, spoofed fingerprints, Qubes and so on), great job Debian.

TacticalCoder15d ago

What people really don't understand about reproducible builds is that they're not a guarantee that there's no backdoor.

They're a guarantee that if there's a backdoor, it's reproducible 100% of the time.

This is a godsend for white hats fighting the good fight.

And, as a side note, it's strongarming vs the bad guys: "Would be too bad if we could reproduce your shiny exploit 100% of the time wouldn't it!?".

casey214d ago

It feels like AI and traditional software are converging in complexity.

kkyktkrkekk15d ago

inglor_cz15d ago

Has anyone fought Microsoft Visual Studio successfully to produce reproducible builds of C++ programs? From what I have heard, it is one of the worst contexts to do it.

Dwedit15d ago

It's that RICH header that you need to exclude. I just tested my copy of MSVC 2019, and `/emittoolversioninfo:no` will exclude the RICH header from the binary. Supposedly also works in MSVC 2022.

The build timestamps in the PE header and export table are also a problem.
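
To see whether two builds differ in the PE header timestamp, a minimal sketch like the following can read the field directly. It relies on the documented PE/COFF layout (the `e_lfanew` offset at 0x3C, then the `PE\0\0` signature, then the COFF header). It only covers the COFF header `TimeDateStamp`, not the export table timestamp mentioned above, and the function name is hypothetical.

```python
import struct


def pe_timestamp(data: bytes) -> int:
    """Return the COFF TimeDateStamp field of a PE image held in `data`."""
    if data[:2] != b"MZ":
        raise ValueError("not an MZ/PE image")
    # e_lfanew: 4-byte little-endian offset of the PE signature, at 0x3C.
    (pe_offset,) = struct.unpack_from("<I", data, 0x3C)
    if data[pe_offset:pe_offset + 4] != b"PE\0\0":
        raise ValueError("PE signature not found")
    # The COFF header follows the 4-byte signature: Machine (2 bytes),
    # NumberOfSections (2 bytes), then TimeDateStamp (4 bytes).
    (timestamp,) = struct.unpack_from("<I", data, pe_offset + 4 + 4)
    return timestamp
```

Calling it on the bytes of two binaries built from the same source shows whether this particular field is one of the differences.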

azkalam15d ago

Probably easiest way is to use Bazel to leverage the effort that has gone in there

einpoklum15d ago

Well, you can't build MSVS yourself, reproducibly or otherwise, so this is a less appealing endeavor I would think.

rurban15d ago

... and most of this work is done by other distros and maintainers. Starting with binutils

shevy-java15d ago

A small step for debian,

giant leap for mankind.

stingraycharles15d ago

As someone who recently spent a lot of time on making a large C++ program entirely reproducible on 4 different OSes, one cannot overstate just how many tiny details matter here.

amelius15d ago

That's cool but I'm honestly a bit disappointed in how apt refused to embrace/support both the container and AI/GPU aspects of computing. Are we going to see some changes there?

yjftsjthsd-h15d ago

Those seem like unrelated things? I can imagine ways for apt to integrate with containers, but what would it possibly do for AI or GPU other than delivering packages like it already does?

Arrowmaster15d ago

What exactly are you talking about? Those don't seem related.

Hendrikto15d ago

Why the fuck does that site break the back button? DO NOT do that.

em-bee15d ago

since there is no other way to reach you please allow me to use this off topic message to let you know that there is a response to your comments on the gnupg discussion from two weeks ago.

einpoklum15d ago

Debian must ship packages without the hard dependence on systemd.

charcircuit15d ago

Hendrikto15d ago

It allows verifying that the binaries actually match the source, which is extremely valuable.

charcircuit15d ago

Bit for bit matching is not required for that.

Hendrikto14d ago

It makes it much simpler and more robust though. Also, it allows for content addressing a la Nix, among other benefits.

farfatched15d ago

Yes, making sure build timestamps are reproducible isn't a security win.

What is a win is that two independent parties can run the same build, and get the same binaries.

This is important because it removes trust from builders: anyone can verify their output.

It just so happens that unimportant things like build versions impede that.
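
The check described above, two parties building independently and comparing results, comes down to comparing digests of the two artifacts. A minimal sketch, assuming each party has its own build output at a local path (the paths and function names are hypothetical, and SHA-256 is just one reasonable choice of hash):

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of the file at `path`."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so large packages need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def builds_match(path_a, path_b):
    """True only if the two build outputs are bit-for-bit identical."""
    return sha256_of(path_a) == sha256_of(path_b)
```

Any embedded value that varies between builds, such as a timestamp, changes the digest, which is why such details get in the way of this simple comparison.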

charcircuit15d ago

Anyone can verify the actual code in the binary matches even if some bytes within the binary file itself are different. The verification routine doesn't have to be a basic bit for bit equality test.

farfatched14d ago

For sure.

This has been the status quo in Debian for a while now. You can build, and use diffoscope to audit the differences.

It's a stronger security property to have bit-for-bit reproducibility, and it looks like Debian are ready to commit to it.

deknos15d ago

you are free to provide patches instead of bitching.

charcircuit15d ago

And Debian is able to offer me a few million dollars yearly to help fix their security situation.

deknos14d ago

the idea that Debian has a few million dollars to spare rests on an assumption; and even if they did... you would either not know how to fix the issues, or not be worth it.

kkfx15d ago

Debian, like any other legacy distro, must become declarative, because the '80s model of manual deploy and the absurd pain of D/I and Preseed must end.

kakwa_15d ago

In the end, Nix is just a thin veneer on this stuff.

Given how much quick & dirty sed patching and how many exec commands I've seen in the few nix packages/modules I've read, I would not exactly bet my life on it being completely idempotent & reproducible.

kkfx15d ago

farfatched15d ago

I've been 100% on NixOS for many years, but it's Debian that really drove this project.

They're still a pragmatic choice for many use cases.

suprjami15d ago

bootcrew have bootc Containerfiles for Debian, Ubuntu, Arch, and openSUSE:

https://github.com/bootcrew/mono

blueflow15d ago

zero improvement on end-user experience. does not solve supply chain issues, the Debian package will reproducibly contain the malware from upstream.

quantummagic15d ago

> zero improvement on end-user experience.

Maybe not by itself, but it does allow for the ecosystem to be audited, in a way that ultimately benefits the end-user. It really is an important part of a healthy supply chain.

miohtama15d ago

I would call less North Korean hacks a massive benefit for end users

testdelacc115d ago

While taking no stance on your statement, I think “fewer” works better in this context than “less”.

rlpb15d ago

alkindiffie15d ago

> for example the stream of issues from npm et al.

Curious, what distros were affected by npm supply chain attacks?

throw_a_grenade15d ago

hiAndrewQuinn15d ago

iveqy15d ago

It does not solve all supply chain issues, but it does solve some of them.

Not being able to see whether the source code shipped is the same as what was used to create the binary is scary.

murderfs15d ago

Has there been a single publicly known attack that would have been prevented by this?

atoav15d ago

Reproducible builds do not solve all issues, as you rightly observed, but they can be a stepping stone (or even a pre-condition) for further measures.

mschuster9115d ago

That's not what reproducible builds aim to prevent, and no one claims that. When upstream pushes bad code, that's on upstream.

ownagefool15d ago

Reproducible builds show that, within a specific configuration, the code produced the binary, regardless of who signed or published it.

shevy-java15d ago
