Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pkgs/top-level: make package sets composable #303849

Open
wants to merge 9 commits into
base: master
Choose a base branch
from

Conversation

wolfgangwalther
Copy link
Contributor

@wolfgangwalther wolfgangwalther commented Apr 13, 2024

Description of changes

The various pkgsXYZ top-level package sets did not pass localSystem / crossSystem options to lower levels, so far. This change propagates the original arguments, i.e. localSystem0 / crossSystem0 to lower levels. Those include the overrides made by an upper package set.

Example: pkgsStatic adds crossSystem = { isStatic = true }. pkgsStatic.pkgsMusl previously lost this setting, because it didn't propagate crossSystem0, yet.

The first commit removes / rewrites some outdated comments, which referred to the cross and i686 package sets. Meanwhile, we have a lot more, so I rewrote those comments to not say anything about specific sets anymore.

The second commit adds some tests to confirm composability of package sets. This helped me coming up with the fixes, I'm not sure whether it should be kept. It takes about 8-9 seconds for me to run. It also helps showing what the remaining commits fix by uncommenting the previously broken tests.

The remaining commits fix those composability issues. This:

One open question for me is how to deal with pkgsXYZ.pkgsCross. This has practical challenges (#114510), but is also conceptually unsound: In a way pkgsCross expects to reset crossSystem from scratch, because the whole point about this layer is to start with "well known" example configurations. IMHO, it's fine to make modifications on top of that. e.g. pkgsCross.XYZ.pkgsLLVM or pkgsCross.XYZ.pkgsStatic, but it makes little sense to have pkgsLLVM.pkgsCross... or pkgsStatic.pkgsCross.... Probably completely unusable is pkgsCross.XYZ.pkgsCross.ABC.

One idea would be to just restrict pkgsCross to be used at the top-level, for example by adding something like this in each package set, including pkgsCross itself: [...]

That would also disallow pkgsMusl.pkgsCross.XYZ, which I think is about the only remotely reasonable use-case here: "cross compiling from a native musl environment". I'm not sure whether anyone uses that, though, and whether it's worth supporting at all.

Things done

  • Built on platform(s)
    • x86_64-linux
    • aarch64-linux
    • x86_64-darwin
    • aarch64-darwin
  • For non-Linux: Is sandboxing enabled in nix.conf? (See Nix manual)
    • sandbox = relaxed
    • sandbox = true
  • Tested, as applicable:
  • Tested compilation of all packages that depend on this change using nix-shell -p nixpkgs-review --run "nixpkgs-review rev HEAD". Note: all changes have to be committed, also see nixpkgs-review usage
  • Tested basic functionality of all binary files (usually in ./result/bin/)
  • 24.05 Release Notes (or backporting 23.05 and 23.11 Release notes)
    • (Package updates) Added a release notes entry if the change is major or breaking
    • (Module updates) Added a release notes entry if the change is significant
    • (Module addition) Added a release notes entry if adding a new NixOS module
  • Fits CONTRIBUTING.md.

Add a 👍 reaction to pull requests you find important.

@K900
Copy link
Contributor

K900 commented Apr 14, 2024

Eval is broken :(

@K900
Copy link
Contributor

K900 commented Apr 14, 2024

That said, this is generally good stuff and we should land this (though maybe after 24.05?)

@lf-
Copy link
Member

lf- commented Apr 14, 2024

I don't have time to review this right now but I just want to say, thank you so much for writing this, and your commit messages are great!

@wolfgangwalther
Copy link
Contributor Author

Eval is broken :(

I haven't been able to find out why, yet :/

@wolfgangwalther
Copy link
Contributor Author

Eval is broken :(

I haven't been able to find out why, yet :/

Will change to "draft" until I have the time to figure that out.

@wolfgangwalther wolfgangwalther marked this pull request as draft May 5, 2024 16:03
@wegank wegank added the 2.status: merge conflict This PR has merge conflicts with the target branch label May 22, 2024
@wolfgangwalther
Copy link
Contributor Author

I rebased on latest master and should have fixed eval along the way.

@ofborg ofborg bot removed the 2.status: merge conflict This PR has merge conflicts with the target branch label May 24, 2024
@wolfgangwalther
Copy link
Contributor Author

Improved the commit messages, fixed another eval failure and added another commit to do the following, as mentioned above:

One idea would be to just restrict pkgsCross to be used at the top-level

Let's see where ofborg fails this time.

@wolfgangwalther
Copy link
Contributor Author

Rebased and cherry-picked #314845. This makes ./outpaths.nix evaluate locally for me.

@wolfgangwalther
Copy link
Contributor Author

wolfgangwalther commented May 26, 2024

eval passes now. Ready for review. Please ignore the first commit (see other comment).

@wolfgangwalther wolfgangwalther marked this pull request as ready for review May 26, 2024 14:58
@Ericson2314
Copy link
Member

Thank for doing a very thorough job documenting your work, @wolfgangwalther, but I am a little worried about passing this information. it was originally very intentional that I transformed the ugly/bad localSystem and crossSystem to other better arguments things in the process of going from pkgs/top-level/default.nix -> pkgs/top-level/stage.nix.

I think I see why you are doing this, but another way to look at this is that having the other package sets mashed into stage.nix like this is --- while convenient --- and architectural abomination. The package sets are are supposed to be intended, and the proper way to get them is just to call nixpkgs with different arguments. Having some convience example calls is fine, but it was a mistake to ever make this part of the package set. pkgsCross being bad makes it unsurprising to me that supporting it forces further architectural imperfections. It's "virally" the wrong thing.

@wolfgangwalther
Copy link
Contributor Author

wolfgangwalther commented May 26, 2024

@Ericson2314 I'm not sure what the takeaway from this is. Are you saying that package sets should not be composed? I.e. we should only have (example) pkgsStatic at the top-level, but not pkgsCross.XYZ.pkgsStatic?

Clearly the status-quo, where it's possible to call them, but they only compose in parts, but not in others, is a problem.

@wolfgangwalther
Copy link
Contributor Author

Overall, I can find 5 different approaches how to do deal with those package sets:

  • Get rid of all of them and force users to pass arguments to nixpkgs.
  • Get rid of them when nested. pkgsStatic exists, but pkgsCross.XYZ.pkgsStatic is forbidden (either undefined or throwing an error).
  • Status quo: Have them, but let them compose badly.
  • Change them to always override all previous settings. So pkgsCross.XYZ.pkgsStatic would not be cross anymore, but just the same as pkgsStatic directly.
  • Make them compose well. pkgsCross.XYZ.pkgsStatic would do exactly what you'd expect it to do.

Obviously, I'm in favor of the last option.

Once we agree on where this should go, we can look into how to make this nicer architecturally.

@wolfgangwalther wolfgangwalther requested a review from alyssais as a code owner May 28, 2024 19:29
@github-actions github-actions bot added the 6.topic: lib The Nixpkgs function library label May 28, 2024
@wolfgangwalther
Copy link
Contributor Author

but I am a little worried about passing this information. it was originally very intentional that I transformed the ugly/bad localSystem and crossSystem to other better arguments things in the process of going from pkgs/top-level/default.nix -> pkgs/top-level/stage.nix.

I have changed the approach from "pass localSystem and crossSystem to stage.nix" to "make nixpkgsFun itself composable". So "dealing with the bad/ugly localSystem/crossSystem" only happens in default.nix now - stage.nix is a lot cleaner than before.

@Ericson2314 Do you like this approach better?

@m-bdf
Copy link
Contributor

m-bdf commented Jan 10, 2025

pkgsMusl seems broken on NixOS:

$ nix repl github:wolfgangwalther/nixpkgs/compose-pkgs-sets
> system = lib.nixosSystem { system = "x86_64-linux"; modules = []; }
> system.pkgs.pkgsMusl.stdenv.hostPlatform.libc
"glibc"

nixpkgs master prints "musl" as expected.

@wolfgangwalther
Copy link
Contributor Author

Hmm... I do this and get the expected result:

nix-repl> pkgs = import ./default.nix { }

nix-repl> pkgs.pkgsMusl.stdenv.hostPlatform.libc
"musl"

Not familiar too much with lib.nixosSystem, yet. That's flake-specific, right?

Unfortunately, nix repl github:wolfgangwalther/nixpkgs/compose-pkgs-sets doesn't work with my version of lix. Seems to be a newer nix feature to pass a github reference like that to repl?

@pwaller
Copy link
Contributor

pwaller commented Jan 10, 2025

Not familiar too much with lib.nixosSystem, yet. That's flake-specific, right?

It's the same thing as evaluating a nixos system which you can do with nix-instantiate --eval from a nixpkgs checkout; I see the same issue there, which is interesting:

$ nix-instantiate --eval --arg configuration {} ./nixos -A pkgs.pkgsMusl.stdenv.hostPlatform.libc
glibc

@wolfgangwalther
Copy link
Contributor Author

It's the same thing as evaluating a nixos system which you can do with nix-instantiate --eval from a nixpkgs checkout;

Ah, thanks. I can reproduce it with that, indeed. Really odd. Will look into it on the weekend.

@wolfgangwalther
Copy link
Contributor Author

This can be reduced down to this:

% nix-instantiate --eval --strict -E 'with import ./. {}; let result = lib.systems.elaborate { config = "x86_64-unknown-linux-musl"; isMusl = false; }; in { inherit (result) config libc; }'
{ config = "x86_64-unknown-linux-musl"; libc = "glibc"; }

Imho, this is an existing bug in lib.systems.elaborate, that we're uncovering here. It doesn't return a valid system, because it doesn't know which of config or isMusl to take as the source of truth.

This removes all specific references to pkgsCross or pkgsi686Linux, because
they have become outdated with the addition of many more package sets.
…e bottom

No change, just move appendOverlays and extend to the bottom, since they
will be changed much less often. This makes it easier to compare the
other package sets side-by-side.
Sharing a first piece of common code between all package sets makes it
easier to maintain and less likely to introduce a new package set
without this.
@wolfgangwalther
Copy link
Contributor Author

@m-bdf thanks for your feedback, I applied the suggestion for the tests.

pkgsMusl seems broken on NixOS:

This works now:

% nix-instantiate --eval --arg configuration {} ./nixos -A pkgs.pkgsMusl.stdenv.hostPlatform.libc
"musl"

Imho, this is an existing bug in lib.systems.elaborate, that we're uncovering here.

Not much we can do about systems.elaborate, it's not designed to return "correct" systems with arbitrary overrides applied. pkgs/top-level/default.nix takes care to never pass on the elaborated system, but the original localSystem/crossSystem args for that purpose. I added a new commit to make the nixos-nixpkgs module work the same way - passing on the original arguments instead of a fully elaborated system.

@github-actions github-actions bot added 6.topic: nixos Issues or PRs affecting NixOS modules, or package usability issues specific to NixOS 8.has: module (update) This PR changes an existing module in `nixos/` 6.topic: module system About "NixOS" module system internals 6.topic: lib The Nixpkgs function library labels Jan 11, 2025
@nix-owners nix-owners bot requested review from roberth and infinisil January 11, 2025 10:50
@github-actions github-actions bot added 10.rebuild-linux: 1-10 and removed 10.rebuild-linux: 0 This PR does not cause any packages to rebuild on Linux labels Jan 11, 2025
pkgs/test/top-level/stage.nix Outdated Show resolved Hide resolved
# Basic test for idempotency of the package set, i.e:
# Applying the same package set twice should work and
# not change anything.
isIdempotent = set: discardEvaluationErrors (pkgs.${set}.stdenv == pkgs.${set}.${set}.stdenv);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

stdenv is a derivation, and == on derivations is limited to outPath.
So this would be equivalent and avoid confusion:

Suggested change
isIdempotent = set: discardEvaluationErrors (pkgs.${set}.stdenv == pkgs.${set}.${set}.stdenv);
isIdempotent = set: discardEvaluationErrors (pkgs.${set}.stdenv.outPath == pkgs.${set}.${set}.stdenv.outPath);

However, I think you do want to check more:

Suggested change
isIdempotent = set: discardEvaluationErrors (pkgs.${set}.stdenv == pkgs.${set}.${set}.stdenv);
isIdempotent = set: discardEvaluationErrors (pkgs.${set}.stdenv.outPath == pkgs.${set}.${set}.stdenv.outPath
&& pkgs.${set}.stdenv.hostPlatform == pkgs.${set}.${set}.stdenv.hostPlatform
&& pkgs.${set}.stdenv.buildPlatform == pkgs.${set}.${set}.stdenv.buildPlatform
);

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hm.. according to the test failures I got during development, I assumed the outPath changes when host or build platform change. Is that not correct?

I tried adding the hostPlatform & buildPlatform checks. Instead of ==, I had to use lib.systems.equals. I didn't get any test failures this way. This doesn't mean I don't need those changes, but do you have any example where a change to hostPlatform / buildPlatform would not affect stdenv's outPath?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This seems to support the idea that those platforms should affect stdenv's outpath:

inherit buildPlatform hostPlatform targetPlatform;

Copy link
Contributor Author

@wolfgangwalther wolfgangwalther Jan 11, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For now, I made the .outPath comparison explicit, but didn't add the platform checks, yet.

Edit: I reverted the change to use outPath, because nixfmt then requires it to be split across two lines awkwardly. This'd be a net-negative for readability.


let
# To silence platform specific evaluation errors
discardEvaluationErrors = e: (builtins.tryEval e).success -> e;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Extensive use of this makes the test less effective.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Without those, I have errors like this quickly:

error: x86_64 Darwin package set can only be used on Darwin systems.

Those are the errors thrown in pkgs/top-level/stage.nix. Any idea how I could target those more specifically - without replicating the logic around those into the tests?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I went through the code again and I think discardEvaluationErrors does not necessarily make the test less effective. In fact it makes it more specific. The calls to this function are placed only around conditionals, so the evaluation errors can only come from evaluating those package sets.

These tests are not meant to test the evaluation of each package set, but to test the composability of them, i.e. they are supposed to test the code in stage.nix, not further downstream. If stdenv for a certain package set is broken, this is not the place to test it, imho.

pkgs/top-level/stage.nix Outdated Show resolved Hide resolved
@wolfgangwalther wolfgangwalther force-pushed the compose-pkgs-sets branch 2 times, most recently from c472e79 to 0101943 Compare January 11, 2025 13:03
This adds some basic tests to compose package sets. The cases that are
currently broken
are commented out, they include things like:

- pkgsStatic.pkgsMusl losing the isStatic flag
- pkgsCross.ppc64-musl.pkgsMusl losing the gcc.abi setting
- pkgsCross.mingwW64.pkgsStatic losing the config string
- pkgsLLVM.pkgsMusl losing the useLLVM flag
- pkgsLLVM.pkgsStatic losing the useLLVM flag
- pkgsLLVM.pkgsi686Linux losing the useLLVM flag

And probably more.
Passing the elaborated system defeats what pkgs/top-level/default.nix
tries to do: Pass only the original args and let defaults be inferred.

The underlying problem is that lib.systems.elaborate can not deal with
arbitrary overrides, but will often return an inconsistent system
description when partially overriding some values. This becomes most
prominent if trying to override an already elaborated system.
The various pkgsXYZ top-level package sets did not pass localSystem /
crossSystem to lower levels, so far. This change propagates original
arguments to lower levels, which include the overrides made by an upper
package sets.

There is an extensive test-suite to test various combinations of package
sets in pkgs/test/top-level. There are a few basic promises made:

- Package sets must be idempotent. pkgsMusl.pkgsMusl === pkgsMusl.

- Once pkgsCross is used any subsequent package sets should affect the
  **host platform** and not the build platform. Examples:
  - pkgsMusl.pkgsCross.aarch64-multiplatform is a cross compilation from
musl to glibc/aarch64
  - pkgsCross.aarch64-multiplatform.pkgsMusl is a cross compilation to
musl/aarch64

- Modifications from an earlier layer should not be lost, unless
  explicitly overwritten. Examples:
  - pkgsStatic.pkgsMusl should still be static.
  - pkgsStatic.pkgsCross.gnu64 should be static, but with glibc instead
of musl.

Exceptions / TODOs:
- pkgsExtraHardening is currently not idempotent, because it applies the
  same flags over and over again.

Supersedes NixOS#136549
Resolves NixOS#114510
Resolves NixOS#212494
Resolves NixOS#281596
When using pkgsCross with a system that ends up the same as the
localSystem, then modifications for package sets like pksgMusl need to
be done for **both** localSystem and crossSystem. Consider the following
on x86_64-linux:

  pkgsCross.gnu64.pkgsMusl

Before this change, this would result in a musl buildPlatform, but a gnu
hostPlatform. This breaks the promise of "stacking" package sets on top
of each other.

After this change, it results in a musl buildPlatform and a musl
hostPlatform. This works better.

One could expect this to result in the same as pkgsCross.musl64, i.e. a
gnu buildPlatform and a musl hostPlatform, however I couldn't get this
to work without increasing memory usage for ci/eval by many, many GB.
This is caused by usage of pkgsi686Linux inside the main package set,
which follows the same hybrid pattern.
@wegank wegank removed the 12.approvals: 1 This PR was reviewed and approved by one reputable person label Jan 11, 2025
mkPkgs = name: fn: nixpkgsFun (prevArgs: let nixpkgsArgs = fn prevArgs; in nixpkgsArgs // {
overlays = [
(self': super': {
"${name}" = super';
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this overlay still needed now that the package sets are composable?

It forces accessing an already accessed package set to be a no-op, which it now already is without the overlay for simple cases (e.g. pkgsStatic.pkgsStatic), and I believe is not desirable for more complex cases (e.g. when using buildPackages):

> pkgs.pkgsStatic.buildPackages.pkgsStatic.stdenv.hostPlatform.libc
"glibc"

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've found that without the overlay above, using a package set in an overlay stack overflows:

> (pkgs.extend (final: prev: { inherit (final.pkgsStatic) hello; })).hello
error: stack overflow; max-call-depth exceeded

So a mechanism to avoid infinite recursion is indeed still needed, however I think the current one is not ideal, as it leads to unwanted no-ops (see the example in my previous comment).

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FYI, max-call-depth is a relatively recent addition to Nix, and the limit of 10000 was set arbitrarily.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
6.topic: lib The Nixpkgs function library 6.topic: module system About "NixOS" module system internals 6.topic: nixos Issues or PRs affecting NixOS modules, or package usability issues specific to NixOS 8.has: clean-up 8.has: module (update) This PR changes an existing module in `nixos/` 8.has: package (new) This PR adds a new package 10.rebuild-darwin: 1-10 10.rebuild-linux: 1-10
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Build failure: pkgsCross.mingwW64.pkgsStatic.stdenv