#nixos-dev on 2018-10-03

2018-08-16 20:49 gchristensen changed the topic of #nixos-dev to: NixOS Development (#nixos for questions) | https://hydra.nixos.org/jobset/nixos/trunk-combined https://channels.nix.gsc.io/graph.html | 18.09 release managers: vcunat and samueldr | https://logs.nix.samueldr.com/nixos-dev

01:01 <ekleog> <zimbatm> ekleog: do you know if signed commits change the commit hash? <-- so there are multiple ways to sign commits, 3 have been discussed on the RFC

01:02 <ekleog> Way 1 is git's `git commit -S` / `git merge -S`. This one changes the commit hash, you can't sign a posteriori

01:03 <ekleog> Way 2 and 3 are both signing metadata relating to the commit and storing it in `git notes`. These ones don't change the commit hash, but rather push a commit on an unrelated branch `refs/signatures`

01:03 <ekleog> Way 2 signs basically the commit SHA-1 + metadata

01:04 <ekleog> Way 3 signs the patch-id, commit on which it's supposed to apply and [I don't really remember everything, it seems way too complex for no benefit to me]

01:05 * ekleog is in favor of way 2, which allows multiple persons to sign the same commit without any workflow change if we want to switch to it for sensitive packages later

01:29 lassulus_ has joined #nixos-dev

01:32 lassulus has quit [Ping timeout: 244 seconds]

01:32 lassulus_ is now known as lassulus

02:15 lopsided98 has quit [Quit: Disconnected]

02:17 lopsided98 has joined #nixos-dev

02:38 <samueldr> hi all! if you can spare some time and find nits and pick them out of this PR, it would be lovely, thanks! https://github.com/NixOS/nixpkgs/pull/47697

02:38 <{^_^}> #47697 (by samueldr, 1 minute ago, open): Manual: reviews partitioning steps

02:49 Synthetica has quit [Quit: Connection closed for inactivity]

02:52 orivej has quit [Ping timeout: 252 seconds]

03:48 <gchristensen> ekleog: the problem is git-notes suck, aren't nice for rebasing, pushing, pulling, merging, conflict-resolution, etc.

03:49 <gchristensen> there is a reason github dropped support for displaying git notes years ago

03:58 <ekleog> gchristensen: well, git's internal signatures aren't nice for any of those things either, and I think that's actually a good thing :)

03:58 <ekleog> like, I want to sign a commit in a known context, not sign a commit that'll maybe be rebased in a situation where it introduces a vulnerability

03:59 <ekleog> the point of git notes being only to have a way to associate (multiple) signature(s) to a commit after the commit is pushed

04:01 <ekleog> (tbh, I don't really understand why git didn't make its internal signatures based on git-notes in the first place, it's much nicer for ability to rebase, because you can rebase and then ask people to re-sign their commits, while with internal signatures you'd need to be synchronized with the others to in turn re-sign and re-push the correct commits)

04:03 * ekleog discovered about git notes only recently but fell in love with the idea, even if not really with the tool

04:06 <gchristensen> git notes are a disaster because of how detached they are

04:06 <gchristensen> git commit signatures work properly because they are part of the commit, not a forgotten appendage

04:08 <gchristensen> I used git notes to store security annotation data for commits, it was terrible to deal with conflicts, and I didn't even have to deal with more than 1 person committing to them

04:08 <gchristensen> everybody falls in love with git-notes because they're so cool, but man

04:09 <ekleog> the cat_uniq_sort merge strategy didn't do the trick?

04:10 <ekleog> Here it'd be storing an ever-growing set of signatures in the notes for each commit, so I'd guess cat_uniq_sort would take care of all conflict

04:10 <ekleog> s

04:11 <ekleog> the fear I have with regular git commit signatures is that it's not possible to sign an old commit we forgot to sign, which isn't really a problem with few people committing… but in nixpkgs I guess we should expect this kind of mistakes to happen

04:12 <gchristensen> well there is no need to sign an old commit, every commit inherently signs all of history

04:13 <gchristensen> maybe it would work fine, but since it is a weird part of git nobody really knows how to use it and the tools work less nicely, and external tools which know about git worked barely at all with it

04:14 <gchristensen> I'm not sure mashing up two user-hostile tools in to one feature is a good idea

04:16 <gchristensen> anywho, it is long past my bedtime, see you

04:25 lopsided98 has quit [Quit: Disconnected]

04:27 lopsided98 has joined #nixos-dev

04:27 <ekleog> see you :)

04:28 <ekleog> gchristensen: (hl so you see when you come back) the problem with “every commit inherently signs all of history” is “A pushes an unsigned backdoor, B pulls, signs a merge, pushes”: here, B has taken responsibility for A's backdoor

04:29 <ekleog> it's possible to say B must check all commits on fetch, but that requires tooling that's not yet present and adds a failure mode, or requires a *lot* of human brainpower

04:30 <ekleog> otoh, if we consider each commit to be signed independently (ie. not signing the previous state of history), then we can just say “ok A's commit has never been signed, there's something weird in the history”, hydra would be automatically blocked and people would look into the issue

04:32 <ekleog> but yeah, the problem is mostly about external tools that know about git, they wouldn't know at all about said signatures… though I can't see a practical example of issues happening due to it right now :)

04:44 orivej has joined #nixos-dev

04:59 srk has quit [Ping timeout: 246 seconds]

05:01 srk has joined #nixos-dev

05:17 orivej has quit [Ping timeout: 268 seconds]

06:28 FRidh has joined #nixos-dev

07:05 jtojnar has quit [Remote host closed the connection]

07:19 {^_^} has quit [*.net *.split]

07:19 globin has quit [*.net *.split]

07:19 phreedom_ has quit [*.net *.split]

07:32 {^_^} has joined #nixos-dev

07:32 globin has joined #nixos-dev

07:34 phreedom has joined #nixos-dev

07:40 Sigyn has quit [Quit: People always have such a hard time believing that robots could do bad things.]

07:40 Sigyn has joined #nixos-dev

08:13 {^_^} has quit [*.net *.split]

08:13 globin has quit [*.net *.split]

08:13 phreedom has quit [*.net *.split]

08:17 FRidh has quit [Quit: Konversation terminated!]

08:20 <andi-> what can we do about the issue with the build timeouts of 10h due to local nix-daemon configuration? 18.03 has been blocked for about 12 days or so because the chromium builds have been terminated after 10h. There is a more serious firefox update arriving today and it would be nic to release within a few days at most.

08:22 phreedom has joined #nixos-dev

08:23 <srhb> andi-: What's this about local nix-daemon configuration? A situation where meta.timeout is not respected?

08:24 <andi-> yes, https://github.com/NixOS/hydra/issues/591

08:24 <srhb> andi-: Thanks. Ack.

08:27 {^_^} has joined #nixos-dev

08:27 globin has joined #nixos-dev

08:27 <clever> srhb: the `timeout=` in nix.conf has priority, even when absent (the default value in the config then takes over)

08:27 <clever> srhb: so you need to set it per-slave: https://github.com/input-output-hk/iohk-ops/blob/master/modules/hydra-slave.nix#L19

08:28 <srhb> clever: Meh, that's not good...

08:30 <andi-> If I could hit the restart button I could at least hit it until we get lucky with a faster/idle build machine :/

08:32 <srhb> andi-: Our options are certainly limited, because removing those large builds from the tested set is not on the table (has been discussed before) -- so I guess we're down to 1) fixing the override of the meta.timeout and friends values, disabling the nix-daemon on all builders, creating a new feature flag for non-daemonized stores, or restarting ad hoc.

08:32 <srhb> Oh, I gave up on numbering after 1... *needs more coffee)

08:33 <clever> srhb: nix-daemon doesnt have to be disabled, you just have to ssh into root and run `nix-store --serve` as root

08:33 <clever> then it will ignore the daemon

08:33 <clever> but then hydra has root on all build slaves...

08:33 <srhb> That seems less likely to be a thing we can/want to do in the short term. :-)

08:33 <srhb> I think feature flag is the most palatable quickfix.

08:34 <clever> the protocols between hydra<->nix-store --serve<->nix-daemon have to be improved to allow forwarding this all the way

08:34 <clever> i think the problem is the serve<->daemon layer

08:34 <ekleog> wouldn't a quickfix just to increase builders' timeout setting?

08:34 <ekleog> +be

08:34 <clever> yeah

08:34 <srhb> Definitely.

08:35 <clever> in the case of iohk, the problem is the reverse, tests that just deadlock and stay running for 48 hours, so we needed to decrease the timeout

08:35 <ekleog> that can be done with meta.timeout, though, can't it?

08:35 <srhb> ekleog: The issue is that it's ignored on daemonized builder nodes.

08:35 <clever> ekleog: meta.timeout is ignored when nix-daemon is at play

08:35 <ekleog> oh. I thought nix-daemon took the min between the two values

08:35 <ekleog> which would make sense imo

08:35 <srhb> If it does, the problem is the same..

08:36 <srhb> (At least unless we do bump the nix.conf setting)

08:36 <ekleog> well, bump daemon's timeout to a lot, down iohk's timeout to not a lot

08:36 <clever> ekleog: i believe the protocol between `nix-store --serve` and `nix-daemon` doesnt support pushing over the timeout value

08:36 <ekleog> ugh, that'd be bad :(

08:37 <ekleog> ugh. https://github.com/NixOS/nix/issues/50

08:37 <{^_^}> nix#50 (by rbvermaa, 6 years ago, closed): Build timeout not passed to nix daemon

08:38 <ekleog> oh wait

08:38 <ekleog> it's good actually, didn't see it was closed with a commit

08:40 <clever> that commit is in the perl code for build remotes

08:40 <clever> all perl is gone

08:45 <ekleog> I can confirm that for a local daemon, the --timeout option works, so the daemon protocol can understand it

08:45 <ekleog> now, between hydra and nix-store --serve, I have no easy way of testing :/

08:46 <ekleog> well, in the meantime, maybe it'd make sense to bump the timeout to 15 hours or similar, so that builds are unblocked? having overloaded hydra is still better than having completely blocked hydra

08:46 <clever> sure

08:47 <srhb> It's more like 48 hours :-)

08:47 <srhb> (That's what we determined on the builders that _do_ respect meta.timeout)

08:48 <andi-> I saw a build that succeeded in ~3.5h on one of the epyc machines

08:49 <srhb> Yeah, it can go really fast in good situations.

08:49 <andi-> How do we force good situations? Break all other channels? ;-)

08:49 <srhb> iirc it's mostly about load at any given time.

08:50 <srhb> We don't have that sort of granularity.

08:50 <srhb> You'd want certain builds to "balloon" taking up more slots than usual.

08:51 {^_^} has quit [*.net *.split]

08:51 globin has quit [*.net *.split]

08:51 phreedom has quit [*.net *.split]

08:58 <andi-> moments of truth.. 8min until termination https://hydra.nixos.org/build/82320297

08:59 * adisbladis is taking bets

08:59 phreedom has joined #nixos-dev

09:03 <ekleog> 2 minutes

09:03 * ekleog bets failure, it's still missing 9k objects

09:05 {^_^} has joined #nixos-dev

09:05 globin has joined #nixos-dev

09:05 <andi-> 14h should just be enough to build it at the current performance :/

09:05 <adisbladis> And I thought webkitgtk was bad..

09:05 <ekleog> huh did it just go over 10hrs?

09:06 <ekleog> this would be a builder where meta.timeout is respected?

09:06 <ekleog> oh, no

09:07 <andi-> oh noes, he said webkitgtk.. thats also somewhere on my todo list :/

09:07 <ekleog> andi-: if you don't have access to the retrigger build button, just push a one-space-change-to-builder commit? :°

09:08 <andi-> I'd rather not do that... Also I have the restart button and role/permission but it always returns a 403 ^^

09:08 <ekleog> oh ^^'

09:08 <ekleog> well… in the meantime…

09:09 <ekleog> niksnut: would it be possible to bump the timeout of hydra to at least 24 hours or so? currently chromium timeouting is blocking 18.03 :/

09:09 <ekleog> s/hydra/& builders/

09:10 <andi-> If any of you are bored in the meantime: I am looking for a reviewer for the firefox bumps #47714 #47712 #47713 :-)

09:10 <{^_^}> https://github.com/NixOS/nixpkgs/pull/47714 (by andir, 1 hour ago, open): [18.09] firefox{-bin,}: 62.0.2 -> 62.0.3, firefox-esr-60: 60.2.1 -> 60.2.2

09:10 <{^_^}> https://github.com/NixOS/nixpkgs/pull/47713 (by andir, 1 hour ago, open): [18.03] firefox{-bin,}: 62.0.2 -> 62.0.3, firefox-esr-60: 60.2.1 -> 60.2.2

09:10 <{^_^}> https://github.com/NixOS/nixpkgs/pull/47712 (by andir, 1 hour ago, open): firefox{-bin,}: 62.0.2 -> 62.0.3, firefox-esr-60: 60.2.1 -> 60.2.2

09:10 <ekleog> niksnut: (later solutions would be fixing https://github.com/NixOS/hydra/issues/591 to make hydra always respect meta.timeout, as I understand it, cf. the discussion preceding these hl)

09:10 <{^_^}> hydra#591 (by cleverca22, 4 weeks ago, open): meta.timeout does not always work

09:15 <ekleog> andi-: hmm… what's there to review? it's all just version and hash bumps, so if it builds it should be ok?

09:21 <andi-> ekleog: just making sure I didn't miss anything :-)

09:22 <andi-> I build them all and tested in the mean time..

09:27 <ekleog> well, then I can confirm these look very much the same as your previous security fixes from https://github.com/NixOS/nixpkgs/pull/47277 ; so if those were correct the new ones should be too :)

09:27 <{^_^}> #47277 (by andir, 1 week ago, merged): [18.09] firefox, firefox-bin 61.0.2 -> 62.0.2, firefox-esr: 60.2.0esr -> 60.2.1esr [Moderate security fixes]

09:30 <andi-> Ohhh, srhb I think I figured what I am permitted to restart (and probably you as well). If you go to a jobset (e.g. https://hydra.nixos.org/jobset/nixos/release-18.03) you can click actions -> restart all failed/aborted

09:30 <andi-> It feels like a shotgun mode where I'd have expected to just be able to retry individual builds

09:34 roberth has joined #nixos-dev

09:34 <roberth> domenkozar and I are working on a brand new Continuous Integration service for Nix users. Check out https://hercules-ci.com!

09:38 <domenkozar> could someone proof read the weekly? :) http://weekly.nixos.org/preview/2018/10-arch64-builders-nixops-alternative-optimized-docker-layers-hercules-ci.html

09:42 <ekleog> for krops, I'd have put commas around “the official DevOps tool of NixOS”

09:43 <ekleog> missing caps for “developer friendly backdoor to VM tests infrastructure”

09:43 <ekleog> “Host build agents where you want” -> maybe “wherever”? unsure about that though

09:45 <niksnut> isn't chromium supposed to be built on big-parallel machines?

09:46 <domenkozar> ekleog: thanks :)

09:48 <srhb> domenkozar: s/theses/these and mismatch between optimize/optimising

09:49 <andi-> niksnut: it is.. I am not sure where I can see what features the builders have.

09:50 <domenkozar> srhb: thanks - otherwise looks good?

09:51 <srhb> domenkozar: Yep! LGTM :)

09:51 <domenkozar> thanks!

09:51 <clever> andi-: https://hydra.nixos.org/queue-runner-status i think

09:51 <clever> andi-: under the machines key

09:52 <andi-> clever: thanks

09:56 <andi-> The build machines have the big-parallel feature but still it takes them a while to build it.

10:10 orivej has joined #nixos-dev

10:29 orivej has quit [Ping timeout: 252 seconds]

10:40 <andi-> Looking at the output I get the impression it is building on a single core..

10:58 lassulus has quit [Ping timeout: 268 seconds]

11:11 lassulus has joined #nixos-dev

11:20 Synthetica has joined #nixos-dev

11:52 LnL has quit [Ping timeout: 244 seconds]

11:54 orivej has joined #nixos-dev

11:58 LnL has joined #nixos-dev

12:45 <ekleog> domenkozar: you're welcome :) (and sorry for not finishing with a “thank you!”, I've been interrupted IRL :°

12:45 <ekleog> )

12:50 <gchristensen> does anyone have feelings about adding "MLX5_CORE_EN m" or "MLX5_CORE_EN y" to our default kernel config?

13:51 <thoughtpolice> gchristensen: That's a fancy network card, is what I feel.

13:54 <thoughtpolice> Jokes aside, adding kernel modules to the default closure mostly gets annoying when you want to override things, like slimmer kernel builds, in my experience. (Really this is also due to the fact our kernel config driver tool is a huge hack, too). Maybe this has changed. I don't think that's worth holding off though. Mellanox cards aren't exactly unheard of.

13:54 <samueldr> I'll have to have eyes on #47697, I think I had a couple peeps checking, but no one left a review or approval :/

13:54 <{^_^}> https://github.com/NixOS/nixpkgs/pull/47697 (by samueldr, 11 hours ago, open): Manual: reviews partitioning steps

13:54 <thoughtpolice> Plus it's obviously quite grating to plug in some hardware and realise "I need to rebuild my kernel" suddenly, on its own.

13:55 <samueldr> thoughtpolice: I think this case is pretty self-serving where the nixos aarch64 community builder uses such a card (unless it's yet another one)

13:55 <thoughtpolice> I'm guessing the only reason we don't have it already, honestly, is because a lot of people don't just have that kind of gear sitting around, but they're pretty popular cards for high-end network deployments. Most people don't get them on a cloud.

13:55 <samueldr> so if I were to give a plus or a minus, it would be a +1

13:55 <thoughtpolice> Right, exactly.

13:56 <thoughtpolice> I think of it as, "If someone had done this sooner, they would have definitely added it", so in that sense it seems pretty good to just go ahead and add it and be happy.

13:57 <samueldr> I think the other detail to check would be: why does upstream (kernel) keeps it =n ? if no real reason or nothing bad, just do it :)

13:58 <sphalerite> yeah I'm +1 for =m

14:04 catern has joined #nixos-dev

14:11 <ekleog> I'm +1 for https://github.com/NixOS/nixpkgs/pull/42838 , and having hardware-configuration.nix automatically list this module if detected to be necessary

14:11 <{^_^}> #42838 (by teto, 13 weeks ago, open): [RFC] add ability to merge structured configs

14:12 <ekleog> now… well, =m can make the job in the meantime :°

14:12 <ekleog> oh actually teto finished it 4 hours ago, or so it seems… nice timing :°

14:16 <Dezgeg> why should this particular module be any different from the other bajillon modules we have on by default?

14:19 <ekleog> my opinion is the other modules should also be auto-detected this way, but it'd need benchmarking to figure out whether by compiling only the required modules we can drop down to an acceptable build time (like gentoo's 2-3 min max, not like nixos' current 20-30 minutes)… if it's not the case, staying with everything-on at least makes the cache work, but… :/

14:20 * ekleog wonder whether it'd be possible to have =m-compiled modules in separate outputs and to fetch them on-demand

14:23 <sphalerite> ekleog: not sure how that would work with module dependencies. It also wouldn't help with build time

14:24 <Dezgeg> how do you autodetect say, some USB hardware in advance?

14:24 <samueldr> the usb installer image will be some hell to build :)

14:25 <ekleog> sphalerite: it would make the cache able to build all modules and people able to selectively pick which ones they use, which would be a huge improvement over people currently having to rebuild the whole kernel+modules when they want any configuration not pre-built by hydra (eg. MLX5_CORE_EN, currently) -- for dependencies, can't we have cross-output dependencies?

14:26 <Dezgeg> if people find that they need MLX5_CORE_EN for their hardware they should submit a patch adding that

14:26 <ekleog> Dezgeg: the default config could still default-load undetectable external hardware

14:27 <Dezgeg> that's what the majority of modules are

14:27 <ekleog> well, MLX5_CORE_EN is a good example of one that isn't

14:27 <Dezgeg> why not? isn't it a PCI device?

14:28 <ekleog> because in practice people don't expect to hotplug pci devices without any issue

14:28 <ekleog> usb devices are a good point, pci devices are not

14:28 <sphalerite> ekleog: yes they do nowadays

14:29 <sphalerite> thunderbolt :)

14:29 <ekleog> nix defaults to turning on thunderbolt hotplugging? :/

14:29 <ekleog> nixos*

14:29 <sphalerite> idk

14:30 * ekleog sees that as a huge security vulnerability

14:30 <Dezgeg> besides, purely from the kernel config point of view you're not going to know if an option enables a PCI device or USB device

14:30 obadz has quit [Ping timeout: 252 seconds]

14:30 <sphalerite> > nixos.options.boot.initrd.luks.mitigateDMAAttacks.description

14:30 <{^_^}> "Unless enabled, encryption keys can be easily recovered by an attacker with physical\naccess to any machine with PCMCIA, ExpressCard, ThunderBolt or FireWire port.\nMore information is available at <...

14:30 <sphalerite> > nixos.options.boot.initrd.luks.mitigateDMAAttacks.default

14:30 <{^_^}> true

14:30 <ekleog> Dezgeg: well for sure it'd require vetting

14:31 <ekleog> sigh :)

14:31 <sphalerite> but the option description suggests it only disables firewire

14:31 <sphalerite> so the problem remains

14:31 <Dezgeg> there is no way for anyone to vet 10000 options

14:31 <Synthetica> Is there a way to enable hotplugging, but only after a root password check?

14:31 <ekleog> people already have vetted 10000 options

14:32 <Dezgeg> really? where?

14:32 <ekleog> I *hope* the enabled options have been checked at least once before being activated

14:32 <Dezgeg> of course not, the build script just answers 'm' where possible

14:32 <ekleog> be it by a nixpkgs committer, an ubuntu member or whomever

14:33 <ekleog> waaaait… really?

14:34 <ekleog> now, in practice the idea of having the cache build all modules as separately-fetchable outputs|derivations|whatever is much more reasonable than the one of parameterizing the kernel automatically for everyone

14:34 <andi-> That one really needs some work :/ It isn't fun trying to add builtins etc.. (not just modules)

14:34 <andi-> ekleog: Dependency resolution between modules would have to be mirrored to nix then

14:35 <ekleog> andi-: in the building derivation, use modinfo to get the dependencies, and add links to the dependencies in a nix-whatever/ directory

14:35 <ekleog> (if multiple outputs can indeed have dependencies between them)

14:35 <Dezgeg> you need to know the outputs beforehand

14:35 <ekleog> it'd “just” require having a list of modules in .nix

14:36 <ekleog> which can be automated, I think, but I'm not sure how

14:36 <sphalerite> mumble mumble we now have recursive nix

14:36 <andi-> it probably can be automated, pushes a bit of the work towards updating the build expression

14:36 <sphalerite> (although that'll never fly in nixpkgs also it's not in a release)

14:36 <sphalerite> (not sure if it's even on master)

14:36 <andi-> optional dependencies and stuff probably also exist in the kernel :/

14:37 <andi-> It will end up being cargo2nix, npm2nix,... for the kernel..

14:37 <sphalerite> Kconfig2nix :)

14:37 obadz has joined #nixos-dev

14:38 <ekleog> andi-: modinfo aesni-intel -> can't see anything that makes me think there could be optional dependencies

14:38 <ekleog> (random module picked just to check modinfo output

14:38 <ekleog> )

14:39 <Dezgeg> try something like ext4 which will optionally depend on crc32 if certain checksums are enabled

14:39 <ekleog> there's a hard depend on crc16 and no mention of crc32 here

14:39 <andi-> the "cheap" way would be to depend on all of them in that case.

14:39 <ekleog> oh nvm found it, it's not the same line

14:40 <ekleog> so yeah there's also “softdep” in addition to “depends”

14:40 <ekleog> and yeah, depending on all of them sounds like it'd make sense

14:40 <andi-> it would be a few megabytes extra but probably not tooo bad

14:40 <andi-> compared to having ALL modules :D

14:42 <samueldr> would it be realistic to implement such a feature, but still default to an all-encompassing default kernel (so not changing the defaults), but it would be built using that modular kernel?

14:42 <samueldr> so users that want to pick and match will have the independent modules already available, and no change for the defaults

14:43 <ekleog> samueldr: I can't see why it wouldn't be, insofar as implementing such a feature would not in itself be unrealistic

14:47 <sphalerite> wait so what's the goal here? :p

14:48 <ekleog> being able to add/remove modules without rebuilding for half an hour :D

14:49 <ekleog> (also, hopefully tweak the configuration of builtins, but I guess that won't be possible because changing an option for the core kernel could necessitate rebuilding some modules)

14:49 <sphalerite> I don't think that makes much sense from a practical perspective

14:51 <ekleog> well, that's how other distros do it : a main kernel package, and additional module packages for more rarely-used stuff that doesn't deserve inclusion in the main kernel package (for instance the iwlwifi network cards, always fun when you forget to install it… I'm not suggesting we go up to there)

14:53 <sphalerite> if you want to disable a module because you're concerned about the implications of it being autoloaded on a hotplug, you can blacklist it. If you want to disable a module to save space (and still want to avoid rebuilds), you can make a custom derivation that just copies bits and pieces from the stock kernel

14:54 <sphalerite> and if you want to enable something that's missing from the stock kernel, you'll be building it yourself anyway

14:55 <ekleog> unless the stock kernel starts building *everything* and not pushing everything to the end-user

14:55 <ekleog> (also, copying bits and pieces… avoiding that is exactly the reason why multiple outputs were made, isn't it?)

14:56 <sphalerite> well it should already be including everything it can as modules

14:56 <ekleog> like, the kernel modules are 58M currently… from memory (old one) my gentoo kernel was totalling only a few megs (like 5 or 10) when I used gentoo… that's a quite big difference, imo

14:57 <ekleog> to me the point of multiple derivations is exactly to not have to make a derivation that copies bits and pieces, actually :)

14:58 <sphalerite> shouldn't you also count the sources towards the size when talking about gentoo? :)

14:59 <ekleog> I'm not trying to compare NixOS to gentoo, just to mention that we could shave a *lot* on modules if we wanted to

15:00 <ekleog> (also, people have been using gentoo to build minimal-sized images that don't include the sources, in a way similar to not-os)

15:01 <sphalerite> yes, the latter is the perfect use case for a derivation that copies bits and pieces. None of the fuss with translating the whole kernel dependency structure into nix

15:02 <ekleog> parenthesis are not the main point

15:02 <ekleog> (otherwise I'd be comparing with not-os, which does handle this quite well)

15:02 <sphalerite> you can just use the existing information in an already built kernel tree to get the closure of the modules you care about and copy that into a new derivation which includes only the modules you're interested in — without building the kernel from source, and keeping the full 60MB things only on the machine building the image

15:03 <sphalerite> I'm not convinced that the potential size savings are worth the engineering effort

15:03 <sphalerite> that's my point

15:03 <ekleog> well, what I'm saying is that the kernel-building derivation could use the information from the built kernel tree to copy each module into an appropriate output

15:03 <Dezgeg> how do you know a priori what outputs are needed?

15:03 <ekleog> the only translation that's needed to nix is the list of modules

15:03 <sphalerite> no it can't, because that requires A) knowing which modules are going to be there at evaluation time

15:03 <sphalerite> or B) IFD

15:04 <ekleog> that's not the whole dependency tree

15:04 <sphalerite> if nix doesn't know about the dependencies, loading a module with dependencies will be Fun

15:05 <ekleog> it doesn't need to know dependencies at eval-time, only at build-time

15:05 <ekleog> and at build-time it can get them with modinfo

15:05 <sphalerite> yes it does, ebcause the list of outputs needs to be known before the build happens

15:05 <ekleog> again, the list of outputs is not the dependencies

15:06 <sphalerite> so what happens with the dependencies?

15:06 <ekleog> I'll just walk through the idea, because I feel like I can't explain otherwise

15:06 <ekleog> 1. The kernel.nix has a list of ["modulea", "moduleb"]

15:06 <ekleog> 2. this list is set as the list of outputs (plus a “kernel” output and whatever)

15:06 <ekleog> 3. when building the kernel, make a standard build-from-source-to-all-modules

15:07 <ekleog> 4. then, install the kernel to its output, and each module to its output

15:07 <Dezgeg> how do you know what is ["modulea", "moduleb"] going to be?

15:07 <ekleog> 5. and here is the trick, in each module output, add a link to the module outputs of modules it depends on, in nix-build-support or whatever

15:08 <ekleog> this way the dependency structure is automatically generated

15:08 <Dezgeg> I get the idea, but you cannot solve steps 2.-5. without solving step 1.

15:08 <sphalerite> but those module outputs don't exist because they weren't known at instantiation time!

15:08 <ekleog> Dezgeg: please tell me, what part of “yes you need to auto-generate the list of modules to list the outputs, but you don't need the whole dependency structure of the kernel” isn't clear? I've repeated it like three times, so I guess my grammar isn't correct?

15:09 <sphalerite> if each module output has links to its dependency outputs, where are the dependency outputs created?

15:10 <ekleog> you only need the list of modules for that, which I've assumed to be given four times now :)

15:11 <samueldr> ekleog: you assume something akin to the other "generators" in nixpkgs, something like you called Kconfig2nix, right?

15:11 <Dezgeg> that's not feasible to implement without using import-from-derivation

15:11 <ekleog> samueldr: I'm not the one who called it Kconfig2nix, but that's more or less the idea, except you could just trigger a full non-multiple-output kernel build and list the modules from there :)

15:12 * samueldr must have crossed lines while reading

15:12 <ekleog> (and Dezgeg ^ too, we can hardcode the list of outputs if it's auto-generated)

15:12 <Dezgeg> it cannot be hardcoded because it will depend on the kernel version, architecture and config options

15:14 <ekleog> which are all given as arguments… ok so *that* is a convincing argument, thanks! :) I guess it'd be possible to also take as parameter the list of outputs to generate, and if someone customizes their kernel they would anyway have to things like that / could build a non-multiple-output kernel ?

15:15 <ekleog> (and then have the defaults set for linuxPackages.whatever)

15:17 <sphalerite> oooh I thought we were assuming there *wouldn't* be a Kconfig2nix

15:17 <sphalerite> >_<

15:23 <ekleog> \o/

15:23 <ekleog> quid pro quo solved

15:23 <gchristensen> is there a way we could take our existing config and adding a new module to the already compiled one? even if it does take some reduplication

15:24 <sphalerite> still, just copying bits and pieces when you really can't afford to build your own kernel and building your own when you can seems good enough to me :p

15:24 <sphalerite> gchristensen: that's a good question. I'm going to try it. :D

15:24 <sphalerite> I imagine it would be similar to building out-of-tree modules like zfs

15:25 <gchristensen> yeah, then we don't have to solve all this hard stuff but users can stil get some modules

15:25 <sphalerite> (by "try it" I mean give it a quick shot and give up if I fail!)

15:25 <samueldr> I wonder if it'd make sense to snapshot the whole kernel source directory after configuration as a package so you can "hop in and build what you need"

15:25 <ekleog> gchristensen: we have that for eg. wireguard / virtualbox, if I understood your question

15:26 <gchristensen> yeah so if I could cheaply just add the mellanox module, that would make me way happy :)

15:26 <samueldr> (though, I think it'd still need to re-build a bunch)

15:26 <sphalerite> samueldr: that's basically what we have already with the separate configFile derivation I think?

15:27 <samueldr> I should read on that to know

15:27 <ekleog> I think you'd need to rebuild the whole kernel in order to be able to build the mellanox module, though :(

15:27 <gchristensen> oh?

15:28 <ekleog> <sphalerite> still, just copying bits and pieces when you really can't afford to build your own kernel and building your own when you can seems good enough to me :p <-- well, yeah, but it's implementing basically the same logic as multiple-outputs, except everyone must do it themselves and it duplicates stuff on the building machine :)

15:29 <sphalerite> ekleog: not really — I imagine it would be significantly easier to make a generic thing that can make a "trimmed" kernel modules tree from a full one based on modules.dep than to make Kconfig2nix

15:29 <ekleog> gchristensen: when I used gentoo I always build the required with =y and the rest with =n (including module support =n), so I may be wrong here

15:30 <ekleog> sphalerite: well, the Kconfig2nix really only needs to list the modules, if you give me a tick I can make it with a one-liner

15:31 <sphalerite> oh?

15:32 <ekleog> find /run/current-system/kernel-modules/lib/modules/4.14.71/kernel/ -type f | sed 's;^.*/\([^/]*\)\.ko\.xz;\1;'

15:32 <sphalerite> that doesn't work because you need the kernel to be already built

15:33 <ekleog> yeah, the steps would be 1. build the kernel as a monorepo, 2. run this command and generate the kernel.nix, 3. build the kernel and split it into multiple outputs

15:33 <ekleog> potentially using your copying bits of the kernel trick to copy said bits to the multiple-output derivation from the one-output derivation directly on hydra

15:34 <ekleog> (in order to avoid doing two builds)

15:34 <ekleog> strike that, it'd require IFD indeed to be useful, anyway hydra would have to do one build and the contributor of kernel.nix one

15:34 <sphalerite> :|

15:36 <ekleog> now yeah, ideally there'd be IFD… but that world isn't coming

15:37 <gchristensen> a project I work on uses a lot of IFD and I think avoiding it is the right choice

15:38 <gchristensen> it makes for a weird phase of having to build stuff for a while before you know what you can build

15:39 <sphalerite> ekleog: I'd say the best solution is to have a derivation that uses the hydra-built kernel as an input, some function that takes a list of desired modules, and runs through modules.dep to obtain the closure of those modules then copies them into the output

15:40 <sphalerite> small resulting module set, no building a kernel, but you do require the full kernel to be able to build it

15:40 <sphalerite> but AFAICT your suggestion doesn't avoid the latter part either?

15:44 <ekleog> hmm… we can fetch only an output from a derivation from hydra, right?

15:45 <ekleog> so my suggestion would be basically 1. have hydra build the kernel and split it into outputs, 2. the client can ask for just the required outputs and hardlink them into a kernel tree

15:45 <ekleog> the client never needs to touch the whole kenrel

15:47 <sphalerite> yes 2 is easy when you've got 1. The problem is, *how do you get 1*

15:48 <ekleog> well, with the afore-mentioned process of listing outputs on the compiler of the person who did the last kernel update in nixpkgs :)

15:48 <ekleog> s/compiler/computer/

15:48 * ekleog needs to sleep

15:48 <sphalerite> that sounds like it would cause a bootstrap problem

15:48 <sphalerite> because new modules added in a new kernel version won't be taken into account

15:49 <ekleog> yeah, it means people who bump the kernel version need to refresh the module list, like with things like firefox langpacks or the like

15:49 <ekleog> actually firefox langpacks are the exact thing I should be comparing kernel modules with :D

15:51 <ekleog> then… what I say is by no means a priority for anyone I know of, so it's all theoretical speech anyway

15:53 <ekleog> (that said, 'night :))

15:53 <sphalerite> gnight!

15:53 <{^_^}> Night!

16:00 <sphalerite> gchristensen: so I've got just that building in a nix-shell, and written an expression that will hopefully produce the same thing…

16:01 <gchristensen> oooh!

16:03 orivej has quit [Ping timeout: 260 seconds]

16:12 <sphalerite> gchristensen: almost got it, I think

16:14 <sphalerite> gchristensen: https://sphalerite.org/dump/mlx5.nix this *builds*, I'm not sure if it's exactly what you need

16:14 <gchristensen> no idea :D I can try it ~sometime~ but not for the next many hours at least

16:15 <gchristensen> (I'm working from unusual locations for the next few days)

16:15 <Dezgeg> why all this effort instead sticking it to common-config.nix just like the other bajillon options?

16:15 <gchristensen> well.... if there was a nice option to not have to do that, it'd be cool to document it

16:16 <Dezgeg> but now everyone who has this particular card needs to somehow figure it out instead of having it work out of the box

16:16 <sphalerite> oh yeah, build using nix-build mlx5.nix --arg linux '(import <nixpkgs> {}).linux'

16:17 <gchristensen> yeah, this option might be good to enable by default

16:17 <sphalerite> Dezgeg: sure, but having this way of building just one module documented is useful for future things — if someone wants to use module xyz *now* and not have to wait for everything to build after modifying the common one

16:17 <sphalerite> I am in favour of enabling it by default too

16:18 <sphalerite> but if this works, it's nice to have for the purpose of trying out a module quickly

16:18 <sphalerite> in fact, I'll probably use this to build a patched radeon module without building the whole kernel for one of my machines :D

16:18 <sphalerite> once I have that machine again.

16:19 <gchristensen> yeah, I think it is fine to enable by default but if we can make a nice thing for other modules for other niche use cases, it'd be good

16:24 <Dezgeg> well, if this doesn't work for the general case or bitrots in some way, there's a real risk in someone spending half hour of human time to save half hour of compilation time

16:24 jtojnar has joined #nixos-dev

16:26 <gchristensen> yeah I agree, the solution would need to be good and not bitrot

16:27 <gchristensen> ok so andi- pointed out a bunch of builds were deadlocked on hydra. making sure the packet machines are all running 2.1.3 which has a fix for that

16:31 * sphalerite is now testing mlx5.nix with linux and linux_latest across unstable, 18.03 and 16.03 to see how much it might bitrot :p

16:32 <sphalerite> ok, doesn't work on 16.03 because it didn't have overrideAttrs :D

16:35 <gchristensen> ok so andi- pointed out a bunch of builds were deadlocked on hydra. I'm doing a rolling deploy with --force-reboot to each one to solve it.

16:35 <gchristensen> the good news is my monitoring did catch it, next step might be some form of alerting.

16:49 jtojnar has quit [Ping timeout: 252 seconds]

16:50 jtojnar has joined #nixos-dev

17:02 jtojnar has quit [Ping timeout: 252 seconds]

17:03 jtojnar has joined #nixos-dev

17:03 jtojnar has quit [Remote host closed the connection]

17:45 Lisanna has quit [Quit: Lisanna]

18:05 jtojnar has joined #nixos-dev

18:55 orivej has joined #nixos-dev

18:57 matthewbauer has joined #nixos-dev

19:07 matthewbauer has quit [Read error: Connection reset by peer]

19:09 matthewbauer has joined #nixos-dev

19:11 matthewbauer has quit [Remote host closed the connection]

19:16 orivej has quit [Ping timeout: 260 seconds]

19:41 <gchristensen> it would be nice to be able to access the arguments which were first used to import Nixpkgs

19:54 Lisanna has joined #nixos-dev

20:05 <LnL> we could do something similar to pkgs.path

20:18 orivej has joined #nixos-dev

22:34 jtojnar has quit [Remote host closed the connection]

23:01 orivej has quit [Ping timeout: 252 seconds]

23:31 matthewbauer has joined #nixos-dev