#nixos-dev on 2018-08-10

2018-04-19 20:31 gchristensen changed the topic of #nixos-dev to: NixOS Development (#nixos for questions) | https://hydra.nixos.org/jobset/nixos/trunk-combined https://channels.nix.gsc.io/graph.html | 18.03 release managers: fpletz and vcunat | https://logs.nix.samueldr.com/nixos-dev

00:17 Ericson2314 has quit [Ping timeout: 256 seconds]

01:04 Ericson2314 has joined #nixos-dev

01:09 phreedom has quit [Remote host closed the connection]

01:09 phreedom has joined #nixos-dev

01:43 lassulus_ has joined #nixos-dev

01:44 <samueldr> hi, still about #35069, I'm not sure it's big enough to warrant going through staging, but it's still "up to 500 rebuilds"

01:45 <{^_^}> https://github.com/NixOS/nixpkgs/pull/35069 (by NickHu, 24 weeks ago, open): tcl/tk: 8.6.6 -> 8.6.8 and create library symlink

01:45 <samueldr> though am I right in assuming staging workflow is more for "this possibly will break stuff" than a "simpler"(?) patch-level update?

01:45 lassulus has quit [Ping timeout: 240 seconds]

01:45 lassulus_ is now known as lassulus

02:05 phreedom_ has joined #nixos-dev

02:08 phreedom has quit [Ping timeout: 250 seconds]

02:16 orivej has quit [Ping timeout: 240 seconds]

02:23 Drakonis has joined #nixos-dev

02:54 Drakonis has quit [Remote host closed the connection]

04:57 Ericson2314 has quit [Ping timeout: 240 seconds]

05:18 Ericson2314 has joined #nixos-dev

05:28 <Enzime> samueldr: alternatively

05:28 <Enzime> you could branch off their PR branch

05:28 <Enzime> and pull request your changes into their PR branch

05:28 <Enzime> although that's not super clean if the changes aren't additive

05:29 <Enzime> (e.g. you want to change some of their commits)

05:32 <Dezgeg> you could name your commits such that 'git rebase --autosquash' would squash them if the original author were to run that, I guess

05:32 <Enzime> Dezgeg: never heard of that command

05:32 <Enzime> how does that work?

05:33 <Dezgeg> I haven't actually used that personally, but I think I have seen other people use it

05:34 <Dezgeg> basically, if your commit message starts with 'fixup! FOO' then git rebase --autosquash will squash that in the latest commit whose title is FOO

05:44 jtojnar has quit [Quit: jtojnar]

05:45 <Enzime> Dezgeg: apparently you can use git commit --fixup to create such a commit

05:47 <Dezgeg> yes

05:48 <Enzime> Dezgeg: this is cool, thanks for the tip
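
A minimal sketch of the fixup workflow described above, run in a throwaway repository. The commit titles, file name, and identity settings are made up for illustration; `GIT_SEQUENCE_EDITOR=true` is only there so the interactive rebase accepts the generated todo list without opening an editor.

```shell
#!/bin/sh
# Sketch: create a throwaway repo, add a fixup commit, then autosquash it.
set -eu

repo=$(mktemp -d)
cd "$repo"
git init -q .
git config user.email "demo@example.invalid"   # placeholder identity
git config user.name "Demo"

echo one > file.txt
git add file.txt
git commit -q -m "initial commit"

echo two > file.txt
git commit -q -am "tcl: bump version"          # hypothetical commit title

echo three > file.txt
# Creates a commit whose title is "fixup! tcl: bump version".
git commit -q -a --fixup=HEAD

# The rebase reorders the todo list so the fixup is folded into its target.
GIT_SEQUENCE_EDITOR=true git rebase -q -i --autosquash HEAD~2

# Two commits remain; the fixup's change now lives in "tcl: bump version".
git log --format=%s
```

In the PR scenario from the discussion, the fixup commits would be pushed to the contributor's branch, and the squash would only happen if the original author ran the rebase.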

05:49 Ericson2314 has quit [Ping timeout: 265 seconds]

05:56 FRidh has joined #nixos-dev

06:24 phreedom_ has quit [Ping timeout: 250 seconds]

06:24 phreedom has joined #nixos-dev

07:24 <srhb> Anything we can do for trunk-combined? https://github.com/NixOS/nixpkgs/issues/44354 It's been 4 days since we even had an eval: https://hydra.nixos.org/jobset/nixos/trunk-combined

07:24 <{^_^}> #44354 (by xeji, 1 week ago, open): Hydra evals of nixpkgs:trunk fail (heap size error)

07:37 ma27 has joined #nixos-dev

07:52 __Sander__ has joined #nixos-dev

08:54 __Sander__ has quit [Ping timeout: 248 seconds]

08:54 __Sander__ has joined #nixos-dev

09:21 ma27 has quit [Quit: WeeChat 2.1]

09:28 xeji has joined #nixos-dev

10:27 orivej has joined #nixos-dev

10:51 lassulus has quit [Ping timeout: 256 seconds]

11:02 init_6 has joined #nixos-dev

11:05 lassulus has joined #nixos-dev

11:10 <infinisil> Now I'm worried about merging PRs that add new packages

11:12 init_6 has quit [Ping timeout: 260 seconds]

11:14 <woffs> Everyone could delete ~20 old/unmaintained/broken packages

11:18 <srhb> infinisil: It's been running for two hours now, fingers crossed I guess.

11:18 <LnL> I wonder if we should split language specific sets into their own jobset, instead of doing everything at the same time

11:19 <srhb> LnL: It might be necessary. it sucks to split it up wrt. testing that might want to use things, but yeah...

11:20 <LnL> some sets like perlPackages/rPackages are disabled at the moment because it would blow up evaluation together with what we already have

11:21 <LnL> but that means a bunch of stuff in there is never built AFAIK

11:21 init_6 has joined #nixos-dev

11:22 <infinisil> Not a durable solution unfortunately

11:22 <infinisil> Isn't there this low-memory nix branch? Could hydra use that to make it work?

11:22 <LnL> also the fact that haskellPackages is handled specially in release.nix is a bit weird

11:23 <infinisil> Oh that branch is only reducing memory for copying closures i think

11:24 <infinisil> LnL: This is such that they get built by hydra, but not added to the nix-env -q listing

11:25 <LnL> my point is that we should either make release.nix build _all_ language sets or none

11:27 <LnL> and separating the jobsets into haskell-trunk, python-trunk, etc. would reduce the maximum memory usage of the jobsets significantly

11:30 <clever> LnL: the channel scripts would have to be modified, to wait for several different evals on the same rev to all finish and pass

11:31 <gchristensen> The error message in that issue isn't a failure IIRC, see: https://github.com/NixOS/hydra/commit/0882519b108e8549ae19cac558888d81ff062893 it trades memory for time.

11:31 <LnL> yeah, I'm pretty sure you can propagate revisions to other jobsets but ideally channel updates would wait for everything

11:31 <LnL> and doing this has other disadvantages

11:32 <srhb> gchristensen: It will eventually fail with the regular error, that's what's happened the past few days.

11:33 <LnL> splitting the other way, by platform, would also reduce the memory footprint

11:35 <gchristensen> niksnut: have you seen these failures due to memory problems?

11:35 <gchristensen> new since last 8 days

11:43 <srhb> gchristensen: The latest error message is a timeout though. I wonder if that means the evaluator has some other limitation that could just be bumped?

11:43 <srhb> I don't recall any...

11:46 <niksnut> I don't know what causes the timeout. The hydra-evaluator logs don't show anything.

11:47 <niksnut> the queue runner crashes seem to be fixed

11:53 <srhb> (my $res, my $jobsJSON, my $stderr) = captureStdoutStderr(21600, @cmd); die "$evaluator returned " . ($res & 127 ? "signal $res" : "exit code " . ($res >> 8))

11:53 <srhb> 21600 = 6 hours, does that match?
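
As a rough illustration of the arithmetic in the quoted Perl line: the first argument is a timeout in seconds, and the status word is split into a signal part (low 7 bits) and an exit code (high byte). The function name below is made up, and unlike the Perl expression, which prints the whole status word in the signal case, this sketch prints only the masked signal number.

```shell
#!/bin/sh
# Decode a wait-style status word: if the low 7 bits are non-zero the process
# was killed by a signal, otherwise the exit code sits in the high byte.
decode_status() {
    res=$1
    if [ $((res & 127)) -ne 0 ]; then
        echo "signal $((res & 127))"
    else
        echo "exit code $((res >> 8))"
    fi
}

timeout_seconds=21600
echo "timeout: $((timeout_seconds / 3600)) hours"   # prints "timeout: 6 hours"

decode_status 256   # prints "exit code 1"
decode_status 9     # prints "signal 9"
```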

11:55 <srhb> Oh, is the perl stuff even running that anymore...

12:03 <vdemeester> gchristensen: any "ways" to help getting https://github.com/NixOS/nixpkgs/pull/34402 in ? :P

12:03 <{^_^}> #34402 (by vdemeester, 27 weeks ago, open): Add a containerd module

12:04 <gchristensen> why do execstart's have this weird "" in the first element? https://github.com/NixOS/nixpkgs/pull/34402/files#diff-4b179df618c4fe650294784e73a057e9R66

12:05 <gchristensen> some places seem to be copy-pastaing it around, unless there is a secret reason I don't know about

12:05 <aminechikhaoui> probably to override the execstart from the upstream pkg ?

12:06 <Dezgeg> yes, if that unit is overriding some other unit with the same name, that overrides the ExecStart field instead of appending to ExecStart

12:06 <gchristensen> that is a weird thing

12:06 <gchristensen> why is this the way to do that?

12:09 <aminechikhaoui> Note that for drop-in files, if one wants to remove entries from a setting that is parsed as a list (and is not a dependency), such as AssertPathExists= (or e.g. ExecStart= in service units), one needs to first clear the list before re-adding all entries except the one that is to be removed. Dependencies (After=, etc.) cannot be reset to an empty list, so dependencies can only be added in

12:09 <aminechikhaoui> drop-ins. If you want to remove dependencies, you have to override the entire unit.

12:09 <aminechikhaoui> from https://www.freedesktop.org/software/systemd/man/systemd.unit.html
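
The clear-then-set pattern from the quoted man page can be sketched as a drop-in file. The unit name and binary path are hypothetical, and the file is written to a temporary directory rather than a real systemd drop-in directory. The empty first list element discussed for the NixOS module serves the same purpose as the empty `ExecStart=` line here.

```shell
#!/bin/sh
# Sketch: write a drop-in that replaces ExecStart instead of appending to it.
set -eu

unit=example.service                     # hypothetical unit name
dropin_dir=$(mktemp -d)/$unit.d          # stand-in for the real drop-in directory
mkdir -p "$dropin_dir"

cat > "$dropin_dir/override.conf" <<'EOF'
[Service]
# The empty assignment clears the list inherited from the original unit.
ExecStart=
# This entry is then the only ExecStart left.
ExecStart=/usr/bin/example-daemon --foreground
EOF

cat "$dropin_dir/override.conf"
```

Without the empty assignment, the second `ExecStart=` would be added to the entries of the unit being overridden rather than replacing them.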

12:11 orivej has quit [Ping timeout: 240 seconds]

12:16 <aminechikhaoui> gchristensen: also in that PR systemd.packages = [ pkgs.containerd ]; makes the unit use the upstream unit definition i.e from ${pkg}/lib/systemd afaik so you don't have control over that from nix

12:29 globin has quit [Ping timeout: 260 seconds]

12:30 <niksnut> hm, looks like hydra-eval-jobs is hanging somewhere in a boehm GC waiting on a futex

12:30 <niksnut> never seen that before

12:30 <niksnut> maybe boehm doesn't like fork

12:33 <Dezgeg> any backtrace?

12:37 <gchristensen> aminechikhaoui: weird :)

12:37 <gchristensen> aminechikhaoui: thank you a lot

12:37 <aminechikhaoui> yeah systemd can be weird sometimes :D

12:42 Lisanna has quit [Ping timeout: 260 seconds]

13:08 ma27 has joined #nixos-dev

13:10 ma27 has quit [Client Quit]

13:25 <gchristensen> domenkozar: ping? where is the nix-support located in Travis's repos?

13:25 orivej has joined #nixos-dev

13:27 <vdemeester> gchristensen, aminechikhaoui : yeah I took inspiration in other unit files (mainly docker's one)

13:28 <vdemeester> (thanks a lot for the explanation :) )

13:29 Drakonis has joined #nixos-dev

13:38 <domenkozar> gchristensen: I just google it

13:38 <domenkozar> matthewbauer travis nix github :P

13:38 orivej has quit [Ping timeout: 256 seconds]

13:38 <gchristensen> domenkozar: oh, haha, cool

13:38 <gchristensen> we should (very soon) update the travis CI thing to use a pinned version of the nix installer

13:38 <gchristensen> so that when 2.1 comes out, it doesn't break

13:38 <gchristensen> back in ~30

13:39 <domenkozar> will 2.1 break?

13:47 <LnL> how would it break?

13:52 <LnL> I know builtin:buildenv changed, but that doesn't matter for travis

13:52 <gchristensen> the installer is defaulting to multi-user if possible, and the current system assumes linux means single-user

13:52 <gchristensen> https://github.com/travis-ci/travis-build/blob/136521f90d737245ed0b74f73b85d311adf0e74f/lib/travis/build/script/nix.rb#L46

13:55 <LnL> doesn't it run a container? I doubt multi-user would just work on travis

13:55 <gchristensen> not always, with sudo it installs in a VM

14:04 orivej has joined #nixos-dev

14:07 init_6 has quit [Ping timeout: 244 seconds]

14:09 init_6 has joined #nixos-dev

14:16 <gchristensen> w00t I got approval to hack on a long-term project I've been wanting to do, of making an automatic test matrix for the nix installer

14:21 <infinisil> \o/

14:27 ma27 has joined #nixos-dev

14:38 orivej has quit [Remote host closed the connection]

14:38 <domenkozar> nice :)

14:39 orivej has joined #nixos-dev

14:44 init_6 has quit [Ping timeout: 244 seconds]

14:44 ma27 has quit [Remote host closed the connection]

14:56 <LnL> gchristensen: nice!

14:56 <gchristensen> stand by for a thing to read, about what I want to do

14:57 <clever> gchristensen: ive also discovered that it will detect sudo in your script, and silently switch you to the vm, even though you didnt ask for sudo

14:58 <gchristensen> neat

14:58 <clever> gchristensen: and the vm has a different version of ubuntu, with different files in /etc, that can break nixops hard

14:58 <clever> nixops has since been fixed, but its something to beware of

14:58 <clever> so simply adding `sudo echo` to your travis script, causes nixops to fail

14:59 Ericson2314 has joined #nixos-dev

15:04 <gchristensen> LnL, domenkozar, infinisil, https://paper.dropbox.com/doc/Nix-Test-Matrix--AJ0Y2WVM~TgZRLcd4Bw8oF~kAQ-pQZ28yuNgFL03CxIIXzeo any feedback?

15:08 <infinisil> "This shouldn’t be written in bash." haha nice

15:09 <gchristensen> that is a little reminder to me not to start with bash )

15:09 <gchristensen> :)

15:11 <infinisil> gchristensen: Shouldn't multi-user be an option one gets to choose? Or is this just whether multi-user would be supported?

15:11 <domenkozar> why not? if you use nix+bash then it's easy to a) provision the thing b) execute

15:11 <domenkozar> I think you only need to separate a from b

15:11 <domenkozar> and then a can be something non-bashy

15:12 <gchristensen> infinisil: multi-user is a result of the test, so you could imagine a "debian jessie, default options" "debian jessie, multi-user forced", "debian jessie, single-user forced"

15:12 <infinisil> I see

15:12 <gchristensen> domenkozar: because I'm too good at starting with bash and then sticking with it far too long and giving up because I just have a pile of soupy bash

15:13 * infinisil is with gchristensen on that one

15:13 <domenkozar> I give up regardless of bashing :)

15:15 <infinisil> gchristensen: You may want to incorporate "correct failure messages" as well somehow. So when a system has way too little RAM or so, the Nix install errors out correctly, which should still count as a success

15:16 <gchristensen> hmm yeah

15:16 <domenkozar> infinisil: that's like v3 to me

15:17 <domenkozar> right now we'd prefer release of Nix doesn't take internet down

15:17 <domenkozar> on 64GB ram machine

15:17 <infinisil> gchristensen: Also I feel like a binary result is a bit limiting. You could also have results of how long the installation took, or even per stage

15:17 <gchristensen> I don't think anything here indicates the results are binary

15:17 <gchristensen> other than the current example results are... :)

15:18 <infinisil> Ah I see

15:18 <infinisil> Right :)

15:18 <domenkozar> gchristensen: btw this could be a project that rewrites testing framework to use libvirtd

15:18 <gchristensen> oh my word

15:18 <Dezgeg> doesn't sound necessary

15:19 <domenkozar> can you spawn macos with qemu?

15:19 <gchristensen> you can

15:19 <domenkozar> yeah then you probably can use qemu for all of it

15:19 <domenkozar> carry on

15:20 <gchristensen> I dunno, v0.1 might use vagrant

15:20 <Dezgeg> vagrant inside nix-build sounds pain

15:20 <domenkozar> I think if testing framework was a bit more composable, it could be used here

15:20 <gchristensen> it won't be inside a nix-build

15:20 <Dezgeg> but one idea that I had for this kind of testing is to use the cloud images of various distros that run cloud-init

15:20 <gchristensen> the tests need to access the internet

15:21 <Dezgeg> which means you get root execution by passing a suitably formatted emulated cd drive that cloud-init knows how to parse

15:22 <infinisil> gchristensen: In the end it would be cool to have a 3D matrix of how this 2D table's results changed over time with the commits :P

15:22 <gchristensen> agreed!

15:22 * infinisil tries to think of another dimension to add

15:23 <infinisil> This could also use some systems that are in a purposefully bad state

15:23 <gchristensen> definitely yes!

15:23 <Dezgeg> wouldn't probably be too hard to also integrate this with the current nixos test runner... I can't remember if I attempted that at some point

15:24 <gchristensen> Dezgeg: I put that under "future requirements"

15:24 woffs has left #nixos-dev [#nixos-dev]

15:27 <Dezgeg> apparently I did make some progress on that... need to re-test that

15:41 <gchristensen> ok all this feedback has been really helpful, thank you all :) I'll definitely consider using the existing test framework for it, but I won't commit to it :)

16:07 <domenkozar> unless somebody objects, I'm giving commit access to timokau[m]

16:24 __Sander__ has quit [Quit: Konversation terminated!]

16:39 <domenkozar> m'kay

16:41 <domenkozar> https://github.com/NixOS/nixpkgs/issues/42053

16:41 <{^_^}> #42053 (by vorot93, 8 weeks ago, open): Desktop keeps suspending on unstable

16:41 <domenkozar> heh

16:49 MichaelRaskin has joined #nixos-dev

16:58 cransom has quit [Quit: WeeChat 2.0]

16:58 cransom has joined #nixos-dev

18:12 xeji_ has joined #nixos-dev

18:12 xeji has quit [Ping timeout: 256 seconds]

18:26 ma27 has joined #nixos-dev

18:46 FRidh has quit [Quit: Konversation terminated!]

19:00 orivej has quit [Ping timeout: 240 seconds]

19:19 genesis has quit [Ping timeout: 245 seconds]

19:22 <niksnut> oh man, I just noticed how complex generic/make-derivation.nix has become

19:22 <niksnut> no wonder nixpkgs evaluation is becoming so slow

19:22 <gchristensen> all the cross stuff?

19:26 <LnL> I've wondered before if stuff like the duplicate entries we've seen recently has impact on that

19:29 <niksnut> gchristensen: not just, also computation of attributes like 'dependencies' (whereas we used to just pass through attributes like buildInputs directly)

19:29 <niksnut> also, lots of calls to lib.unique, which has O(n^2) complexity

19:29 <gchristensen> oh dear

19:29 <niksnut> maps of maps

19:30 <niksnut> many list concatenations

19:30 <MichaelRaskin> I thought there was some builtin added that allowed reasonable uniq…

19:31 <LnL> hmm, thought most of that happened at runtime

19:48 <gchristensen> niksnut: how do you dig around the evaluation of nixpkgs to find performance problems?

19:50 <niksnut> we don't really have good tools to do that

19:51 <gchristensen> my novice brain thinks it'd be neat to emit data sufficient for us to generate flamegraphs

19:55 <clever> ,profiling

19:55 <{^_^}> Use NIX_COUNT_CALLS=1 and/or NIX_SHOW_STATS=1 to profile Nix evaluation

19:55 <clever> gchristensen: ive mostly used these, and just brute-force
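
A small sketch of how the two variables from the factoid might be used. The expression is a placeholder and the function name is made up; for nixpkgs one would evaluate an attribute of interest instead. The exact format and destination of the statistics depend on the Nix version.

```shell
#!/bin/sh
# Sketch: evaluate an expression with Nix's evaluation statistics enabled.

expr='1 + 1'   # placeholder expression, not taken from the discussion

profile_eval() {
    if command -v nix-instantiate >/dev/null 2>&1; then
        # Both variables are read by the evaluator from the environment.
        NIX_SHOW_STATS=1 NIX_COUNT_CALLS=1 nix-instantiate --eval --expr "$1"
    else
        echo "nix-instantiate not found; skipping" >&2
    fi
}

profile_eval "$expr"
```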

19:56 <clever> at one point, i discovered that the nixops made by nixops's release.nix via IFD was adding over a gig to the nixops deployment, because it was being imported by every host in the cluster

19:56 <gchristensen> oh dear

19:59 Drakonis has quit [Remote host closed the connection]

20:02 <MichaelRaskin> Isn't there deduplication?

20:13 phreedom has quit [Ping timeout: 250 seconds]

20:14 phreedom has joined #nixos-dev

20:18 Ericson2314 has quit [Ping timeout: 240 seconds]

20:20 Ericson2314 has joined #nixos-dev

20:26 orivej has joined #nixos-dev

20:45 xeji_ has quit [Quit: WeeChat 2.0]

21:01 <gchristensen> is qemu as exhausting to everyone else as it is to me?

21:02 <clever> gchristensen: i dont really have any issues with it

21:02 <clever> and ive read over its command line args page 2 or 3 times, in full

21:02 <gchristensen> that explains it :P

21:26 xeji has joined #nixos-dev

22:13 xeji has quit [Quit: WeeChat 2.0]

22:15 <aszlig> gchristensen: what's so exhausting?

22:40 <samueldr> aszlig: oh, thanks for the fix, I must have failed at looking it up :/ I'll keep in mind to try even more

22:41 <aszlig> samueldr: yeah, although i don't really feel comfortable using read for the passphrase :-/

22:42 <aszlig> samueldr: because this can leave artifacts of it in memory, as commented on that PR

22:43 <aszlig> let alone rm'ing it from a ramfs

22:45 <samueldr> hm, maybe I conflated the "leaky read" issue with the memory safety issue when re-reading

22:48 sir_guy_carleton has quit [Quit: WeeChat 2.0]

23:00 Lisanna has joined #nixos-dev

23:13 <gchristensen> aszlig: I don't know what I'm doing and don't know what I don't know, mostly

23:16 init_6 has joined #nixos-dev

23:23 <aszlig> gchristensen: what are you trying to do then?

23:25 <gchristensen> aszlig: I want to boot an arbitrary qcow2 file (specifically, a vmdk of debian jessie which I converted with qemu-img convert -f vmdk -O qcow2 ./the.vmdk $out) , and am getting nothing after "Booting from Hard Disk..." . I want to have stdio be the console of the vm

23:26 <aszlig> gchristensen: the image is configured so it sends output to virtio-serial, right?

23:27 <gchristensen> I hope so :) but maybe not, I'll mount it and look around. this is my run command: qemu-system-x86_64 -m 2G -smp 2 -enable-kvm -nographic -serial mon:stdio -no-reboot -vga none -net none ./image.qcow2

23:28 <clever> gchristensen: what about -curses ?

23:28 <aszlig> -curses would also work, if you don't have serial output

23:29 <aszlig> qemu then tries to graphically match the font

23:30 <aszlig> otherwise, try -serial pty

23:31 <gchristensen> (looking in to -curses)

23:31 <aszlig> or remove the mon:

23:31 <aszlig> (except if you want to have direct access to it)

23:33 <aszlig> s/it/the monitor/

23:33 <gchristensen> I'd rather avoid -curses, I don't need the monitor

23:34 <clever> gchristensen: do you know if serial console is actually enabled?

23:34 <aszlig> gchristensen: okay, then -serial pty or stdio should be fine

23:34 <aszlig> clever: 01:27 < gchristensen> I hope so :)

23:35 <gchristensen> I'll mount it and look
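
The two commands from this exchange, collected into a dry-run sketch. The file names are placeholders and the `run` helper is made up; the flags are the ones quoted above. Whether anything appears on the terminal still depends on the guest having a serial console enabled (for Linux guests, typically a `console=ttyS0` kernel parameter), which is the open question in the discussion.

```shell
#!/bin/sh
# Sketch: convert a VMDK to qcow2 and boot it with the serial port on stdio.
set -eu

src=${1:-the.vmdk}       # placeholder input path
img=${2:-image.qcow2}    # placeholder output path

run() {
    # Print the command; execute it only when RUN=1 is set in the environment.
    echo "+ $*"
    if [ "${RUN:-0}" = 1 ]; then "$@"; fi
}

run qemu-img convert -f vmdk -O qcow2 "$src" "$img"

# -nographic disables the graphical window; -serial mon:stdio multiplexes the
# guest's first serial port and the QEMU monitor onto this terminal.
run qemu-system-x86_64 -m 2G -smp 2 -enable-kvm -nographic \
    -serial mon:stdio -no-reboot -vga none -net none "$img"
```

By default the script only prints the commands; running it with `RUN=1` executes them.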

23:36 <gchristensen> back in ~20. fwiw I'm unpacking the image from https://vagrantcloud.com/debian/boxes/jessie64/versions/8.11.0/providers/virtualbox.box (which is just a tarball)

23:36 <samueldr> for automated (r-ryantm mostly) updates that are "blocked", for various reasons, is there something we could do to mark them, are there appropriate tags we should use?

23:37 <samueldr> some of them will be fixed in due time (regressions in dependent software, regressions in upstream)

23:37 <samueldr> some of them could mean platform-dependent changes need fixes (darwin mostly)

23:38 <samueldr> oh, I'd instinctively assume anything vagrant won't have serial, as it's expected to have ssh

23:39 <samueldr> gchristensen: you could be interested in this http://libguestfs.org/virt-builder.1.html

23:40 <samueldr> I successfully used it in the past to automate configuring and building system images for qemu usage

23:49 init_6 has quit [Ping timeout: 240 seconds]

23:53 Ericson2314 has quit [Ping timeout: 240 seconds]

23:56 init_6 has joined #nixos-dev