#nixos-dev on 2021-02-01

2020-10-29 21:47 worldofpeace changed the topic of #nixos-dev to: NixOS Development (#nixos for questions) | NixOS 20.09 Nightingale ✨ https://discourse.nixos.org/t/nixos-20-09-release/9668 | https://hydra.nixos.org/jobset/nixos/trunk-combined https://channels.nix.gsc.io/graph.html | https://r13y.com | 20.09 RMs: worldofpeace, jonringer | https://logs.nix.samueldr.com/nixos-dev

00:01 __monty__ has quit [Quit: leaving]

00:08 abathur has quit [Quit: abathur]

00:09 abathur has joined #nixos-dev

00:38 supersandro2000 has quit [Disconnected by services]

00:38 supersandro2000 has joined #nixos-dev

00:47 rajivr has joined #nixos-dev

01:08 <gchristensen> it would be cool if error messages had a check list

01:08 <gchristensen> like: The option value `networking.hostName' in `/home/grahamc/projects/github.com/grahamc/network/flexo/hardware.nix' is not of type `string matching the pattern ^$|^[[:alnum:]]([[:alnum:]_-]{0,61}[[:alnum:]])?$'.

01:08 <gchristensen> okay but how do I fix it

01:09 <cole-h> just make it match that pattern ;)

01:09 <cole-h> (sorry)
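
A minimal sketch of what a "check list" for this particular error could look like, assuming the POSIX class `[[:alnum:]]` can be read as ASCII letters and digits (Python's `re` module has no POSIX classes, so the pattern is translated by hand). The function name `hostname_problems` and the wording of the messages are hypothetical and not part of the NixOS module system.

```python
import re

# ASCII translation of the pattern quoted in the error message.
# Assumption: [[:alnum:]] is treated as [A-Za-z0-9].
HOSTNAME_RE = re.compile(r"|[A-Za-z0-9](?:[A-Za-z0-9_-]{0,61}[A-Za-z0-9])?")


def _is_alnum(c):
    return c.isascii() and c.isalnum()


def hostname_problems(name):
    """Return human-readable reasons why `name` fails the pattern (empty list if it passes)."""
    problems = []
    if len(name) > 63:
        problems.append(f"too long: {len(name)} characters, at most 63 allowed")
    bad = sorted({c for c in name if not (_is_alnum(c) or c in "_-")})
    if bad:
        problems.append(f"disallowed characters: {''.join(bad)!r}")
    if name and not _is_alnum(name[0]):
        problems.append("must start with a letter or digit")
    if name and not _is_alnum(name[-1]):
        problems.append("must end with a letter or digit")
    return problems


for candidate in ["flexo", "flexo.gsc.io", "-flexo"]:
    print(candidate, hostname_problems(candidate))
```

The list is empty exactly when the translated pattern matches, so a message built from it would say what to change rather than only quoting the regex.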

01:09 <gchristensen> also, is https://search.nixos.org/options supposed to display a "Q"?

01:10 <gchristensen> https://search.nixos.org/options?channel=20.09&show=networking.hostName&from=0&size=50&sort=relevance&query=hostName the option description also doesn't indicate what I should use to fix it

01:13 <gchristensen> compare that error to - The option definition `security.acme.certs.flexo.gsc.io.allowKeysForGroup' no longer has any effect; Please remove it.All certs are readable by the configured group. If this is undesired,consider changing security.acme.certs.flexo.gsc.io.group to an unused group. which doesn't mention where I set this option

01:14 <gchristensen> I'm not intending on griping here, just noting that some error messages are amazing, and then others vary significantly

01:14 <gchristensen> and am wondering what a check-list would look like

01:32 <samueldr> gchristensen: it's not a Q, it's a thin and weird magnifier AFAIK

01:32 <gchristensen> oh :)

01:32 <samueldr> but it does have the Q vibe

01:32 <gchristensen> brb

01:34 gchristensen has joined #nixos-dev

01:34 {^_^} has joined #nixos-dev

01:41 <abathur> like regex, but it's invalid without matching examples, and requires one distinct non-overlapping match per character of regex?

01:42 <abathur> maybe ignoring comments

01:42 mkaito has quit [Quit: WeeChat 3.0]

01:47 gchristensen has quit [Quit: WeeChat 2.9]

01:47 {^_^} has quit [Remote host closed the connection]

01:49 {^_^} has joined #nixos-dev

01:49 gchristensen has joined #nixos-dev

02:54 <siraben> xdg_utils → xdg-utils #111519

02:54 <{^_^}> https://github.com/NixOS/nixpkgs/pull/111519 (by siraben, 1 minute ago, open): treewide: xdg_utils -> xdg-utils

03:38 <{^_^}> firing: RootPartitionLowDiskSpace: https://monitoring.nixos.org/prometheus/alerts

03:39 <energizer> now that's some impressive chatops

03:57 v0|d has quit [Ping timeout: 272 seconds]

04:44 jonringer has joined #nixos-dev

04:56 kalbasit__ has joined #nixos-dev

05:11 euank has quit [Quit: ZNC - http://znc.in]

05:16 kalbasit___ has joined #nixos-dev

05:17 ek___ has joined #nixos-dev

05:18 ek___ is now known as euank

05:18 kalbasit__ has quit [Ping timeout: 240 seconds]

05:18 jonringer has quit [Remote host closed the connection]

05:26 kalbasit___ has quit [Ping timeout: 240 seconds]

06:03 stolyaroleh_ has joined #nixos-dev

06:47 georgyo has joined #nixos-dev

07:04 orivej has joined #nixos-dev

07:10 saschagrunert has joined #nixos-dev

07:22 orivej has quit [Ping timeout: 256 seconds]

07:23 orivej has joined #nixos-dev

07:36 tilpner_ has joined #nixos-dev

07:37 tilpner has quit [Ping timeout: 246 seconds]

07:37 tilpner_ is now known as tilpner

07:43 <{^_^}> firing: RootPartitionLowDiskSpace: https://monitoring.nixos.org/prometheus/alerts

07:47 <supersandro2000> siraben: thanks!

07:58 teto has quit [Quit: WeeChat 3.0]

08:00 tilpner_ has joined #nixos-dev

08:01 tilpner has quit [Ping timeout: 256 seconds]

08:01 tilpner_ is now known as tilpner

08:06 tilpner has quit [Remote host closed the connection]

08:06 <siraben> supersandro2000: should we alias xdg_utils to xdg-utils?

08:06 <siraben> or just move over completely?

08:06 tilpner has joined #nixos-dev

08:06 <siraben> Profpatsch: how difficult would it be to do an unused inputs analysis across Nixpkgs?

08:07 supersandro2000 has quit [Quit: The Lounge - https://thelounge.chat]

08:07 supersandro2000 has joined #nixos-dev

08:17 <Profpatsch> siraben: uh, not an easy question

08:18 <Profpatsch> I don’t think we have a tool that checks that, and I don’t know how hard it is to build a tool like that

08:18 <Profpatsch> there might be some dynamic parts of nix that make this check non-trivial

08:18 <Profpatsch> and you also have the let/with shadowing rules

08:19 <Profpatsch> But if you want to find a very conservative subset, probably not all that hard

08:38 cole-h has quit [Ping timeout: 256 seconds]

09:06 AlwaysLivid has joined #nixos-dev

09:23 dstzd has quit [Quit: ZNC - https://znc.in]

09:30 <eyJhb> Anyone up to review this? https://github.com/NixOS/nixpkgs/pull/110404 It is not done, as there needs to be some descriptions etc. but wanted to know if the basis for it is OK

09:30 <{^_^}> #110404 (by eyJhb, 1 week ago, open): WIP: module mautrix-* new service to handle all mautrix services

09:37 ScottHDev5 has quit [Quit: Ping timeout (120 seconds)]

09:38 ScottHDev5 has joined #nixos-dev

09:40 <siraben> Profpatsch: I see. I might learn how to use hnix and take a stab at it

09:41 <siraben> Or perform the redundant rec removal again and so on

09:42 <Profpatsch> siraben: I guess with hnix you will have the same problem again, that it can’t do source spans for non-exprs

09:42 <Profpatsch> that is you will be able to find out which vars are unused, but then you have no way to highlight the var that is unused

09:43 <Profpatsch> unless you do hacks like ad-hoc parsing the source spans

09:43 <Profpatsch> siraben: we merged the tree-sitter-nix grammar into nixpkgs recently

09:44 <Profpatsch> Since you don’t need inter-file checks for this, you might as well just use tree-sitter and then implement a conservative subset of variable scoping

09:45 <siraben> I see. How hard is it to use tree-sitter?

09:45 <Profpatsch> collecting free variables recursively and removing them on each introduction site, collecting the ones which are introduced but not in the collection
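
The approach Profpatsch describes can be sketched on a toy syntax tree. This is an illustration only: the tuple-based node shapes (`var`, `int`, `add`, `lambda`, `let`) are hypothetical and are not the tree-sitter-nix node types, and the sketch ignores `with`, `inherit`, and dynamic attribute names entirely.

```python
def analyze(node):
    """Return (free variables of node, names introduced inside node but never used)."""
    kind = node[0]
    if kind == "var":                      # ("var", name)
        return {node[1]}, []
    if kind == "int":                      # ("int", value)
        return set(), []
    if kind == "add":                      # ("add", left, right)
        left_free, left_unused = analyze(node[1])
        right_free, right_unused = analyze(node[2])
        return left_free | right_free, left_unused + right_unused
    if kind == "lambda":                   # ("lambda", [param names], body), like {a, b}: body
        free, unused = analyze(node[2])
        unused = unused + [p for p in node[1] if p not in free]
        return free - set(node[1]), unused
    if kind == "let":                      # ("let", {name: expr}, body); bindings see each other
        free, unused = analyze(node[2])
        for expr in node[1].values():
            expr_free, expr_unused = analyze(expr)
            free = free | expr_free
            unused = unused + expr_unused
        unused = unused + [n for n in node[1] if n not in free]
        return free - set(node[1]), unused
    raise ValueError(f"unknown node kind: {kind}")


# ({v, w}: v + 1) never uses w
expr = ("lambda", ["v", "w"], ("add", ("var", "v"), ("int", 1)))
print(analyze(expr))
```

The sketch is deliberately conservative: a `let` binding referenced only by another unused binding still counts as used, so it can miss unused names but should not flag a name that is referenced somewhere in its scope.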

09:45 __monty__ has joined #nixos-dev

09:46 <Profpatsch> siraben: here’s a setup https://code.tvl.fyi/tree/users/Profpatsch/tree-sitter.nix?id=9cdb10adc8ce78de593436a8347cfa0c97d53bb7#n9

09:47 <Profpatsch> here’s the grammar: https://github.com/cstrahan/tree-sitter-nix/blob/master/grammar.js

09:48 <siraben> are there haskell bindings for tree-sitter?

09:48 <siraben> I'm not familiar with tree-sitter, so this is a parser generator?

09:48 <siraben> And when I traverse the source AST, I can make it pretty-print the AST again with a small diff

09:48 <siraben> ?

09:49 <Profpatsch> siraben: https://tree-sitter.github.io/tree-sitter/

09:57 <siraben> Hm, ok.

09:59 <siraben> Profpatsch: Ah I see, since these are concrete syntax trees they contain all the information needed to print the program out again

10:00 <siraben> What do you mean that hnix cannot do source spans for non-exprs?

10:03 <Profpatsch> siraben: https://hackage.haskell.org/package/hnix-0.12.0.1/docs/Nix-Expr-Types-Annotated.html#t:NExprLocF

10:03 <Profpatsch> e.g. try getting the source span of the function variable v in ({v}: v + 1) with hnix

10:03 <Profpatsch> something like 1:3

10:04 <Profpatsch> there’s only a source span for v + 1 and the whole thing

10:04 <siraben> ah

10:04 <Profpatsch> NExprF doesn’t contain any: https://hackage.haskell.org/package/hnix-0.12.0.1/docs/Nix-Expr-Types.html#t:NExprF

10:04 <Profpatsch> there’s only source spans around `r`s

10:05 <__monty__> Isn't the SrcSpan of the function also that of the parameters?

10:06 <siraben> Profpatsch: Right as seen in `(Compose (Ann SrcSpan) NExprF r)`

10:07 <siraben> Hm that makes it seem that the entire `NExprF` has a span though

10:07 <siraben> ah it's a functor waiting to be fixed, ok.

10:07 <Profpatsch> However! It could be possible to create a parser which adds the conceret syntax to every r

10:08 <Profpatsch> *concrete

10:08 <Profpatsch> instead of just a plain source span

10:08 <siraben> Profpatsch: i'm confused about that compose, wouldn't fixing that imply that we have source spans for every subtree of the syntax as well?

10:08 <Profpatsch> I don’t know how much work that would be though

10:08 <Profpatsch> siraben: it does

10:09 <siraben> So `({v}: v + 1)` how come we don't have the span of v and 1 in the body?

10:09 <Profpatsch> There are

10:09 <siraben> but the entire addition expression v + 1 instead?

10:09 <Profpatsch> I didn’t list all of them

10:09 <Profpatsch> Every level has a src span at every `r`

10:09 <siraben> Oh but for the v in the actual function arg list

10:09 <siraben> there is no span

10:09 <Profpatsch> yes

10:11 <Profpatsch> I still feel like advancing tools working with nix-tree-sitter is more worthwhile than improving hnix to do these things

10:22 <__monty__> Neither uses the Trees that Grow approach, do they?

10:33 AlwaysLivid has quit [Remote host closed the connection]

10:33 AlwaysLivid has joined #nixos-dev

10:44 tilpner_ has joined #nixos-dev

10:46 tilpner has quit [Ping timeout: 264 seconds]

10:46 tilpner_ is now known as tilpner

10:54 tilpner has quit [Ping timeout: 258 seconds]

10:56 tilpner has joined #nixos-dev

11:08 tilpner has quit [Remote host closed the connection]

11:08 tilpner has joined #nixos-dev

11:11 flokli has joined #nixos-dev

11:27 pinpox has quit [Quit: The Lounge - https://thelounge.chat]

11:47 AlwaysLivid has quit [Remote host closed the connection]

11:47 AlwaysLivid has joined #nixos-dev

11:48 <{^_^}> firing: RootPartitionLowDiskSpace: https://monitoring.nixos.org/prometheus/alerts

12:06 orivej has quit [Ping timeout: 256 seconds]

12:43 <jtojnar> siraben: Yeah, we need an alias for out-of-tree references.

12:43 <siraben> Ok then

12:55 __monty__ has quit [Ping timeout: 260 seconds]

12:56 __monty__ has joined #nixos-dev

13:18 mkaito has joined #nixos-dev

13:18 mkaito has quit [Changing host]

13:22 <siraben> Aliases added. Pending merge.

13:22 <domenkozar[m]> what would be a good fix for https://github.com/NixOS/nix/issues/963

13:22 <{^_^}> nix#963 (by domenkozar, 4 years ago, open): error "value is a list while a set was expected" is too vague

13:23 <domenkozar[m]> to fix a number of such issues, ideally, it would take something like opening a repl, but it's probably not feasible that it would always result in a working repl

13:24 <domenkozar[m]> printing values is also tricky since one could do (import nixpkgs) + []

13:25 <siraben> lol I ran into that today because I wrote `overlays = [ haskellPackages.ghcWithPackages (h: ...) ]`

13:25 <siraben> well, similar flavor

13:27 <clever> parens needed

13:27 <clever> lists always do that

13:29 <siraben> right

13:31 <domenkozar[m]> ohh https://github.com/NixOS/nix/pull/3901

13:31 <{^_^}> nix#3901 (by edolstra, 25 weeks ago, open): Add a flag to start the REPL on evaluation errors

13:58 BaughnLogBot has joined #nixos-dev

14:08 tilpner_ has joined #nixos-dev

14:09 marek_ has joined #nixos-dev

14:09 marek_ is now known as marek

14:10 tilpner has quit [Ping timeout: 265 seconds]

14:11 tilpner has joined #nixos-dev

14:13 tilpner_ has quit [Ping timeout: 260 seconds]

14:24 <infinisil> gchristensen: Could it be that you broke declarative flake jobsets with your recent hydra changes? Because it seems to be

14:25 <gchristensen> is it possible? ... it is possible

14:25 <gchristensen> what are you seeing?

14:26 <infinisil> There's no error, the declarative jobsets evaluates, it produces the correct result (a store path with jobsets saying `"type": 1` and `"flake": "github:..."`), but the actual jobsets still use the previous state (non-flake in my case)

14:26 <gchristensen> any logs in the database?

14:26 <infinisil> Oh how would I check?

14:26 <infinisil> (but yeah I'm thinking some database update must've failed)

14:27 <gchristensen> find the database, and journalctl -fu postgresql

14:27 <gchristensen> or -eu

14:27 <gchristensen> wait, you're moving from a non-flake to a flake jobset?

14:27 <infinisil> Yeah

14:28 <infinisil> Hold on, getting db logs

14:28 <gchristensen> I don't know why that would be a problem, but good to know

14:34 <infinisil> gchristensen: https://paste.infinisil.com/56vCC2Iito.log

14:37 <infinisil> Hydra project is https://hydra.mantis.ist/project/ecip-checkpointing

14:39 <infinisil> The latest declarative jobsets eval gives https://paste.infinisil.com/idAQaCX-Cg.json

14:39 <infinisil> But e.g. checking the API for a jobset, it's still the non-flake one: curl -H 'Accept: application/json' https://hydra.mantis.ist/jobset/ecip-checkpointing/pr-16

14:40 <infinisil> Gives https://paste.infinisil.com/NL7XPqYFg0.json

14:41 <infinisil> That db error somehow doesn't look related, but it's the only thing I get in the log for a .jobsets eval

14:42 kalbasit___ has joined #nixos-dev

14:42 orivej has joined #nixos-dev

14:44 <infinisil> There's also a whole bunch of these errors, seemingly at the time hydra was updated to master: https://paste.infinisil.com/pohDq6W_ZE.log

14:45 <infinisil> Though I can't scroll back further to see if that happened before too

14:46 <siraben> Is there a community repo of Nix flake templates?

14:46 <siraben> I see https://github.com/NixOS/templates

14:46 <gchristensen> thanks infinisil, looking ...

14:47 kalbasit___ has quit [Ping timeout: 265 seconds]

14:49 <gchristensen> infinisil: it looks like you're running inconsistent versions of code?

14:50 <infinisil> Hmm, inconsistent between what?

14:50 <gchristensen> like the revision of hydra you're running is incompatible with the database

14:51 <infinisil> Oh, hmm I did see something about having to run `hydra-init` in an issue earlier

14:51 <infinisil> I personally didn't do the update to hydra master, but I guess if that needs to be done manually it could easily be forgotten

14:51 <infinisil> Though it is run automatically by NixOS' hydra module. Gonna check if that's being used

14:52 <gchristensen> it should be run automatically by the module, indeed

14:52 <gchristensen> you may need to manually restart the queue runner

14:53 <infinisil> Yeah that ran indeed when it was updated to master. I see `upgrading Hydra schema from version 65 to 66` up to `upgrading Hydra schema from version 69 to 70`

14:53 AlwaysLivid has quit [Remote host closed the connection]

14:54 <infinisil> Hmm the queue-runner has been restarted since the upgrade, I'll try again

14:55 <gchristensen> infinisil: you should be up to version 72 at this point

14:55 AlwaysLivid has joined #nixos-dev

14:56 <infinisil> Ah it's not quite master, just from 4 days ago (was master at the time of upgrading): https://github.com/nixos/hydra/commit/6d047c286f5c86d8240167602d5c8b3c18ce1ab7

14:57 <sterni> infinisil: about the lib.generators.toPretty change, I noticed today that it doesn't fix anything actually, because tryEval apparently doesn't catch the kind of error generated by builtins.functionArgs

14:57 <sterni> I must have tested this with the wrong nix version yesterday, I guess I'll make a PR reverting this change

14:57 <sterni> unless there is another way to catch these kinds of failures

14:57 <gchristensen> infinisil: yeah, that commit introduces schema version 72

14:57 <infinisil> sterni: (-> #nix-lang?)

14:57 AlwaysLivid has quit [Read error: Connection reset by peer]

14:57 <infinisil> Hmmm

14:57 <sterni> infinisil: oh right

14:58 <infinisil> gchristensen: Oh, the upgrades were done too

14:58 <gchristensen> okay

14:58 <infinisil> Up to 72, though it was a bit hidden within the SQL

14:59 AlwaysLivid has joined #nixos-dev

14:59 <infinisil> Man I'm bad at reporting problems heh

14:59 <gchristensen> I just checked hydra.n.o and it hasn't seen a single error of this sort: https://paste.infinisil.com/pohDq6W_ZE.log

14:59 <gchristensen> so I'm going to hope restarting the queue runner fixes it ? :)

15:00 <gchristensen> https://paste.infinisil.com/56vCC2Iito.log looking at this error "new row for relation "jobsets" violates check constraint "jobsets_check"" is interesting, that constraint is: alter table Jobsets add constraint jobsets_check check (schedulingShares > 0);

15:00 <infinisil> Yeah saw that too, very weird

15:00 <gchristensen> it sounds like you're not giving the jobset any shares?

15:00 jonringer has joined #nixos-dev

15:00 jonringer has quit [Remote host closed the connection]

15:00 <infinisil> Unfortunately nothing is different with a restarted queue runner

15:01 jonringer has joined #nixos-dev

15:03 <gchristensen> can you give me context lines from https://paste.infinisil.com/56vCC2Iito.log ?

15:04 <infinisil> gchristensen: https://paste.infinisil.com/BsOi0qt8pQ.log

15:04 <infinisil> Oh timestamps missing

15:07 <infinisil> gchristensen: https://paste.infinisil.com/BNarpJbJjM.log

15:07 <infinisil> Note the last timestamps here ^ That's the time when I retried flakes, only these couple lines appeared

15:07 <infinisil> 28 Jan is after the upgrade

15:08 <infinisil> gchristensen: Hmm maybe we could try out creating a new declarative flake jobset for hydra.nixos.org?

15:09 <infinisil> s/jobset/project

15:10 <gchristensen> can you connect to the hydra server, `ps auxfg | grep hydra` and share all the store paths? I'm still thinking that you're running out of sync versions of the code

15:12 kalbasit___ has joined #nixos-dev

15:12 <infinisil> Hmm well it doesn't show the store path of hydra-queue-runner

15:12 <infinisil> gchristensen: https://paste.infinisil.com/3L3pfxk8tc

15:13 <clever> infinisil: ls -l /proc/PID/exe

15:13 <clever> that tells you which binary (absolute path) is behind a given pid

15:13 <gchristensen> lol and those store paths are absolutely useless, since they have absolutely no version information

15:13 <gchristensen> them being dirty is suspicious too

15:13 <infinisil> Ah nice: lrwxrwxrwx 1 hydra-queue-runner hydra 0 Feb 1 14:55 /proc/14773/exe -> /nix/store/xly8wy993ji85g24wd6ap2qkj5bmz3bp-hydra-0.1.19700101.DIRTY/bin/hydra-queue-runner

15:13 <clever> gchristensen: nix-store -q --deriver, i think

15:13 <infinisil> That is at least the same store path
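
The `/proc/PID/exe` trick clever mentions can be scripted. A small sketch, assuming a Linux `/proc` filesystem and a store located at `/nix/store`; the helper names `exe_of` and `store_path_of` are hypothetical.

```python
import os


def exe_of(pid):
    """Absolute path of the binary behind `pid`, as `ls -l /proc/PID/exe` shows (Linux only)."""
    return os.readlink(f"/proc/{pid}/exe")


def store_path_of(path):
    """Return the /nix/store/<hash>-<name> prefix of `path`, or None if it is outside the store."""
    parts = path.split("/")
    if len(parts) >= 4 and parts[:3] == ["", "nix", "store"] and parts[3]:
        return "/".join(parts[:4])
    return None


# The queue runner path from the log above:
exe = "/nix/store/xly8wy993ji85g24wd6ap2qkj5bmz3bp-hydra-0.1.19700101.DIRTY/bin/hydra-queue-runner"
print(store_path_of(exe))
```

Comparing `store_path_of(exe_of(pid))` across the running hydra processes would show whether they come from the same build, which is the out-of-sync question being chased here. It says nothing about which source revision produced that store path.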

15:14 <gchristensen> can you get the deployed revision?

15:14 <infinisil> That should be the one I linked earlier

15:14 <infinisil> https://github.com/nixos/hydra/commit/6d047c286f5c86d8240167602d5c8b3c18ce1ab7

15:14 * infinisil double checks

15:14 <gchristensen> and the deploy was dirty, what diff is present?

15:16 <andi-> my hydra on ~yesterdays master also throws some 500s when restarting failed jobs... :/

15:16 <gchristensen> nice, let's get it fixed

15:16 justanotheruser has joined #nixos-dev

15:16 <infinisil> I'll look into why that's DIRTY, I don't think it should be

15:16 <infinisil> Maybe because it's fetched by niv somehow

15:19 <gchristensen> andi-: refactoring schemas is hard without a compile time checker

15:19 <andi-> Mine looks more like a perl issue not a DB issue

15:19 <andi-> > Caught exception in Hydra::Controller::JobsetEval->restart_failed "Can't locate object method "project" via package "Hydra::Model::DB::JobsetEvals" at /nix/store/252137553mzhqmqjwv3i9g0wma7a4fpa-hydra-0.1.19700101.DIRTY/libexec/hydra/lib/Hydra/Controller/JobsetEval.pm line 155."

15:19 <{^_^}> error: syntax error, unexpected IN, expecting ')', at (string):471:18

15:20 <gchristensen> yeah, that is the hard part

15:20 <infinisil> And I'm pretty convinced the DIRTY thing just comes from it being fetched from Niv and not with flakes. So I think it uses the `default.nix` with flake-compat

15:20 <andi-> infinisil: yeah, same happens if you use fetchgit

15:21 <gchristensen> andi-: https://github.com/NixOS/hydra/compare/master...grahamc:jobsetevals-fixups if you want to try this patch?

15:26 AlwaysLivid has quit [Remote host closed the connection]

15:26 kalbasit___ has quit [Ping timeout: 272 seconds]

15:29 <infinisil> Hmm but yeah there's no declarative jobsets on hydra.nixos.org

15:30 <infinisil> How about creating one, and with flakes too, just to test whether it works? Because if it does, then it's an issue with my deployment. Otherwise it's an issue with hydra itself

15:31 <andi-> gchristensen: deploying..

15:31 <gchristensen> andi-: I just pushed an updated version which should be more clear and be identical

15:33 <andi-> ok

15:37 <gchristensen> I tested that locally and it worked, fwiw

15:46 <siraben> > builtins.currentTime

15:46 <infinisil> gchristensen: I guess I'll file an issue for the problem I'm having

15:46 <{^_^}> 1612194385

15:46 <siraben> > builtins.currentTime

15:46 <{^_^}> 1612194390

15:46 <gchristensen> infinisil: can you `\d jobsets` ?

15:46 <gchristensen> in a `psql` terminal

15:47 <infinisil> Did not find any relation named "jobsets".

15:47 <infinisil> Wait I'm probably not connected to the right db

15:48 <gchristensen> hopefully you have such a relation :)

15:49 <infinisil> gchristensen: https://paste.infinisil.com/uQN8XMvjhw

15:49 <gchristensen> it is very difficult, debugging this with IRC as the debug protocol :P

15:49 <gchristensen> this doesn't make any sense

15:50 <gchristensen> infinisil: select * from schemaversion;

15:50 <infinisil> 72

15:50 <gchristensen> oh nevermind it does make sense

15:52 <gchristensen> infinisil: link to the hydra again?

15:53 <infinisil> gchristensen: https://hydra.mantis.ist/

15:53 <{^_^}> firing: RootPartitionLowDiskSpace: https://monitoring.nixos.org/prometheus/alerts

15:54 <infinisil> gchristensen: project config: https://paste.infinisil.com/piHVXxGbU0.png

15:54 dhess has joined #nixos-dev

15:54 <infinisil> Points to https://github.com/input-output-hk/mantis-gac/blob/checkpointing-flakes/jobsets/ecip-checkpointing.json

15:55 <gchristensen> infinisil: can we do a video call + tmate session?

15:56 <infinisil> gchristensen: Sounds good!

15:57 <gchristensen> meet.jit.si/UncheckedSchemasNoTypeChecker

15:57 <infinisil> Oh damn, tmate's servers are broken though, still

15:57 <gchristensen> still? :(

16:06 jonringer has quit [Remote host closed the connection]

16:06 <andi-> gchristensen: that patch from earlier fixed my issue

16:06 <gchristensen> nice

16:07 <gchristensen> hopefully we can track down this new one too

16:07 jonringer has joined #nixos-dev

16:13 <gchristensen> infinisil: `log_statement = 'all'`

16:14 <gchristensen> services.postgresql.settings.log_statement = "all";

16:38 globin has joined #nixos-dev

16:47 mikroskeem has joined #nixos-dev

16:53 <infinisil> gchristensen: https://paste.infinisil.com/SG3KcfM40c

16:57 <immae> What is the goal with `((type = 0) = (nixexprinput IS NOT NULL AND nixexprpath IS NOT NULL))` ?

16:57 <immae> is it supposed to be an "imply" term? `type == 0 => (...)` ? If so, it’s incorrect

17:00 <gchristensen> infinisil: UPDATE jobsets SET flake = 'github:input-output-hk/ECIP-Checkpointing/66dbb9c0117d2965617755c193dbb035f096a149', type = '1' WHERE ( ( name = 'pr-16' AND project = 'ecip-checkpointing' ) );

17:02 mikroskeem has quit [Quit: WeeChat 3.0]

17:02 <immae> gchristensen: it won’t work: in your case nixexprinput and nixexprpath are non-null, so (type = 0) = (...) will be false and the check will fail, no?

17:03 <immae> the check should be `(type != 0) || (nixexprinput IS NOT NULL AND nixexprpath IS NOT NULL)` if I understand correctly the goal of it

17:03 kalbasit___ has joined #nixos-dev

17:04 <gchristensen> good catch! we'd come to that at the same time =)

17:05 <gchristensen> I don't think we should relax the constraint, I think we should nullify those columns when setting flake params

17:06 <immae> it’s not "relaxed", the constraint seems incorrect to me in its current state

17:06 <gchristensen> I don't think so... it is invalid to specify an nixexprinput / nixexprpath in combination with a flake

17:07 <infinisil> immae: I think it's just flipped around, type = 0 is equal to (type != 1)

17:07 <immae> hmm 0 is "non-flake" right?

17:07 <infinisil> So it's `(type != 1) == (nixexprinput is not null ...)`

17:07 <infinisil> Yea

17:07 <infinisil> And that's then an implication again

17:07 <immae> no an equal sign is not an implication

17:08 <immae> it might be what you want though regarding what gchristensen said :)

17:09 <immae> you want a check that says "type 0 => flake == null and nixexprinput != null AND nixexprpath != null", and reverse for "type 1", is that it?

17:11 <gchristensen> if it is a flake, I want flake to not be null and nixexprinput and nixexprpath to be null

17:11 saschagrunert has quit [Remote host closed the connection]

17:11 <gchristensen> if it is not a flake, I want flake to be null and nixexprinput and nixexprpath to not be null

17:12 <immae> Then the boring and no-error-prone way to write it is `(flake IS NULL AND nixexprinput IS NOT NULL AND nixexprpath IS NOT NULL AND type = 0) OR (flake IS NOT NULL AND nixexprinput IS NULL AND nixexprpath IS NULL AND type = 1)`

17:12 <gchristensen> I mean, the schema is correct

17:13 <gchristensen> the update query is not correct

17:14 <immae> if nixexprinput is null and nixexprpath is not null and flake is not null and type is 1 then your check passes but it shouldn’t

17:14 <gchristensen> ah

17:15 <gchristensen> good catch!

17:16 <immae> (you might say that you don’t care, and I would accept it, but the assertion that the schema is correct is false regarding the constraints I understood :) )
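
The difference between the two checks can be reproduced in isolation. This is a sketch under stated assumptions: it uses an in-memory SQLite table as a stand-in for Hydra's PostgreSQL schema, the table is cut down to the four columns discussed here, and only the constraint fragment quoted in the channel is modelled (the real `hydra.sql` may carry further checks). The helper `accepts` is hypothetical.

```python
import sqlite3

# The check as quoted by immae at 16:57.
QUOTED = "(type = 0) = (nixexprinput IS NOT NULL AND nixexprpath IS NOT NULL)"

# The exhaustive form immae proposed at 17:12.
EXHAUSTIVE = (
    "(type = 0 AND flake IS NULL AND nixexprinput IS NOT NULL AND nixexprpath IS NOT NULL)"
    " OR (type = 1 AND flake IS NOT NULL AND nixexprinput IS NULL AND nixexprpath IS NULL)"
)


def accepts(check, row):
    """True if a row (type, flake, nixexprinput, nixexprpath) satisfies the CHECK."""
    db = sqlite3.connect(":memory:")
    db.execute(
        "CREATE TABLE jobsets (type INTEGER NOT NULL, flake TEXT,"
        f" nixexprinput TEXT, nixexprpath TEXT, CHECK ({check}))"
    )
    try:
        db.execute("INSERT INTO jobsets VALUES (?, ?, ?, ?)", row)
        return True
    except sqlite3.IntegrityError:
        return False
    finally:
        db.close()


rows = {
    "legacy jobset": (0, None, "src", "release.nix"),
    "flake jobset": (1, "github:owner/repo", None, None),
    "flake set, old columns kept": (1, "github:owner/repo", "src", "release.nix"),
    "immae's counterexample": (1, "github:owner/repo", None, "release.nix"),
}
for label, row in rows.items():
    print(label, accepts(QUOTED, row), accepts(EXHAUSTIVE, row))
```

Both forms reject a flake jobset that still carries `nixexprinput` and `nixexprpath`, which matches gchristensen's point that the update query has to null those columns. Only the exhaustive form rejects the half-filled row from immae's counterexample.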

17:17 kalbasit___ has quit [Quit: WeeChat 2.9]

17:32 mkaito has quit [Quit: WeeChat 3.0]

17:32 mkaito has joined #nixos-dev

17:41 <gchristensen> immae: you're right

17:41 <gchristensen> immae: can you open an issue and/or PR for it?

17:42 <gchristensen> infinisil: my audio is busted ...... let me know the result of deploying that patch? also, can you link me to the patch?

17:42 <infinisil> gchristensen: Ahh damn. But yeah will do :)

17:42 <gchristensen> thanks :)

17:43 <infinisil> gchristensen: https://github.com/Infinisil/hydra/commit/74122abbaa359381168d5e9c4ba049b4a8f53d0e

17:43 <infinisil> (currently testing it)

17:46 <infinisil> Evaluation has been going for over 4 minutes, which might be reasonable considering it's heavy IFD, but at least there's no error, which is different from before the patch

17:46 <infinisil> Or rather, at least something is happening, instead of it just showing the previous error

17:46 <immae> gchristensen: I sure can, but I have absolutely no clue where this kind of thing is defined. Should I look somewhere in hydra repo?

17:47 <NinjaTrappeur> infinisil: expect at least 15min.

17:47 <infinisil> Oof!

17:47 <NinjaTrappeur> We should materialize that.

17:47 <infinisil> s/We/I :P

17:47 <NinjaTrappeur> https://input-output-hk.github.io/haskell.nix/tutorials/materialization/#materialization

17:47 <NinjaTrappeur> yes :P

17:47 <NinjaTrappeur> enjoy haha

17:48 <infinisil> Well if it's already built, it should be faster

17:48 <NinjaTrappeur> yup

17:48 <infinisil> This is the first time it builds the new flake stuff though :)

17:50 <gchristensen> immae: a file called hydra.sql

17:50 <immae> found it thanks

17:51 <immae> I should write some "migration" one too I guess?

17:51 <gchristensen> that would be great!

17:54 <infinisil> 13 minutes and counting..

17:54 <gchristensen> sounds about right lol

17:56 rajivr has quit [Quit: Connection closed for inactivity]

17:59 <infinisil> Every time I have to wait for something I'm debating whether it's worth starting something new

17:59 <infinisil> s/starting something new/working on something else

18:00 <infinisil> If I knew it would take over 19 minutes I probably would've done so earlier lol

18:00 <gchristensen> yeah it is a motivator in making things as fast as possible, to keep people from getting distracted :)

18:00 <infinisil> Would be cool if Hydra showed every stage of IFD

18:00 <immae> Or you can just look at your server working and take a rest :)

18:03 <andi-> I wish I knew why my hydra instance has problems with running nixos tests... regardless of runner (e.g. even when building them on my workstation) they appear stuck very often. When I run the same drv outside of the hydra remote builder setting it just works...

18:03 <andi-> Load can't be an issue as I've some machines on max-jobs=1

18:10 thefloweringash has quit [Ping timeout: 244 seconds]

18:12 <infinisil> 30 minutes and counting..

18:22 etu has joined #nixos-dev

18:23 <bennofs> just kill IFD, it's never been a great thing...

18:24 thefloweringash has joined #nixos-dev

18:25 <infinisil> It's great in some ways, not as much in others

18:35 <ajs124> andi-: Good to know that someone else has that problem. Although we don't have max-jobs=1.

18:36 <ajs124> I always assumed it was some part of our config, which is kind of weird, in some ways.

18:37 <dhess> We also get stuck NixOS tests on our Hydra, from time to time. It's pretty rare, though.

18:38 <dhess> We also see what look like races where we'll occasionally get spurious failures. They nearly always pass after the job is re-tried.

18:39 <ajs124> Maybe there is something about our config, after all. Just checking right now, there's a nixos.tests.predictable-interface-names.unpredictableNetworkd.x86_64-linux running since 2d 2h 16m 16s.

18:41 <andi-> ajs124: exactly that!

18:41 <andi-> I have no idea why they aren't killed because of a timeout

18:42 <ajs124> If it weren't for other people (who don't like my super hacky solutions), I'd honestly have a systemd timer that restarts the queue runner once a day.

18:44 <andi-> ajs124: the hydra builders are rebooted daily ;-)

18:44 <andi-> at least those from packet..

18:59 <bennofs> is the hydra.nixos.org webinterface currently unreachable or is that something on my end?

19:08 cole-h has joined #nixos-dev

19:08 <dhess> bennofs: also here

19:10 pinpox has joined #nixos-dev

19:17 <gchristensen> I'm taking a look see

19:22 <immae> gchristensen: https://github.com/NixOS/hydra/pull/856 for the check fix :)

19:22 <{^_^}> hydra#856 (by immae, 1 minute ago, open): Fix check in jobsets

19:24 <immae> Ah does the travis check that the sql syntax is correct (i.e. does it start a VM with hydra?), or should I test it myself?

19:26 tilpner_ has joined #nixos-dev

19:27 tilpner has quit [Ping timeout: 246 seconds]

19:27 tilpner_ is now known as tilpner

19:29 <gchristensen> please test it yourself too :P

19:37 <immae> ok I’ll try that

19:37 <immae> (it’s the first time I’m running a hydra :p )

19:41 <cole-h> gchristensen: Semi-relatedly, but the instructions in hydra.sql (specifically step 2) seem to be "wrong"? I had to run `make -C src/sql -f Makefile.am update-dbix`, and dunno what the `hydra-postgresql.sql` arg is for / means.

19:42 <gchristensen> did you do the bootstrap / configure phase stuff in the HACKING docs?

19:43 <cole-h> o maybe not :D

19:45 <cole-h> Yep, that removed the necessity for `-f Makefile.am`, but still unsure what `hydra-postgresql.sql` is for.

19:45 <gchristensen> hrm maybe it is stale

19:45 <cole-h> (The only reference to that file I can find is in hydra.sql)

19:46 <cole-h> And even renaming that arg to `hydra.sql` shows "Nothing to be done for hydra.sql". So yeah, maybe stale

19:46 tilpner has quit [Remote host closed the connection]

19:46 <gchristensen> cool

19:46 tilpner_ has joined #nixos-dev

19:46 <gchristensen> this is so bizarre

19:47 <gchristensen> a query is taking 5000ms according to the slow query log, and taking just 1ms if I run it locally

19:47 <cole-h> Maybe your NVMe's are better than Hydra's? :D

19:47 <gchristensen> this is all hydra

19:48 <cole-h> I assumed "run it locally" meant you were running it on your replicated hydra backup, and that the "slow query log" was on Hydra (h.n.o) itself

19:49 <gchristensen> ah, I mean manually

19:51 <immae> gchristensen: what does the slow query look like?

19:51 <gchristensen> https://gist.github.com/grahamc/328e686e77be8becd2db8f51113ee29b

19:52 tilpner_ is now known as tilpner

19:55 <symphorien[m]> https://github.com/symphorien/nix-du/issues/5#issuecomment-770752868 << a store path A is alive because `nix-store --query --roots` reports a root B -> C.drv but this root cannot depend on A since it points to derivation files... It looks like an issue to me, but I'd like a second opinion before asking the person to open an issue against nix.

19:57 <gchristensen> I think one of my PRs must have introduced a query in a hot loop

19:58 <{^_^}> firing: RootPartitionLowDiskSpace: https://monitoring.nixos.org/prometheus/alerts

20:01 <gchristensen> https://monitoring.nixos.org/grafana/d/5LANB9pZk/per-instance-metrics?viewPanel=13&orgId=1&var-instance=haumea:9100&from=now-7d&to=now&refresh=30s

20:04 <infinisil> gchristensen: Can confirm the fix works :) Will PR shortly (and cross-link to immae's related PR)

20:05 <gchristensen> cool

20:05 <immae> infinisil: I didn’t feel like "fixing" anything, merely adding even more constraint to something that was already constrained :D

20:06 <immae> But if you’re happy it’s all good :)

20:07 <infinisil> Ah yeah, I've got a fix (created with the help of gchristensen) for the problem I was having earlier, which was very much related to those constraints

20:08 <gchristensen> samueldr: I think the implementation of storing the eval error message in with the jobseteval was a bit naive :)

20:08 <samueldr> I did say I wouldn't comment on the DB part :)

20:08 <gchristensen> hehe

20:17 jonringer_ has joined #nixos-dev

20:18 <lukegb> gchristensen: did it get blocked on another transaction?

20:18 nh2_ has joined #nixos-dev

20:18 alunduil_ has joined #nixos-dev

20:18 <gchristensen> I think it is blocked on literally writing bytes to the wire

20:18 <lukegb> ah

20:21 piegames1 has joined #nixos-dev

20:26 jonringer has quit [*.net *.split]

20:26 alunduil has quit [*.net *.split]

20:26 nh2 has quit [*.net *.split]

20:26 catern has quit [*.net *.split]

20:26 dtz has quit [*.net *.split]

20:26 jtojnar has quit [*.net *.split]

20:26 worldofpeace has quit [*.net *.split]

20:26 symphorien[m] has quit [*.net *.split]

20:26 JJJollyjim has quit [*.net *.split]

20:26 joepie91 has quit [*.net *.split]

20:26 piegames has quit [*.net *.split]

20:26 alunduil_ is now known as alunduil

20:26 nh2_ is now known as nh2

20:31 symphorien[m] has joined #nixos-dev

20:32 joepie91 has joined #nixos-dev

20:32 joepie91 is now known as Guest4239

20:32 <gchristensen> https://monitoring.nixos.org/grafana/d/5LANB9pZk/per-instance-metrics?viewPanel=13&orgId=1&var-instance=ceres:9100&from=1612210958448&to=1612211535256 this spike is when I turned hydra-server back on, and it started going down when I erased the per-evaluation logs

20:33 jtojnar has joined #nixos-dev

20:34 evanjs has joined #nixos-dev

20:35 worldofpeace has joined #nixos-dev

20:36 dtz has joined #nixos-dev

20:37 <samueldr> (throwback to a previous query of mine) would it help if they were files on the FS?

20:42 mkaito has quit [Quit: WeeChat 3.0]

20:45 mkaito has joined #nixos-dev

20:45 mkaito has quit [Changing host]

20:51 pmy_ has quit [Ping timeout: 256 seconds]

20:54 pmy_ has joined #nixos-dev

21:16 stolyaroleh_ has quit [Ping timeout: 265 seconds]

21:17 orivej has quit [Ping timeout: 265 seconds]

21:17 <gchristensen> samueldr: imho that isn't so useful and makes dealing with them harder

21:18 <gchristensen> one option is to go through and explicitly not select the column when it isn't needed, another option is to break it off into its own table

21:21 <gchristensen> seeing as there are quite a lot of queries for evals, the separate table is probably easier
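
[Editor's note: a minimal sketch of the "separate table" option discussed above, not Hydra's actual schema or fix. It uses Python's built-in sqlite3 as a stand-in for PostgreSQL, and the table names `evals` and `eval_errors`, the column names, and the helper functions are all hypothetical. The idea is that frequent listing queries never touch the large error text, which is fetched only on request.]

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with the error text split into its own table.

    All table and column names here are hypothetical, chosen for illustration.
    """
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE evals (
            id        INTEGER PRIMARY KEY,
            jobset    TEXT    NOT NULL,
            timestamp INTEGER NOT NULL
        );
        -- The potentially huge message lives apart from the hot table.
        CREATE TABLE eval_errors (
            eval_id  INTEGER PRIMARY KEY REFERENCES evals(id),
            errormsg TEXT NOT NULL
        );
        """
    )
    for i in range(1, 4):
        conn.execute(
            "INSERT INTO evals (id, jobset, timestamp) VALUES (?, ?, ?)",
            (i, "trunk-combined", 1612200000 + i),
        )
    # Only one evaluation failed; its message is large.
    conn.execute(
        "INSERT INTO eval_errors (eval_id, errormsg) VALUES (?, ?)",
        (2, "x" * 1_000_000),
    )
    conn.commit()
    return conn


def list_evals(conn):
    """Hot-path query: reads only the small columns of the evals table."""
    return conn.execute(
        "SELECT id, jobset, timestamp FROM evals ORDER BY id"
    ).fetchall()


def eval_error(conn, eval_id):
    """Fetch the large error message only when a caller explicitly asks for it."""
    row = conn.execute(
        "SELECT errormsg FROM eval_errors WHERE eval_id = ?", (eval_id,)
    ).fetchone()
    return row[0] if row else None
```

[With this layout, `list_evals(conn)` cannot pull error text over the wire by accident, even if a query selects every column of `evals`; the other option mentioned, omitting the column from each query, would need every call site to be audited.]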

21:32 <cole-h> As someone who has taken exactly one (1) database class, I like the "separate table" option better as well :)

21:39 <dhess> Does a Hydra expect to be the only writer of an S3 binary cache? In other words, if I have another Hydra (or similar service) writing to the same binary cache, will that cause any problems?

22:01 <edef> multi-writer might cause orphaned NARs sometimes

22:01 <edef> but nothing serious

22:02 <edef> (i'm not sure about the binary cache GC mechanism, that might collect stuff you want to keep)

22:16 mkaito has quit [Quit: WeeChat 3.0]

22:17 catern has joined #nixos-dev

22:23 <dhess> edef: cool, thanks.

22:51 tilpner_ has joined #nixos-dev

22:52 tilpner has quit [Ping timeout: 256 seconds]

22:52 tilpner_ is now known as tilpner

23:03 <{^_^}> resolved: RootPartitionLowDiskSpace: https://monitoring.nixos.org/prometheus/alerts

23:07 tilpner_ has joined #nixos-dev

23:08 tilpner has quit [Ping timeout: 256 seconds]

23:08 tilpner_ is now known as tilpner

23:11 <lovesegfault> PSA: I'm working on bumping glibc

23:16 tilpner_ has joined #nixos-dev

23:18 tilpner has quit [Ping timeout: 256 seconds]

23:18 tilpner_ is now known as tilpner

23:34 zuh0 has joined #nixos-dev

23:37 __monty__ has quit [Quit: leaving]