sphalerite changed the topic of #nixos-dev to: NixOS Development (#nixos for questions) | NixOS 19.03 released! https://discourse.nixos.org/t/nixos-19-03-release/2652 | https://hydra.nixos.org/jobset/nixos/trunk-combined https://channels.nix.gsc.io/graph.html https://r13y.com | 19.03 RMs: samueldr,sphalerite | https://logs.nix.samueldr.com/nixos-dev
drakonis has quit [Quit: WeeChat 2.5]
drakonis has joined #nixos-dev
pie_ has quit [Ping timeout: 250 seconds]
<jtojnar> worldofpeace: testing the meson GIMP (https://github.com/jtojnar/nixpkgs/tree/gimp-meson), so far found a meson bug or something https://paste.gnome.org/pxp55snmz
drakonis has quit [Quit: WeeChat 2.5]
drakonis has joined #nixos-dev
evanjs has joined #nixos-dev
evanjs has quit [Client Quit]
evanjs has joined #nixos-dev
drakonis has quit [Quit: WeeChat 2.5]
orivej has joined #nixos-dev
orivej has quit [Ping timeout: 246 seconds]
pie_ has joined #nixos-dev
orivej has joined #nixos-dev
ixxie has joined #nixos-dev
<arianvp> I was wondering, can we do something similar to this for NAR file storage? https://github.com/systemd/casync
<arianvp> catar files and NAR files seem almost identical in goals
<arianvp> and this could save a lot of storage for binary caches
<gchristensen> can you give a little explanation of what your idea is w.r.t. how it applies to nars?
<gchristensen> is it just chunking?
<tilpner> arianvp: Have you tried it to check how much space it would save?
<arianvp> it chunks NARs with a rolling hash function which allows NARs with similar content to share disk space
<arianvp> there's probably some entropy-theoretical argument that you can make on how much disk space this can save
<arianvp> but dont have numbers
<arianvp> gchristensen: the catar file format and nar file format are virtually identical
<tilpner> It would depend a lot on the selection of chunks, so it would be good to first check if it would be worth the effort
<arianvp> the chunk selection happens automatically using the rsync algorithm
<gchristensen> interesting
<arianvp> yes it's an extremely cool technique
<arianvp> I think the fastest way to see if this is worthwhile is to write a little script that converts NARs into CATARs and see the size difference
<arianvp> :P
<arianvp> (and then indexing said CATARs)
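As a rough illustration of the content-defined chunking arianvp describes, here is a minimal Python sketch. The boundary function is a toy stand-in for casync's buzhash, and the size parameters are illustrative, not casync's defaults:

    import hashlib

    MIN_CHUNK = 2048      # avoid degenerate tiny chunks
    MASK = (1 << 14) - 1  # cut when low 14 bits are zero -> ~16 KiB average

    def chunks(data: bytes):
        """Yield (sha256-hex, bytes) content-defined chunks of `data`."""
        h, start = 0, 0
        for i, b in enumerate(data):
            # Toy rolling hash with an effective window of ~32 bytes: old
            # bytes shift out of the 32-bit state, so two streams sharing a
            # long run of identical bytes cut at the same boundaries.
            h = ((h << 1) + b) & 0xFFFFFFFF
            if i + 1 - start >= MIN_CHUNK and (h & MASK) == 0:
                c = data[start:i + 1]
                yield hashlib.sha256(c).hexdigest(), c
                start = i + 1
        if start < len(data):
            c = data[start:]
            yield hashlib.sha256(c).hexdigest(), c

Because boundaries depend only on nearby content, chunking two versions of the same package's NAR and intersecting the hash sets gives a quick estimate of how much they would share in a chunk store.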
<tilpner> I can see how it might be helpful for serving store paths from local disk
<tilpner> Not sure how feasible it is for cache.nixos.org
<arianvp> well the nice thing is that it will give incremental updates too. If a NAR of nginx-1.19 is similar to nginx-1.18, the download would be incremental
<arianvp> however it would mean you need to store the NARs on the client, which we currently dont do I think?
<tilpner> Oh, so Nix learns about chunks, it's not just a cache/hydra-side thing?
<tilpner> No, they're currently not stored, and that might increase space usage a lot
<arianvp> casync solves this by mounting the NAR files using FUSE
<arianvp> instead of unpacking them
<tilpner> Which would probably be a slow-down on average over having the realised files on disk
<arianvp> yeh so it would be both a space-saving and a bandwidth-saving technique
<tilpner> The space savings might make up for that, but that needs numbers
<arianvp> yes there might be a slow-down. and yes these are all unknowns
<arianvp> just seems like a fun direction to explore; I'll put some time in it
<tilpner> I'm exporting my local store to NARs and intend to chunk them afterwards
<tilpner> Just to see how/if the size changes
<tilpner> How would this play with compression though?
<tilpner> Don't most local caches store the nars in compressed form?
<arianvp> it works okay-ish with things like squashfs where random access is preserved
<arianvp> but yes compression can indeed ruin some of the benefits
<arianvp> there's a nix-like distro that also uses FUSE
<tilpner> Yes, I've seen it. I'm cautious about running every single library/application through FUSE, but I don't know how much actual overhead there is
<arianvp> ack
<clever> arianvp: have you seen narfuse and fusenar?
<tilpner> If it dedups really well, it might even speed up access on slow spinning disks
<clever> https://github.com/cleverca22/fusenar i originally wrote fusenar in c++, using the nar library already in nix
<clever> but to find the type of the root node (is /nix/store/foo a file or dir?) you have to parse the entire nar, even if its 2gig
<clever> and thats when the lure of haskell and lazy evaluation got me into haskell :P
<clever> with taktoa's help, i rewrote the whole thing as https://github.com/taktoa/narfuse
<clever> arianvp: narfuse lets you turn a directory full of foo.nar files, into a directory full of foo's
<clever> so /nix/foo/ can be mounted to /nix/store, and then bar.nar turns into just plain /nix/store/bar
<arianvp> cool
<clever> one limitation though, is that it operates on nar files, not nar.xz
<arianvp> clever: got it
<arianvp> have you seen the rest of the discussion above? About the casync tool which does automatic chunking and reuse of nar files?
<clever> but it could be modified to operate on a .nar.xz instead
<arianvp> (well, they have a file format very similar to NAR files, and I'm trying to find out if we can do a similar chunking thing)
<clever> the nar file format is pretty simple
<arianvp> tilpner: note that the individual chunks in casync are stored compressed. So that would make .xz'ing nar files redundant
<clever> its mostly just a series of size prefixed strings
<tilpner> Oh!
<tilpner> Neat
<arianvp> so I would expect less disk storage, and faster transfers with this kind of system
<clever> if i'm reading this code and remembering it correctly, a file is basically the following strings
<clever> "entry" "(" "name" "foo.txt" "node" "contents" ")"
<clever> each string, is prefixed by a 64bit int denoting the string's length
<clever> and a directory is then a "(", a series of "regular|symlink|directory" + <the above> pairs, and a ")"
<clever> so you could trivially break a nar up into its component strings, and then hash each one
<clever> of note, the strings: name, node, entry, regular, symlink, directory, contents, (, and ) appear often
<clever> so you might need an escape hatch to not dedup those (they are smaller than a hash) and just keep them in the resulting stream
<clever> so you would transform a nar, into a series of tokens and hashes, along with a hash=body set, that can be shared
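For the curious, here is a minimal Python sketch of that tokenisation, assuming the standard NAR encoding (each string is a 64-bit little-endian length, then the payload, zero-padded to an 8-byte boundary; the stream opens with the magic string "nix-archive-1"):

    import struct
    import sys

    def nar_strings(f):
        """Yield every length-prefixed string in a NAR stream."""
        while True:
            head = f.read(8)
            if len(head) < 8:
                return
            (n,) = struct.unpack("<Q", head)
            payload = f.read(n)
            f.read((8 - n % 8) % 8)  # skip zero padding to 8-byte alignment
            yield payload

    if __name__ == "__main__":
        # e.g.: nix-store --dump /nix/store/...-hello-2.10 > hello.nar
        with open(sys.argv[1], "rb") as f:
            for s in nar_strings(f):
                print(len(s), s[:40])

Hashing each yielded string (with clever's escape hatch for the short keyword tokens) is all the "break a nar into component strings" step needs.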
<clever> arianvp: but if you're running zfs, you get the same kind of dedup, at least at the storage level, but not at the network layer
<arianvp> there is some explanation about why this is still a benefit on COW systems
<arianvp> casync is COW-aware and will even do some reflinking magic itself too
<clever> network is the main one i can see
<arianvp> yes
<arianvp> it is meant as an image delivery mechanism
<clever> nix-store --optimise pretty much makes cow-aware stuff unneeded
das_j has quit [Remote host closed the connection]
<arianvp> not as a storage-mechanism
<clever> another issue though, at the network layer
<clever> are you going to store each chunk as a file over http?
<clever> thats a round trip per file
<tilpner> Might be fine with pipelining or parallel requests?
<clever> thats kinda half the point of things like nar and tar, so you can download 1000's of files as a single bytestream, rather than having to do 1000's of requests
<arianvp> HTTP2 solves that doesnt it?
<arianvp> (Not that the casync tool currently supports that. it's a bit of an experimental thing it seems)
<clever> http1.1 can do pipelining, 2 just adds things like the server force-feeding you stuff you didnt know you wanted yet
<clever> but if you already have parts, the server force-feeding you chunks is a waste of bandwidth
<arianvp> yeh just wanted to say: you dont want predictive push, as the benefit comes from the client knowing which chunks it already has
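To give a sense of how parallel requests amortise the per-chunk round trips, a small Python sketch; the chunk-store URL layout here is hypothetical, not casync's actual layout:

    import urllib.request
    from concurrent.futures import ThreadPoolExecutor

    BASE = "https://cache.example.org/chunks"  # hypothetical chunk store

    def fetch_chunk(h: str) -> bytes:
        # Sharding by hash prefix is a common layout choice, assumed here.
        with urllib.request.urlopen(f"{BASE}/{h[:4]}/{h}.cacnk") as r:
            return r.read()

    def fetch_all(hashes, workers=32):
        """Download chunks concurrently; results come back in input order."""
        with ThreadPoolExecutor(max_workers=workers) as ex:
            return list(ex.map(fetch_chunk, hashes))

The client only asks for hashes it is missing locally, which is exactly the property server push would defeat.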
<clever> i can see ipfs being of use here, but that complicates another one of my ideas
<clever> and makes the perf even worse
<arianvp> not sure what ipfs has got to do with this? I do remember bringing up casync on an IPFS+nix thread before though but dont remember why
<arianvp> oh yeh in this thread: https://github.com/NixOS/nix/issues/1006
<{^_^}> nix#1006 (by Ericson2314, 3 years ago, open): git tree object as alternative to NAR
<clever> ipfs is just a merkle_hash(value)=value storage system
<arianvp> just like casync :P
<clever> and if you know the hash, you can then fetch the object from the ipfs network
<arianvp> (sort of)
<clever> there is a limit to chunk size in ipfs
<clever> so it also supports special chunks, that are just a list of hashes of other chunks
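A toy version of that scheme in Python, to make the shape concrete; real IPFS uses multihashes and a protobuf/CBOR DAG encoding, not this ad-hoc "list:" format:

    import hashlib

    LIMIT = 256 * 1024  # per-chunk size limit; IPFS blocks default to 256 KiB

    def put(store: dict, data: bytes) -> str:
        """Store `data` under its hash, splitting oversized objects into a
        special chunk that just lists the hashes of its children."""
        if len(data) <= LIMIT:
            h = hashlib.sha256(data).hexdigest()
            store[h] = data
            return h
        kids = [put(store, data[i:i + LIMIT])
                for i in range(0, len(data), LIMIT)]
        node = ("list:" + ",".join(kids)).encode()
        h = hashlib.sha256(node).hexdigest()
        store[h] = node
        return h

    def get(store: dict, h: str) -> bytes:
        # Toy decoding; a real format tags leaf vs. list nodes unambiguously.
        v = store[h]
        if v.startswith(b"list:"):
            return b"".join(get(store, k) for k in v[5:].decode().split(","))
        return v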
<tilpner> IPFS had fairly high resource consumption, last I tried (hours ago)
<tilpner> 30-40% CPU on a small VPS, and 300-400MB memory use
<clever> the main cost i can see with ipfs, is that it has to turn each file, into a tree of chunks
<clever> and it then has to post into the DHT, at the hash of each chunk, "peer XYZ has this chunk"
<clever> so if your file breaks into 1000 chunks, you have to do 1000 DHT puts
<clever> and if somebody wants to download that file, they have to do 1000 DHT gets, from random points within the hash table
<tilpner> And it has to join the DHT in the first place, which rules out a few usecases of Nix
<clever> ipfs would be an optional thing
<clever> i'm just thinking, if you are going to be hashing each chunk anyways, then using the ipfs hashing rules makes ipfs an option
<clever> if you use plain sha256, then you're forcing the need to have a hash->hash lookup, to use ipfs fetches
<Ericson2314> clever: have they not considered a sparse DHT with just some common roots, and if you can't find something in there, crawling up your graph to see if anyone has something which refers to the thing you are missing?
<Ericson2314> not a great system, but good to cope with a too large dht
<clever> Ericson2314: ive not looked in depth at what the dht is doing, mostly just filling the gaps in with how other dht's work
<Ericson2314> right i haven't looked either
<arianvp> how do I copy my entire /nix/store to nar files?
<clever> arianvp: one minute
<arianvp> nix-store --export ?
<clever> --export doesnt make a nar
<clever> it makes a different thing, that contains many nars
orivej has quit [Ping timeout: 245 seconds]
<clever> `nix-store --dump /nix/store/foo > foo.nar` will make a nar, but wont include any closure info, it wont even include the "foo" in the nar itself
<clever> [clever@amd-nixos:~]$ nix copy /nix/store/n9z80xrc7bidx5hcap2wvb5l9r2vk6y0-hello-2.10 --to file:///home/clever/cache-test/
<clever> arianvp: this will recursively copy (and xz compress) a given path, to a given dir, and generate narinfo files that preserve the closure data
<clever> if that directory is served over http, it can then be used as a binary cache
pie_ has quit [Ping timeout: 250 seconds]
<arianvp> so to have one for my system I have to pass the nixos derivation there?
<clever> yeah
<clever> you can also pass it /run/current-system/ to just cache whatever you're currently running
<arianvp> how does the cache handle non-directory entries in the nix store?
<arianvp> are they also NAR'd?
<clever> yep
<clever> the root element in the nar is a file in that case
<clever> or a symlink
<arianvp> as in
<arianvp> . is a file?
<clever> yeah
<arianvp> darnit casync doesnt support that. so ill have to skip all the file stuff and only do directories
<clever> with nar files, every element only has a type, and a body
<clever> names are not attached to the elements themselves
<clever> rather, a directory is just a series of name+element pairs
orivej has joined #nixos-dev
<clever> and the root element can be any type of element, so its valid for the root to just be a file, in which case, no name exists
<clever> arianvp: also, for any fixed-output derivation with outputHashMode = "recursive";, the sha256 of it, is just the sha256 of the nar
<clever> arianvp: so when you're doing fetchFromGithub, you're giving it the hash of the nar for the $out it generates
<clever> outputHashMode = "flat"; is for the special case where you expect $out to be a file, in which case you're giving it the plain hash of that file without wrapping it in a nar
<clever> arianvp: you may need to special-case files, and just have them bypass catar? and just be stored as a single chunk?
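A minimal sketch of checking that equivalence, assuming nix-store is on PATH; nix-hash --type sha256 <path> computes the same digest directly, and nixpkgs usually shows it in Nix's own base32 alphabet rather than hex:

    import hashlib
    import subprocess
    import sys

    def nar_sha256(store_path: str) -> str:
        """sha256 of the NAR serialisation of a store path -- the value a
        fixed-output derivation with outputHashMode = "recursive" pins."""
        nar = subprocess.run(["nix-store", "--dump", store_path],
                             check=True, capture_output=True).stdout
        return hashlib.sha256(nar).hexdigest()

    if __name__ == "__main__":
        print(nar_sha256(sys.argv[1]))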
orivej has quit [Ping timeout: 245 seconds]
pie_ has joined #nixos-dev
<arianvp> okay casyncing my entire store
<arianvp> :)
<clever> arianvp: nix copy also has a --all flag
<arianvp> Nix doesnt really have a hard dependency on NARs does it? as in I could just extract these catar files and then register the paths right?
<arianvp> i.e. make a prototype that bypasses nix's own caching stuff
<clever> arianvp: /nix/store is read-only by default, so you must go thru nix-daemon to extract anything
<arianvp> oh yeh darnit
<tilpner> zfs does not like this .castr structure
<tilpner> 9.1G vs 7.3G (--apparent-size)
<clever> tilpner: try to find the worst offending file, then ls -lhs it
<arianvp> yeh so there is special case code for btrfs filesystems in casync
<arianvp> wonder if it does the same on zfs... probably not
<tilpner> zfs doesn't support reflink AFAIK
<arianvp> ah
<tilpner> clever: It's a giant directory forest, and lots of small files
<clever> tilpner: ah, there is a min size for things
<arianvp> tilpner: align your --chunk-size with the zfs min-size
<arianvp> will probably be more friendly :)
<clever> ls -lhs will reveal the min size
<clever> [clever@amd-nixos:~/apps/nixpkgs-master]$ ls -lUsh /nix/store/.links/ | head
<clever> 4.5K -r--r--r-- 5 root root 538 Dec 31 1969 07a8cfj62fij9nb23zb12d7nhzq2czfngjn4m2dnbir5cr9f8p5a
<clever> 512 lrwxrwxrwx 2 root root 83 Dec 31 1969 1w19ljky3g9dnnis4x6zj443lmsxiyz7jws6hzyc3qqzifk0hj9y -> /nix/store/l95nkqp7bdimqnz9ixay1aahljzsz7vc-python-2.7.15/lib/python2.7/decimal.pyc
<clever> at 538 bytes and up, it eats a whole 4.5kb
<arianvp> oh I forgot to enable --with=symlinks oops
<clever> 83 bytes and down, it fits inside the pointers that would normally say where the data exists, so it doesnt even need a data block
<arianvp> wonder what will happen now, whether it will follow the symlinks or just ignore them
<clever> between 83 and 538 bytes, youll need to investigate more :P
<tilpner> clever: They're not small enough to fit under 83 bytes
<tilpner> And I'm not sure what I expected anyway
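A quick way to act on clever's "find the worst offending file" suggestion is to compare apparent size against allocated size over the whole chunk store; a minimal Python sketch (512-byte st_blocks units are the POSIX convention):

    import os
    import sys

    def worst_slack(root: str, top: int = 10):
        """Print the files paying the most filesystem overhead under root."""
        rows = []
        for dirpath, _dirs, files in os.walk(root):
            for name in files:
                path = os.path.join(dirpath, name)
                st = os.lstat(path)
                rows.append((st.st_blocks * 512 - st.st_size,
                             st.st_size, path))
        rows.sort(reverse=True)
        for overhead, size, path in rows[:top]:
            print(f"{overhead:>8} B overhead on {size:>8} B file {path}")

    if __name__ == "__main__":
        worst_slack(sys.argv[1])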
<tilpner> Presumably, the ecosystem would have to agree on one chunking configuration
<tilpner> Which might be problematic with everyone running different FS' with different settings
* tilpner ends casync after creating 390841 .cacnks
<arianvp> 274812 and going
<arianvp> =)
das_j has joined #nixos-dev
<das_j> how is that assert at the top of pkgs/os-specific/linux/phc-intel/default.nix supposed to work? Wouldn't meta.broken be more appropriate?
<clever> das_j: broken might also work, would want to test it though of course
<das_j> clever: Alright. It prevents one of my systems from evaluating because it has a 4.9 kernel
<clever> das_j: i would expect broken to cause the same problem, if you attempt to reference that package
<das_j> clever: That's the weird thing. I do not reference it
<clever> what does --show-trace say?
<das_j> Now that I think about it, it's probably another issue. The system builds fine on 19.03, but fails on unstable
<das_j> cc ajs124
ixxie has quit [Ping timeout: 258 seconds]
pie_ has quit [Ping timeout: 250 seconds]
pie_ has joined #nixos-dev
ixxie has joined #nixos-dev
ixxie has quit [Ping timeout: 248 seconds]
ixxie has joined #nixos-dev
drakonis has joined #nixos-dev
pie_ has quit [Ping timeout: 250 seconds]
pie_ has joined #nixos-dev
lopsided98 has quit [Remote host closed the connection]
lopsided98 has joined #nixos-dev
ixxie has quit [Ping timeout: 245 seconds]
ixxie has joined #nixos-dev
justanotheruser has quit [Ping timeout: 245 seconds]
pie_ has quit [Ping timeout: 250 seconds]
orivej has joined #nixos-dev
justanotheruser has joined #nixos-dev
pie_ has joined #nixos-dev
pie_ has quit [Ping timeout: 250 seconds]
<marek> how do we forward changes from staging to master? just opening a PR against master with the cherry-picked commit once it is verified in staging?
<samueldr> staging eventually graduates into staging-next when it's deemed good to go to master, and staging-next is where a last check for breakage is done and those breakages are fixed
<ivan> staging is for things that rebuild too many packages and so shouldn't go into master until a staging-next merge
<samueldr> staging-next is eventually merged into master once deemed good
<globin> marek: see also https://github.com/NixOS/rfcs/pull/26
<{^_^}> rfcs#26 (by vcunat, 1 year ago, merged): staging workflow
<marek> ok, just wondering if I'm responsible for forwarding my change once it is in staging, or whether it will get to master eventually on its own
<marek> globin: I see, thank you
das_j has quit [Remote host closed the connection]
pie_ has joined #nixos-dev
drakonis has quit [Ping timeout: 246 seconds]
pie_ has quit [Ping timeout: 250 seconds]
drakonis has joined #nixos-dev
<worldofpeace> jtojnar: Noticed the bug also, I could reproduce it with latest meson.
<jtojnar> worldofpeace: 10⁹$💩️ https://github.com/mesonbuild/meson/issues/5844
<{^_^}> mesonbuild/meson#5844 (by jtojnar, 3 hours ago, open): AttributeError: 'NoneType' object has no attribute 'startswith'
<worldofpeace> jtojnar: 🤣 It's rather funny really. Wonder why it's an issue we only trigger
<jtojnar> worldofpeace: because Meson normally means Python 3 is available
<jtojnar> whereas we have purity
<worldofpeace> 🔷 in the rough. Oh, these assumptions.
<worldofpeace> jtojnar: so what do you think of splitting the module into what they think is core already https://gitlab.gnome.org/GNOME/gnome-build-meta/blob/master/elements/core.bst. We've done a kind of interesting thing in deepin https://github.com/NixOS/nixpkgs/blob/master/nixos/modules/services/desktops/deepin/deepin.nix. The actual desktopManager enables those services, so perhaps we can do something similar?
<jtojnar> worldofpeace: yeah, that looks good
<jtojnar> except I want core to be even more barebones
<jtojnar> core without core-utilities
edwtjo has quit [Ping timeout: 272 seconds]
<worldofpeace> With three separate options, would that be needed? Then the gnome3 module by default enables all three of them.
ixxie has quit [Ping timeout: 248 seconds]
edwtjo has joined #nixos-dev
<jtojnar> worldofpeace: do you mean one option each for core-os-services, core-shell and core-utilities?
<worldofpeace> jtojnar: yes
<jtojnar> worldofpeace: yeah that sounds good to me
<worldofpeace> jtojnar: thanks, putting it together now
<worldofpeace> jtojnar: huh it seems they don't use telepathy by default https://gitlab.gnome.org/GNOME/gnome-build-meta/commit/69e9ccc898ae1482fbc79a42491f364fd4fb6160
<jtojnar> worldofpeace: yeah, they have been trying to get rid of it for a long time
<jtojnar> there is almost no upstream core development (only some backends, mostly in Qt land)
<jtojnar> empathy has been dead for ages
<jtojnar> one of the main authors even renounced it
<jtojnar> which is too bad; if Purism had adopted it instead of libpurple, everything would be much nicer
<worldofpeace> Though I do understand Purism's choice there, jtojnar, given how unpleasant reviving things can look
<jtojnar> worldofpeace: it's not like libpurple is particularly alive, but I can understand that they wanted to limit necromancy to a minimum
pie_ has joined #nixos-dev
paradigm has joined #nixos-dev
ryantm has quit [Quit: Lost terminal]
ryantm has joined #nixos-dev
drakonis has quit [Quit: WeeChat 2.5]
orivej has quit [Ping timeout: 244 seconds]
<worldofpeace> jtojnar: more than useful, I think we want that 😃
justanotheruser has quit [Ping timeout: 244 seconds]
justanotheruser has joined #nixos-dev
<jtojnar> worldofpeace: I am going to add a lot of notes not related to the split, since I am already paying attention to every line
<worldofpeace> jtojnar: comments on github, or do you mean pushing comments to the branch?
<jtojnar> worldofpeace: on GH
<worldofpeace> 👍️ jtojnar
<jtojnar> will add an emoji to the relevant stuff so it is easier to address