zybell_ has quit [Ping timeout: 265 seconds]
{`-`} has joined #nixos-chat
Lisanna has quit [Quit: Lisanna]
jtojnar has joined #nixos-chat
Bogdacutu has joined #nixos-chat
Bogdacutu has quit [Ping timeout: 240 seconds]
goibhniu has joined #nixos-chat
monotux has quit [Ping timeout: 240 seconds]
monotux has joined #nixos-chat
<andi-> Nice April Fools' question on the mailing list m(
taktoa has quit [Remote host closed the connection]
zybell has joined #nixos-chat
jtojnar has quit [Read error: Connection reset by peer]
jtojnar has joined #nixos-chat
<zimbatm> the FHS troll?
<zimbatm> related to FHS, maybe we should add an FHS check in the fixup phase if they have an "installable" flag. It would avoid unnecessary conflicts when building user profiles.
zybell has quit [Ping timeout: 240 seconds]
zybell_ has joined #nixos-chat
<infinisil> (Dropping in from #nixos-borg to continue discussing version control systems with MichaelRaskin)
<infinisil> MichaelRaskin: Is git messier than I think?
tilpner has joined #nixos-chat
<samueldr> s/git/*/
<MichaelRaskin> Well, git has that DAG of commits, which it writes to disk in an unsafe way, and everything on top is a mess
<infinisil> Yeah I think it's better how monotone and apparently also Fossil use databases instead of files
<zybell_> unsafe? what do you call unsafe?
<MichaelRaskin> Well, you can use files safely, just use explicit write barriers
<zybell_> ?
<infinisil> While files are easy to handle on Linux, databases are a better abstraction for most cases
<MichaelRaskin> zybell_: git writes files in a way that can cause corruption if power is cut.
<MichaelRaskin> For example, on ext4 filesystem with default settings
<infinisil> imo. And files are easier to handle, but that's probably only because they're the only thing we know and they're well established
<zybell_> If files are immutable and checked by hash, you can check that a file is not corrupted.
<simpson> infinisil: To a point, sure. The problem is that everything still has to write to The Filesystem in the end, and that makes some stuff unhappy. For example, I figured out how to corrupt a Fossil DB reliably by putting it in a 'magic' auto-synced network folder. It'd be great if that sort of thing *did* reliably work.
<MichaelRaskin> zybell_: git doesn't write files in the way you have just described
<infinisil> simpson: How did that work?
<MichaelRaskin> simpson: well, does this syncing setup guarantee anything about fsync?
<simpson> infinisil, MichaelRaskin: SQLite was surprised that the network folder had been written to by another SQLite on the other side of the network.
<zybell_> That's on read; a corrupted file is as if it wasn't there, and eventually gc'd.
<simpson> The really fun part is that sometimes when it happened, the write came from *inside the localhost* whooooo whooooo~
<simpson> I never root-caused it, I just stopped putting Fossil DBs in magic folders.
<infinisil> Maybe the filesystem itself should be a database!
<MichaelRaskin> Well, there are write barriers and there is fsync
<infinisil> Files, aka strings of bytes, aren't flexible at all
<infinisil> as far as i know
<MichaelRaskin> Of course, network filesystems often fail to provide fsync guarantees (never mind locking guarantees), and FS-using programs often fail to use anything to communicate ordering requirements
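A minimal C sketch of the write-barrier pattern MichaelRaskin is describing, assuming POSIX semantics: write to a temporary file, fsync it, rename it over the target, then fsync the directory so the rename itself survives a power cut. Path names and error handling are illustrative only, not how git does it.

    /* Crash-safe file update on POSIX (illustrative sketch):
     * data is fsynced before the atomic rename, and the directory
     * is fsynced afterwards so the rename is durable. */
    #include <fcntl.h>
    #include <stdio.h>
    #include <unistd.h>

    int write_atomically(const char *dir, const char *name,
                         const void *buf, size_t len)
    {
        char tmp[4096], dst[4096];
        snprintf(tmp, sizeof tmp, "%s/.%s.tmp", dir, name);
        snprintf(dst, sizeof dst, "%s/%s", dir, name);

        int fd = open(tmp, O_WRONLY | O_CREAT | O_TRUNC, 0644);
        if (fd < 0) return -1;
        if (write(fd, buf, len) != (ssize_t)len) { close(fd); return -1; }
        if (fsync(fd) < 0) { close(fd); return -1; }   /* barrier: data on disk */
        close(fd);

        if (rename(tmp, dst) < 0) return -1;           /* atomic replace */

        int dfd = open(dir, O_RDONLY | O_DIRECTORY);   /* barrier: rename on disk */
        if (dfd < 0) return -1;
        if (fsync(dfd) < 0) { close(dfd); return -1; }
        close(dfd);
        return 0;
    }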
<simpson> MichaelRaskin: There's also Fossil's workdirs and extra dotfiles and several piles of TR1 written to less-than-SQLite's-standards, and also I learned what I feel is the appropriate lesson, and I also reported it as a bug on the magic-folder feature since that's the likeliest culprit.
<MichaelRaskin> Fossil tries to be a bit too fancy with its accounting, in my opinion
<MichaelRaskin> I prefer Monotone, where fewer things are outside of the main database
<MichaelRaskin> (but it is a completely non-moving project)
<simpson> infinisil: It's funny that you say that. Most of my side-business stuff is written to treat Tahoe-LAFS as a database and to do all of its work using patterns which avoid or tolerate write collisions.
<simpson> The biggest reason was that containers don't have filesystems, so abstracting the FS away made containerizing easier.
<simpson> I mean, containers have filesystems. But they don't have, y'know, filesystems.
<infinisil> Umm yes yes
<zybell_> If I write the expected hash into the filename, then I can write the content in any order; only when the content reaches the expected (and predicted) bit pattern does the file magically appear in the eyes of a reader that checks the hash.
<infinisil> simpson: (I really don't get what you're saying, I don't have any experience with Tahoe-LAFS and only little with containers)
<simpson> Actually, I wanna retell zooko's story of the ancient toasters. Suppose that one day we discover a lost civilization's ancient toasters. They are magical and wonderful and make the best toast ever, to the point where even if you're not utilitarian, you really want these toasters integrated into society somehow. There's only one tiny problem...
<MichaelRaskin> zybell_: then you discover that this is bad for performance
<zybell_> So I can be lazy with write checks, because of this magic atomicity.
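A rough C sketch of the hash-in-the-filename scheme zybell_ is describing, assuming OpenSSL's one-shot SHA256(); the store layout and function names are hypothetical. Because the reader recomputes the hash, a half-written or corrupted object simply fails verification and is treated as absent until gc reaps it.

    /* Content-addressed objects: the filename is the hash of the content,
     * so readers verify on load and ignore anything that doesn't match.
     * Hypothetical layout; assumes OpenSSL for SHA-256. */
    #include <openssl/sha.h>
    #include <stdio.h>
    #include <string.h>

    static void hex(const unsigned char d[SHA256_DIGEST_LENGTH], char out[65])
    {
        for (int i = 0; i < SHA256_DIGEST_LENGTH; i++)
            sprintf(out + 2 * i, "%02x", d[i]);
    }

    /* Store: name the file after the hash of its content. */
    int put_object(const char *store, const void *buf, size_t len, char id[65])
    {
        unsigned char digest[SHA256_DIGEST_LENGTH];
        SHA256(buf, len, digest);
        hex(digest, id);

        char path[4096];
        snprintf(path, sizeof path, "%s/%s", store, id);
        FILE *f = fopen(path, "wb");
        if (!f) return -1;
        size_t n = fwrite(buf, 1, len, f);
        fclose(f);
        return n == len ? 0 : -1;
    }

    /* Load-time check: recompute the hash; a corrupted object is
     * as if it were not there, and gc can remove it later. */
    int check_object(const void *buf, size_t len, const char id[65])
    {
        unsigned char digest[SHA256_DIGEST_LENGTH];
        char got[65];
        SHA256(buf, len, digest);
        hex(digest, got);
        return strcmp(got, id) == 0 ? 0 : -1;
    }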
<simpson> ...they blow up sometimes. Not always! But sometimes.
* infinisil listens closely
<simpson> zooko argues that if they only blow up once in a billion times that we try to make toast, then they might become popular in society. One-in-a-quadrillion is even worse; they'd be household objects.
<simpson> But if they're one-in-a-hundred, then society will realize the danger pretty quickly and build special toast-containment domes for safely producing toast.
<zybell_> Huh? Why is this bad for performance? Without fsync?
<simpson> zooko was arguing that we should, if we can't *prove* that write collisions are impossible, make our systems have write collisions pretty often, and design everything to be fault-tolerant around that.
<MichaelRaskin> zybell_: because FS provides much better throughput for reading and writing large contiguous blocks
<simpson> zybell_: And to add to MichaelRaskin's awesome phrasing, this is because the hardware itself usually reads and writes in large contiguous blocks.
<infinisil> simpson: Pretty smart move by zooko
<simpson> infinisil: Yeah, he's a great philosopher. Everybody gives him shit these days for starting an altcoin though. I dunno. I think the parable stands on its own; I think I actually heard it thirdhand through warner.
<MichaelRaskin> Oh, what hardware does is another sad story.
<zybell_> The hash must be checked anyway because of cryptographic guarantees. And large contiguous blocks are included in the phrase 'any order'.
<MichaelRaskin> A different sad story for each generation of storage technology
<MichaelRaskin> zybell_: if you have a single large file, you are likely to get a contiguous read on the hardware level. If you have a lot of small files, they are likely to be scattered on purpose to provide space for their future growth,
<MichaelRaskin> because filesystems do not currently provide a way to commit to having a file that can never grow beyond some size
<zybell_> I didn't say and didn't mean a random order. And you get the single large file when the objects are packed. Sensibly this is done only when you have a backup in the form of unpacked objects, because the atomicity doesn't work with packed objects.
<MichaelRaskin> Random order comes from the FS allocation strategies
zybell_ has quit [Ping timeout: 248 seconds]
<infinisil> And MichaelRaskin also showed me Pijul: https://pijul.org/
<infinisil> While it's really early in its development, the idea behind it is really nice
zybell_ has joined #nixos-chat
<zybell_> Don't know if this came through
<zybell_> You can fallocate() the size before writing, or even ftruncate() the file to the needed size (ftruncate() works upwards too).
<zybell_> If the FS honors such requests by modifying its allocation strategy, it may be used more often if properly documented.
<zybell_> Sent again
<infinisil> (It did not come through indeed)
<MichaelRaskin> zybell_: well, there are cases when applications ftruncate, then grow the file on the next use. So FS doesn't pack such allocations too densely.
<MichaelRaskin> Also, each file gets an integer number of allocation units.
<infinisil> Lol, lobsters should check that out if you haven't seen it already: https://lobste.rs/index.php
<zybell_> The idea would be to leave space as long as the file is open, but put a matching fallocate behind a closed file.
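A short C sketch of the preallocation zybell_ is suggesting, using posix_fallocate() with ftruncate() as a fallback; whether the filesystem actually uses the hint to place the file contiguously is up to its allocator, as MichaelRaskin notes.

    /* Preallocate a file's final size before writing, so the FS need not
     * leave slack for future growth.  Illustrative sketch, POSIX only. */
    #include <fcntl.h>
    #include <sys/types.h>
    #include <unistd.h>

    int preallocate(const char *path, off_t size)
    {
        int fd = open(path, O_WRONLY | O_CREAT, 0644);
        if (fd < 0) return -1;

        /* Reserve blocks for the whole file up front. */
        int err = posix_fallocate(fd, 0, size);
        if (err != 0) {
            /* Fallback: ftruncate() also grows the file (it works upwards),
             * but only creates a sparse file instead of reserving blocks. */
            if (ftruncate(fd, size) < 0) { close(fd); return -1; }
        }
        close(fd);
        return 0;
    }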
goibhniu has quit [Ping timeout: 260 seconds]