XSF Discussion - 2017-03-02

Ge0rG 00:03:11
Damn, you've got me. I type my gpg password rather often. I can look up the other things for mutt tomorrow if you are interested
Zash 00:03:33
I'd rather know how to not quit mutt by accident all the time
Ge0rG 00:04:59
Unbind the Q key
Zash 00:05:25
Whose brilliant idea was it to put quit and 'go back' on the same key anyways?
Ge0rG 00:07:59
Zash: it's a sensible idea in general. Except when you want to "leave" a limit filter
SamWhited 03:42:55
I have my password saved in a GPG'ed file; mutt unlocks GPG on start to get the password, which also keeps the GPG agent unlocked for 15 minutes or whatever, which works pretty well.
Ge0rG 04:04:52
Zash: for incoming mail, you can set pop_pass and imap_pass in imap, or even bind a key to a macro like "cimaps://user@domain:password@server/INBOX\n"
Zash 04:08:33
https://tools.ietf.org/html/rfc6778 and https://tools.ietf.org/html/rfc7017
Ge0rG 04:09:22
That's so meta.
Zash 04:10:44
https://trac.tools.ietf.org/group/tools/trac/wiki/Imap
Ge0rG 04:11:46
Zash: you might want to tell the XSF why you are pasting all the URLs in here.
Zash 04:16:23
Ge0rG: I'm sleep-pasting URLs I think
Ge0rG 04:17:09
Zash: time to get coffee, then.
Ge0rG 04:17:28
I've had my first coffee of the day at 0430 local time.
Zash 04:18:42
Anyways, the IETF seems to have gone through the process of figuring out better ways to access mailing list archives, so I'm trying to nudge people towards looking at the work they did.
Ge0rG 04:32:53
Zash: I'm not sure how IMAP is going to help in that regard. It sounds to me like a mix of NNTP nostalgia and nerd cred.
Ge0rG 04:33:17
Zash: I'd like to have a feature where you can search the ML by affected XEPs. So a kind of tagging.
Ge0rG 04:33:40
And people write the craziest things into the Subject:, so you can't just /~s XEP-0123
Ge0rG 04:33:59
if we could add XEP-xxxx tags post-factum, it would be great.
Zash 04:37:33
Makes archives predating your subscription accessible.
Ge0rG 04:38:29
Zash: last time I needed that (and it was to correctly reply-to to a mail), I just downloaded the .mbox. I think that the number of people who care about that, outside the IETF, is small.
Ge0rG 04:39:10
Zash: and the set of people who fail to import an .mbox into their MUA, but manage to connect to an anon IMAP is probably very small.
Zash 04:44:48
The underlying point is to look at what a similar organization did about pretty much the same problem.
Ge0rG 04:46:43
Okay, I can buy into that
Zash 04:48:07
They did end up with a pretty nice search thingy.
Ge0rG 05:02:10
Zash: I hope you don't mean "connect with imap, use your MUA search" approach
Zash 05:05:04
Ge0rG: https://mailarchive.ietf.org/arch/
Ge0rG 05:06:06
Zash: it looks like a web MUA to me. I searched for "xmpp" and wasn't impressed with the results too much
Ge0rG 05:06:12
OTOH, it looks like a MUA.
Ge0rG 05:12:32
Oh Android. If you register your app as an Intent handler, older versions use "{handler_title}" as the display text, and newer versions use "Open with {handler_title}". I'm pretty sure I'm not the only one to find "Open with Add contact" a strange wording.
Ge0rG isn't awake either, yet. Just misread the last members@ thread as "XSF Bored Meeting Minutes". 05:14:52
jonasw 07:32:11
ah, that ietf-mailarchive-thing is nice
jonasw 07:32:22
seen it a couple of times
jonasw 08:09:53
I’m starting the writeup of the XEP-115 (Entity Capabilities) replacement. I have a few questions: 1. I would like to acknowledge Waqas's work and the work of the authors of XEP-115. How do I do that appropriately? The XEP-Template doesn’t have an acknowledgements section, but seeing that XEP-115 (and others) have one, I assume that’s an appropriate way to do it. Correct? 2. In the examples I will need a namespace. Where will I source it from? Should I use a namespace under my own control and the editor will choose a different one when the XEP is accepted as experimental?
Kev 08:10:36
Is this a replacement of 115, or an update to 115?
daniel 08:11:52
jonasw: there is no formal way for acknowledgements. Most authors just dedicate an entire section to it
jonasw 08:12:42
Kev: replacement, you can probably work your way from http://logs.xmpp.org/xsf/2017-02-28/#19:49:01 upwards to see the discussion around that.
Kev 08:13:43
Just re-using 115 seems appropriate to me, you're not in need of drastically changing the protocol, are you?
Kev 08:15:04
(I note that other things like pubsub have dependencies on 115, so if you write a whole new XEP you're looking at patching a *lot* of XEPs to update those dependencies)
daniel 08:16:21
That's probably true
jonasw 08:16:42
interesting point, no one seems to have thought about that the other day
jonasw 08:17:00
a namespace bump for 115 would be less intrusive probably
Kev 08:17:52
A namespace bump, if needed, or maybe a backwards-compatible update (if possible) seem reasonable to me. But keep in mind it's not coffee-o'clock yet, and I don't even drink coffee.
jonasw 08:18:32
backwards-compatible won’t happen. the algorithm (and I’m not talking about sha1 or something) is broken and in need of fixing for eight years.
Kev 08:18:59
I'm not utterly convinced that means it can't happen (forwards-compatible can't happen, certainly), but I'm not convinced it can, either.
jonasw 08:19:59
i should probably announce coffee-o-clock now.
jonasw 08:21:48
in my opinion, xep 60 doesn’t have a dependency on 115, but on 30. it’s just worded badly.
jonasw 08:22:44
or rather, "in my reading" than "in my opinion"
jonasw 08:24:22
from the amount of new work I’m doing for it, an update to 115 feels more appropriate than a new xep, too
Flow 08:29:47
Kev, Steve Kille: Would MIX be interested in an atomic CAS for PubSub? For example, to replace the subject/topic/... of a node without a race. I'm considering writing a CAS add-on XEP for PubSub.
jonasw 08:30:21
what is CAS?
Flow always wonders why there is no CAS for PubSub 08:30:25
jonasw 08:30:33
(I only know Computer Algebra System, which I assume you don’t mean)
Flow 08:30:39
jonasw: compare-and-swap
jonasw 08:30:42
ah!
jonasw 08:30:45
makes sense.
jonasw officially announces coffee-o-clock! 08:30:54
jonasw 08:31:14
(or rather, tea-o-clock)
Ge0rG had two cups of coffee yet. Time to get a new one. 08:31:54
jonasw 08:32:17
Flow: I feel that CAS will be hard to implement server-side. when do two XML subtrees compare equal?
Flow 08:32:28
jonasw: by node id
Flow 08:32:34
err item id
Tobias 08:32:38
CAS?
jonasw 08:32:39
okay
Tobias 08:32:44
ah..nvm
jonasw 08:33:24
Flow: CAS would be useful for data storage in PEP nodes, too
Flow 08:33:40
jonasw: It would be useful everywhere where PubSub/PEP is used
jonasw 08:33:51
mostly everywhere :)
jonasw 08:33:54
but yes.
Flow 08:34:15
and where you want to avoid accidentally deleting existing data because of a race condition
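A minimal sketch of the compare-and-swap idea discussed above, assuming a node with a single "current" item that is compared by item id as Flow suggests. `ToyNode` and `compare_and_swap` are hypothetical names for illustration; nothing here is taken from an existing XEP or server.

```python
from typing import Optional, Tuple


class ToyNode:
    """Hypothetical in-memory stand-in for a PubSub node with one current item."""

    def __init__(self) -> None:
        # (item_id, payload) of the current item, or None if nothing is published
        self.current: Optional[Tuple[str, str]] = None

    def compare_and_swap(self, expected_id: Optional[str],
                         new_id: str, payload: str) -> bool:
        """Publish only if the current item id is still the one the caller saw."""
        current_id = self.current[0] if self.current else None
        if current_id != expected_id:
            return False  # someone else published in between; nothing is overwritten
        self.current = (new_id, payload)
        return True


node = ToyNode()
first = node.compare_and_swap(None, "item-1", "Topic A")       # node was empty
second = node.compare_and_swap("item-1", "item-2", "Topic B")  # expected id matches
stale = node.compare_and_swap("item-1", "item-3", "Topic C")   # based on outdated id
```

The stale writer gets a refusal instead of silently replacing "Topic B", which is the race the discussion wants to avoid.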
jonasw 08:37:00
there are usecases where you add data instead of replacing by item id :)
Tobias 08:37:31
I wonder why 115 didn't just use Canonical XML standard for c14n of disco to later hash it https://www.w3.org/TR/2008/REC-xml-c14n11-20080502/
jonasw 08:38:19
Tobias: I was wondering about that, too, but I think canonical XML is strict with the relative ordering of elements
jonasw 08:40:39
also I’m not sure how many xml libs support c14n; considering that there are *still* some in use which don’t do namespaces properly
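The ordering concern can be illustrated with Python's standard library. Note the assumptions: `xml.etree.ElementTree.canonicalize` (Python 3.8+) implements C14N 2.0, not the C14N 1.1 recommendation Tobias linked, and the `urn:example:*` feature names are made up. The sketch only shows that canonicalization normalizes serialization details while keeping element order.

```python
import hashlib
from xml.etree.ElementTree import canonicalize

NS = "http://jabber.org/protocol/disco#info"

# Same features, same order, different quoting and empty-element style.
a = f"<query xmlns='{NS}'><feature var='urn:example:a'/><feature var='urn:example:b'/></query>"
a_requoted = (f'<query xmlns="{NS}"><feature var="urn:example:a"></feature>'
              f'<feature var="urn:example:b"/></query>')
# Same features, different order.
b = f"<query xmlns='{NS}'><feature var='urn:example:b'/><feature var='urn:example:a'/></query>"


def c14n_sha256(xml_text: str) -> str:
    """Canonicalize the XML text, then hash the canonical form."""
    canonical = canonicalize(xml_text)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


same_content_same_hash = c14n_sha256(a) == c14n_sha256(a_requoted)
reordered_same_hash = c14n_sha256(a) == c14n_sha256(b)
```

Serialization differences disappear, but the reordered feature list still hashes differently, so a caps-style hash would need its own sorting step on top of c14n.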
Tobias 08:40:56
could be, yeah
Tobias 08:42:42
jonasw, you're aware of this thread, right? https://mail.jabber.org/pipermail/standards/2011-August/025011.html
jonasw 08:44:26
not yet
Flow 08:44:37
jonasw: which use cases are those?
jonasw 08:44:58
Flow: microblogging-ish :)
Flow 08:45:24
ahh right
Tobias 08:45:29
jonasw, it discusses a lot of issues with the current XEP-0115 that should be solved in a new version
jonasw 08:45:34
Tobias: thanks!
jonasw 08:45:37
I’m looking into it
Flow 08:45:48
jonasw: Also https://wiki.xmpp.org/web/XEP-Remarks/XEP-0115:_Entity_Capabilities
jonasw 08:46:00
I was also planning to ask standards@ for input when I have a first draft
Tobias 08:47:01
Flow, what? the IANA has two registries for hash names?
Flow 08:47:40
Tobias: Yep
jonasw 08:47:50
that’s a good point; the one we currently use doesn’t list sha3 for example
Flow 08:48:05
I discovered that when searching for a registry for ISR-SASL2
Tobias 08:48:11
Flow, just once, working with professionals :P
Flow 08:48:37
Tobias: Hehe, to be fair, that could happen to the XSF too :)
Tobias 08:48:56
Flow, nah...we'll only ever have XEP-0300, which can be updated relatively easily
Tobias 08:49:07
i think IANA stuff requires lots of time and process
Flow 08:49:20
If someone knows if and whom we should tell about this within the IETF/IANA, then please do so/tell me.
Flow 08:49:46
Link Mauve: BTW, SASL2?
jonasw 09:03:07
does anyone know the rationale for querying a specific disco-node containing the hash in the verification procedure of xep 115?
Tobias 09:05:15
jonasw, what exactly do you mean?
jonasw 09:05:45
example 3 here: https://xmpp.org/extensions/xep-0115.html#discover
jonasw 09:06:06
node='http://code.google.com/p/exodus#QgayPKawpkPSDYmwT/WM94uAlu0=' instead of simply querying without node. is the idea to avoid races with changing capabilities?
jonasw 09:06:35
hm, it mentions "backwards-compatibility"
jonasw 09:06:49
for avoiding races it seems helpful, why was it abandoned?
jonasw 09:06:58
(even though races wouldn’t be harmful here)
Flow 09:08:02
jonasw: so that you get the result of that very same hash?
jonasw 09:08:10
yes
Flow 09:08:14
that approach seems sensible to me
Tobias 09:08:15
could also help with server side caching i suppose
jonasw 09:08:36
Flow: I don’t like the approach though, from an implementer's point of view
Flow 09:08:39
e.g. Smack also responds to the last 10 hashes
Flow 09:09:01
jonasw: I do like the approach from an implementer's point of view
Tobias 09:09:14
Flow, you keep a history of what sets of features the last 10 smack releases supported?
Flow 09:09:36
Tobias: No, disco features are dynamic, not tied to a smack release
jonasw 09:09:50
Flow: there is no harm in a race here, because if you get a race with an unknown hash (if you know the hash, you don’t care) you simply get the updated disco#info and discard the hash.
Flow 09:09:52
so the last 10 features of the connection
Tobias 09:09:55
that too, yeah
Flow 09:10:47
jonasw: true, no race here, but it helps with other things, like tobias said, server side caching, and I think it's the cleaner approach
jonasw 09:10:54
how does it help with server-side caching?
Flow 09:11:08
jonasw: The server can cache the response
jonasw 09:11:27
hm okay
Flow 09:11:31
and send it instead of forwarding the request to the queried client
Tobias 09:11:35
jonasw, the server doesn't need to forward the IQ to the to-JID if it knows the from-JID just wants the disco#info for a hash
jonasw 09:11:39
makes sense
Tobias 09:11:43
it could reply directly
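A rough sketch of the server-side caching idea described above: the server keys its cache on the disco `node` string and only forwards the first query. `DiscoCache`, `fake_client` and the feature list are hypothetical; a real server would presumably also verify that the response actually matches the hash before caching it.

```python
from typing import Callable, Dict, List


class DiscoCache:
    """Hypothetical cache of disco#info results, keyed by the caps node string."""

    def __init__(self, forward: Callable[[str, str], List[str]]) -> None:
        self._forward = forward              # sends the IQ on to the queried client
        self._by_node: Dict[str, List[str]] = {}
        self.forwarded = 0                   # how many queries reached a client

    def query(self, to_jid: str, node: str) -> List[str]:
        if node in self._by_node:
            return self._by_node[node]       # reply directly, no round trip
        self.forwarded += 1
        features = self._forward(to_jid, node)
        # NOTE: hash verification is omitted here; an unverified cache could be poisoned.
        self._by_node[node] = features
        return features


def fake_client(to_jid: str, node: str) -> List[str]:
    """Stand-in for a connected client answering disco#info."""
    return ["http://jabber.org/protocol/disco#info", "urn:xmpp:ping"]


cache = DiscoCache(fake_client)
node = "http://code.google.com/p/exodus#QgayPKawpkPSDYmwT/WM94uAlu0="
first = cache.query("romeo@example.net/orchard", node)
second = cache.query("juliet@example.net/balcony", node)
```

Both queries get the same answer, but only the first one is forwarded to a client.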
jonasw 09:12:29
seems like using a different format for these nodes would be great though: '{ecaps2-namespace}#{hash-algo}.{hash-value}' or something along those lines to make it easily recognizable
jonasw 09:13:26
right now a server needs to track the 'node' exported in <{caps}c/> to know whether a disco-node is a caps hash
jonasw 09:13:30
*belongs to a caps hash
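A sketch of how the proposed `'{ecaps2-namespace}#{hash-algo}.{hash-value}'` layout could be built and recognized without tracking any client-provided prefix. The namespace `urn:example:ecaps2` is a placeholder (no namespace had been assigned in this discussion), and `make_node` / `parse_node` are hypothetical helpers.

```python
from typing import Optional, Tuple

ECAPS2_NS = "urn:example:ecaps2"  # placeholder; not an assigned namespace


def make_node(algo: str, value_b64: str) -> str:
    """Build a node name of the form '{namespace}#{hash-algo}.{hash-value}'."""
    return f"{ECAPS2_NS}#{algo}.{value_b64}"


def parse_node(node: str) -> Optional[Tuple[str, str]]:
    """Return (algo, value) if the node looks like a caps hash node, else None."""
    prefix = ECAPS2_NS + "#"
    if not node.startswith(prefix):
        return None
    # Split at the first '.', which is assumed not to occur in algorithm names.
    algo, sep, value = node[len(prefix):].partition(".")
    if not sep or not algo or not value:
        return None
    return algo, value


new_style = make_node("sha-256", "QgayPKawpkPSDYmwT/WM94uAlu0=")
parsed = parse_node(new_style)
old_style = parse_node("http://code.google.com/p/exodus#QgayPKawpkPSDYmwT/WM94uAlu0=")
```

With a fixed prefix, a server can tell from the node string alone whether a query is about a caps hash; the XEP-0115 style node is not recognized by this check.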
jonasw 09:27:10
is there an element I can use to link to another section in a XEP?
jonasw 09:27:21
except <link url='#anchor'/>
dwd 09:28:59
IANA has *no* registry for hash names. IANA has several protocol registries to cover parameters for hashes, some of these are strings.
jonasw 09:29:51
dwd: that makes sense and explains the odd titles for those registries.
Flow 09:30:26
like "Named Information Hash Algorithm Registry"
dwd 09:30:38
We co-opted one for our purposes in XEP-0300, but it's originally for PKIX, so it contains OIDs as well.
dwd 09:31:00
Maybe we should also allow urn:oid:2.16.840.1.101.3.4.2.1 for SHA-256?
jonasw 09:31:09
no
jonasw 09:31:12
no no no no
Tobias 09:31:13
dwd, although that one hasn't been updated since 2000something
jonasw 09:31:20
oids are a mess.
dwd 09:32:03
jonasw, How can you say that? They're terribly convenient stable identifiers. Even if Surevine only has one OID arc (Isode has two - snazzy).
jonasw 09:32:05
ugh, the names in xep-0300 are longer than some base64-encoded hash values themselves…
Tobias 09:32:28
jonasw, what names?
jonasw 09:32:31
dwd: as long as you don’t need to parse them semantically, it’s fine probably, like urns
jonasw 09:32:37
Tobias:
<var>
  <name>urn:xmpp:hash-function-text-names:md5</name>
  <desc>Support for the MD5 hashing algorithm</desc>
  <doc>XEP-0300</doc>
</var>
Tobias 09:32:53
yeah...that's so people don't use md5 :P
dwd 09:33:10
jonasw, Oh, the feature names.
Tobias 09:33:12
jokingly
jonasw 09:33:34
well, close.
>>> len(base64.b64encode(hashlib.sha256().digest()).decode("ascii"))
44
>>> len("urn:xmpp:hash-function-text-names:sha-256")
41
dwd 09:34:02
jonasw, Well, that's a reason to use SHA3-512, then.
jonasw 09:34:35
my python cannot into sha3
jonasw 09:34:58
hm, 3.6 can’t either…
Tobias 09:35:20
#sad
jonasw 09:35:34
but that looks like a configuration problem; it also doesn’t have BLAKE2b512 which is available in 3.5 here
mathieui 09:36:26
jonasw, 3.6 can do sha3 just fine
jonasw 09:36:29
Tobias: did you mean <sad/>?
jonasw 09:36:41
mathieui: yes, it appears to be a problem with my python3.6.0a3 probably sourced from debian/experimental
Tobias 09:36:43
jonasw, nah..i mean the trumpish hashtag sad ;)
Tobias 09:37:10
jonasw, so 3.6 doesn't have blake2 but 3.5 has?
jonasw 09:37:13
Tobias: or rather xep-14 <x xmlns="jabber:x:tone">sad</x>? :>
jonasw 09:37:25
Tobias: as I said: it’s most likely an issue with my local setup, the documentation says it is there:
jonasw 09:37:28
https://docs.python.org/3/library/hashlib.html
mathieui 09:37:47
Tobias, 3.6 has blake2 as well
Tobias 09:38:03
nice
jonasw 09:38:29
meh, short names for the functions in xep-0300 would be great
jonasw 09:38:33
or am I just missing those?
dwd 09:38:46
jonasw, The long names are only used in the disco#info, right?
jonasw 09:38:51
dwd: it appears so
dwd 09:39:04
jonasw, The actual use in protocol are short names, like "md5".
jonasw 09:39:26
dwd: but there doesn’t seem to be a registry or source to refer to on which short name to use for which function.
Tobias 09:39:36
jonasw, table 1 has short hash function names
jonasw 09:39:50
for some, yes.
Tobias 09:40:08
see the sentence before the table
jonasw 09:40:11
it is lacking sha3-{224,384} for example
jonasw 09:40:15
even including that sentence
Tobias 09:40:35
well yeah..didn't see much sense in those intermediate values
jonasw 09:41:15
fair point
jonasw 09:41:27
re-using 0300 makes a lot of sense
Tobias 09:41:47
the standard should probably be 256bit ones, and if you need more security, might as well go to 512 bit then
jonasw 09:43:07
hm, would making new hash functions mandatory trigger a bump on the <hash/> element…?
jonasw 09:43:11
that sounds like a *lot* of fallout.
Flow 09:44:52
jonasw: why should it trigger a (namespace?) bump?
jonasw 09:45:26
Flow: I don’t know. I’m asking.
Guus 09:47:48
*cough*Flow logo*cough*
jonasw 10:52:32
Kev: out of curiosity, what software are you talking about in your mail from 09:57+01:00?
Tobias 10:53:27
i just assumed that mail was some weird welsh humor :)
dwd 10:56:17
jonasw, I suspect it's mailman...
Guus 10:56:46
as we're all here: Does any more need to be discussed regarding https://github.com/xsf/xmpp.org/pull/269 ?
Guus 10:56:54
or rather: my merging of it?
Tobias 10:56:54
dwd, a new version of mailman you mean?
Tobias 10:57:10
or the current mailman?
dwd 10:57:25
Tobias, No, I think it's just whatever we're using now. I suspect there might - might - be sarcasm at play here.
jonasw 10:57:50
Guus: FWIW, github has a review feature, and it may make sense to have one or two eyes confirm that they took a close look on the changes, possibly leaving comments.
Tobias 10:58:04
dwd, never seen him use that before though
dwd 10:58:22
Tobias, No, it's unusual in those who are cursed by not being English.
Tobias 10:58:58
dwd, you misspelled 'blessed' there
jonasw 10:59:19
I had to change my editor's dictionary to en_US (from en_GB) to write XEPs :<
dwd 10:59:35
What? Why?
jonasw 10:59:45
because XEP-0134 (or -0001?) says so.
dwd 11:00:00
Sounds like a candidate for a PR, then. :-)
jonasw 11:00:07
https://xmpp.org/extensions/xep-0143.html#nt-idp1712848
Guus 11:01:49
jonasw: I don't disagree, but as far as I know, that feature is not used by XSF. We could, sure. I don't feel that there's a need for it here (the consequences of missing something in a PR review are very unlikely to be catastrophic for our website, and I prefer a continuous release cycle), but I accept that others think differently.
jonasw 11:02:24
Guus: it’s really low-entrance-barrier though (if you’re a github user), and I don’t mean that it should be *mandatory*.
Guus 11:03:09
jonasw: I'm using it for other projects. Not knowing when to use it appears to be my problem. :) I thought your PR was fine.
jonasw 11:03:56
have you checked I didn’t slip in a try: shutil.rmtree("/") except: pass in? :)
Guus 11:03:56
I am assuming that you thought so, because you PR'ed it in the first place.
jonasw 11:04:20
I’m new in the XSF, my word shouldn’t count a thing when I add code to servers.
Guus 11:05:34
Oh, you could have slipped in things. I recognized your name, I glanced at the code, I ran it locally, it had the desired effect and did not delete my root partition. That combined made merging the PR an acceptable risk for me.
jonasw 11:05:47
:-)
jonasw 11:07:12
I’m just saying that I completely understand the point of people asking for thorough reviews. I would do the same if it was my infrastructure.
Guus 11:07:46
Who am I to object to thorough reviews?
Guus 11:08:21
I think mine was thorough enough by my standards, but I am fully aware that others have different standards.
Kev 11:08:31
I think there's a significant difference between 'updating text on the website', which I'm fine with people generally having access to do. And "running code on our servers", for which most people don't have rights.
Tobias 11:08:47
Guus, i agree though that i should probably have left a note in the PR that I was planning to review it soonish
Kev 11:09:14
Running code that people thought was fine, but wasn't sensibly vetted caused us to not take part in GSoC last year, and huge amounts of wasted effort for me in the process, not to mention the downtime of the server so the XSF couldn't fulfil its primary purpose for a day.
dwd 11:09:31
FWIW, the pelicanconf.py file (the only one, as I understand it, that is executed on the server) looks perfectly safe to me and adequately simple.
dwd 11:10:05
It also looks clearly bounded, in as much as I can solve the halting problem in my head.
Tobias 11:10:23
dwd, as far as I know https://github.com/xsf/xmpp.org/blob/master/buildCompleteWebsite.sh is run to build the whole website on the server
Kev 11:10:32
I think the more crises someone has been through with production servers, the less blazé they get about deployment :)
Tobias 11:10:36
because pelican has very limited capabilities
Kev 11:11:09
Anyway, I don't object to the PR based on the description, I just don't want any code deployed on XSF servers that hasn't been reviewed by iteam.
jonasw 11:11:09
Tobias: it can do anything python can if you put it in the pelicanconf :>
Guus 11:11:12
Kev: I've been a production herding developer, professionally, for 10+ years.
Kev 11:11:39
Guus: And how many times has pushing something without checking it caused a day's worth of downtime for you? ;)
Tobias 11:11:50
jonasw, probably
Guus 11:11:55
including websites that have significant amount of views (millions, monthly)
Guus 11:12:15
Kev: I did check.
dwd 11:12:18
Kev, I think you may mean blasé, rather than blazé.
Kev 11:12:32
dwd: I very much do.
dwd 11:12:52
Although there's an argument for either.
Kev 11:12:54
Guus: Then I have no objection. Your original comment didn't mention that you'd reviewed the code, just run it locally.
dwd 11:13:33
jonasw, So, not threading, then? :-)
Kev 11:13:35
Well, I still have an objection in principle, because I think the server admins should get to review the code too, but I'm happy in this instance if you've reviewed the code.
Guus 11:14:53
Kev: I'm pretty sure I did not review it up to your standards. I'm also not worried by that.
jonasw 11:15:04
dwd: depends on the specific python implementation and the specific task. Python can very much thread in the sense that C extensions which are called from python code from different threads may in fact run in parallel. It is just pure python code which, on CPython at least, isn’t run in parallel. :)
dwd 11:15:28
Kev, This is build-time code, incidentally, not runtime code. So I'd hold it to lower standards.
Kev 11:15:43
jonasw: "Python can totally thread, as long as you code in C instead of Python"? :)
dwd 11:15:48
jonasw, Yeah, I'm only too aware...
jonasw 11:16:01
Kev: pretty much
Kev 11:16:42
dwd: When it's run on the server, I'm not sure the standards need to be much lower. If it's malicious, same effect, if it manages to resource-starve and bring down the server, same effect. There are some runtime cases (resource-heavy, but not resource-starving) that don't apply, but the standard's still pretty high.
jonasw 11:17:46
actually, this is why in the organisations I use pelican, the build system and the contents are separate repositories. The build system repository has strict review requirements, content lesser so.
jonasw 11:18:00
(although, fun fact: pelican lets you write to arbitrary files from the content files alone :-))
jonasw 11:18:10
(well, the current master branch doesn’t anymore)
Tobias 11:19:24
jonasw, templates probably still can though, right?
jonasw 11:19:47
not sure about that, but I don’t consider templates content.
Kev 11:19:50
Anyway, my opinion isn't going to matter for long. My new games PC has just arrived, and Cath is going to kill me as soon as she gets home and sees the den.
Tobias 11:20:32
heh :)
Guus 11:22:04
You have time for a games PC? *envy*
Kev 11:23:39
Sure. It just sits there, it doesn't need much time.
Kev 11:23:46
Now playing games, that would take more time...
jonasw 11:24:17
I would like to re-ask my question now that more people are active. When writing a new XEP, in the examples and specification I will need a namespace. Where will I source it from? Should I use a namespace under my own control and the editor will choose a different one when the XEP is accepted as experimental?
Guus 11:24:42
which of both is what will get you killed later today?
Kev 11:25:03
jonasw: It's easiest for the Editors if you use an appropriate NS from the start, although technically IIRC the Editors should pick one.
jonasw 11:25:13
okay
Kev 11:25:34
Stripping out your NS to replace it with an xmpp one at publication time is mostly busy-work.
Kev 11:25:47
And while the other Editors are much less lazy than me, still ... :)
jonasw 11:25:50
ack
jonasw 11:26:08
just wanted to make sure that I don’t overstep any boundaries by suggesting a namespace from the xmpp-urn-namespace
Kev 11:29:30
jonasw: It's slightly tweaking the process, but it's the sensible thing to do, and what everyone else does.
Kev 11:30:07
Guus: The mess, and the fact that I'm not intending to get rid of my old games PC but to run both in parallel; both run the risk of death-by-spouse.
Guus 11:32:16
Kev: in which case, I am glad I had the chance to meet you in person at FOSDEM, before your premature death.
jonasw 11:33:40
is there any precedent to form arbitrary (i.e. entity controlled) disco#info nodes from an urn:xmpp:-namespace? so for http://… namespaces it’s obvious to use # as a separator, is there any precedent what to use with urn:xmpp:-namespaces?
Kev 11:34:29
I'm afraid I'm too stupid to understand the question.
Tobias 11:35:29
jonasw, so you want to have dynamic namespaces, not previously defined in a XEP or registry?
jonasw 11:35:41
not namespaces, but disco#info node names
jonasw 11:35:52
nah, I’m too stupid to formulate it clearly. see in https://xmpp.org/extensions/xep-0115.html#discover <query xmlns='http://jabber.org/protocol/disco#info' node='http://code.google.com/p/exodus#QgayPKawpkPSDYmwT/WM94uAlu0='/> the node there is composed of a URL base and a hash value.
jonasw 11:36:53
I don’t see the point of using some client-provided string as a prefix so I would like to use the namespace of the XEP as prefix. what kind of separator makes sense between the prefix and the hash info? Is there a precedent for that?
jonasw 11:38:13
ah yes, it appears so
jonasw 11:38:22
xep 290 also uses #
arc 15:13:09
the argument that won me over on not allowing clients to dictate their resource was that of distributed hosting routing
Tobias 15:14:08
you mean clustering?
arc 15:16:00
sure, whatever term you want to have for a @server hosted by multiple servers. and sorry i completely misread the conversation above, so that statement was kinda out of the blue
Ge0rG 15:17:05
I'm still not convinced of that clustering use case. "Google does it this way" doesn't cut it for me.
arc 15:17:19
Ge0rG: we're going to need it for IoT
Kev 15:17:35
Ge0rG: Well, I guess it'd be interesting if you could explain how you solved it in your clustered server, to persuade the other clustered server vendors that it's easy?
Ge0rG 15:18:02
Kev: wait, let me fire up a bunch of dockers.
arc 15:18:37
right now prosody can effectively handle 40k concurrent users on an average AWS instance last i ran the brute force test. in order to scale to the size that some of these IoT manufacturers want you need multiple servers, ideally geographically distributed
Ge0rG 15:19:22
arc: what about running different per-region domains?
arc 15:19:51
the last sit-down I had with an IoT manufacturer they said 10m units is what they consider base level, and any solution they consider should be able to scale to ten times that
Ge0rG 15:20:58
are there any xmpp installations handling north of 1m connections? I only remember WhatsApp's we-are-awesome post in that regard.
Tobias 15:21:15
really wonder if all those IoT devices need permanent connections
SamWhited 15:21:21
per-region domains is changing the security model. Also, it means if I live in the US, but I travel to China, I'm still connecting to my server in the US (or whatever domain I registered on). We were talking about single domains, multiple-domains is a completely different thing.
Ge0rG 15:21:25
Tobias: of course they do!
arc 15:21:33
Tobias: for receiving input, yes. though they're not very active.
SamWhited 15:21:40
Ge0rG: I can't give exact figures (and don't know them anyways), but I'm pretty sure we (HipChat) are.
SamWhited 15:21:56
(and we also use the server-assigned-resource-part-for-routing solution, FWIW)
MattJ 15:21:56
FWIW Prosody's clustering will use the resource for internal routing purposes
arc 15:21:57
in one case a device wanted to send a "heartbeat" with 12 bytes of data every 6 seconds (1/10th of a minute)
Ge0rG 15:22:14
arc: that's a very intensive use case
arc 15:22:32
Ge0rG: yes, and each device having a retail price of around $15
arc 15:22:45
that's the future we face and have to plan for
Kev 15:22:50
I like that Arc has such a high opinion of our maths that he had to explain that 6 seconds was 1/10 of a minute :D
arc 15:23:13
Kev: sorry i haven't had my tea yet lol
jonasw 15:24:10
tea <3
Guus 15:24:23
I for one wonder how many seconds 2/10 of a minute is.
arc 15:24:32
I will readily admit that a 100m service blitzed my brain out. I mean, sure we can toss around big numbers like it's nothing, but those are actually some significant engineering challenges.
Kev 15:24:55
arc: It undoubtedly is.
arc 15:25:20
at that rate you need dedicated S2S routers. and questions like where are the heartbeats routing to
Ge0rG 15:25:22
I could also imagine that a 100m IoT deployment has different requirements than a public chat service
Ge0rG 15:26:15
(and also probably different sysop challenges, where having a resource string as a debug tag is less useful)
arc 15:27:30
absolutely. from the EXI side those stanzas are extremely small. as long as the 12 bytes of data are encoded in int or float attributes within their custom schema, the whole stanza could be around 16 bytes. and since the devices will be communicating with a finite number of other devices, mostly on the same LAN..
arc 15:28:32
my recommendation was embed their XMPP server in their 802.15.4 to wifi gateway module, to keep a majority of the traffic local and reduce their service end traffic as a first point. which i think is what they're doing
MattJ 15:28:34
Ge0rG, client-provided debug tags aren't guaranteed to be unique, I'm really unconvinced by your argument
SamWhited 15:29:31
I've also come to the conclusion that agreeing to compromise on that basis was a mistake… if you were using the resource part as a debug tag you were using a quick hack; if that's a thing we want, we need a real solution, we don't need to make a part of the JID more complicated just so someone can see something in existing logs.
SamWhited 15:30:01
Adding stuff to the JID that isn't related to routing is changing the purpose of JIDs, and that feels like a bad idea.
Ge0rG 15:31:15
MattJ: a properly implemented client can provide sufficient uniqueness.
MattJ 15:31:36
Ge0rG, you're not a server developer, clearly :)
SamWhited 15:31:54
As a general rule of thumb I don't think we should ever have to rely on a "properly implemented" client.
MattJ 15:31:58
Indeed
arc 15:31:58
SamWhited: from the EXI side it doesn't matter. the entire JID is one string in the string table. i think having a human readable (aka designed for the UI) resource after the # makes some sense. though, that could also be done through pep
Ge0rG 15:32:12
MattJ: but I know a little bit about client development
jonasw 15:32:20
at this point I tend to agree with SamWhited. for debugging, there really should be something else, like an additional optional stream header which can be used for debugging, or a stream feature to attach a debug identifier to a stream or use <identity/> as soon as it’s available.
MattJ 15:32:39
Ge0rG, it's a nice idea, for you, with your client. But in the real world, on a real server, we can't depend on every client being Yaxim
arc 15:32:50
wouldn't this make sense to attach to PEP?
MattJ 15:32:55
I totally get why you want a debug tag, and let's do that. But I think it's separate to the resource
SamWhited 15:33:01
Or just some form of fingerprint the server constructs (so that the client doesn't have to do anything), eg. maybe it queries the client for its disco#info, and then hashes that along with the JID and any other info it can get and uses that to track sessions
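One way such a server-built fingerprint might look, as a sketch only: hash the bare JID together with the sorted disco#info identities and features. The function name, the input shapes and the truncation to 16 hex characters are all assumptions for illustration, not anything specified in the discussion.

```python
import hashlib
from typing import Iterable


def session_fingerprint(bare_jid: str,
                        identities: Iterable[str],
                        features: Iterable[str]) -> str:
    """Derive a short log tag from data the server can obtain on its own."""
    h = hashlib.sha256()
    # Sort so the tag does not depend on the order the client listed things in;
    # the empty string separates the identity list from the feature list.
    for part in [bare_jid, *sorted(identities), "", *sorted(features)]:
        h.update(part.encode("utf-8"))
        h.update(b"\x00")  # field separator to avoid ambiguous concatenation
    return h.hexdigest()[:16]


fp_a = session_fingerprint("user@example.org",
                           ["client/phone//ExampleClient"],
                           ["urn:xmpp:ping", "http://jabber.org/protocol/disco#info"])
fp_b = session_fingerprint("user@example.org",
                           ["client/phone//ExampleClient"],
                           ["http://jabber.org/protocol/disco#info", "urn:xmpp:ping"])
fp_other = session_fingerprint("other@example.org",
                               ["client/phone//ExampleClient"],
                               ["urn:xmpp:ping", "http://jabber.org/protocol/disco#info"])
```

As the following messages point out, this identifies a kind of client rather than a single session, so two identical clients of the same user would share a tag.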
MattJ 15:33:05
arc, no, because PEP is per user, not per client
arc 15:33:38
MattJ: couldn't the PEP .. sorry still early .. list a resource to human readable lookup?
jonasw 15:33:40
SamWhited: for a single session, a server can just roll a random number.
Ge0rG 15:33:44
MattJ: let me rephrase your suggestion: let's create a nice perfect future debug tag sometime in the remote future, and remove the existing and working debug tag right now.
SamWhited 15:33:56
jonasw: ah, yah, I guess this is about tracking clients, not sessions. oops.
SamWhited 15:34:18
The existing and working debug tag that breaks more critical parts of the system and makes everything more complicated.
jonasw 15:34:37
for clients use <identity/> as soon as it's available and log it to associate the identity with the session nonce in the logs.
jonasw 15:34:46
identity + bare jid probably
SamWhited 15:34:47
And requires that clients do a specific thing which they may or may not actually do.
MattJ 15:34:47
Ge0rG, given that you're currently the only person I've seen suggesting that the resource string can and should be used this way, I don't think we're anywhere near your ideal being reality either
MattJ 15:35:40
i.e. other clients don't use the resource this way, you do. You'll update to use the debug tag, they won't
SamWhited 15:35:49
I remember at summit people complained that identity couldn't be used for this, but I don't remember why? What jonasw suggested sounds sensible, and works today.
jonasw 15:35:58
I see that the resource is *currently* a nice way to track a client in debug logs; but BIND 2.0 won’t be there tomorrow. There’s plenty of time for server devs to adapt. This could easily be part of the UX considerations for sysops in BIND 2.0
jonasw 15:36:06
(s/BIND/Bind/?)
MattJ 15:36:43
I'd be fine (and glad) to include some kind of unique client identifier in bind2
Ge0rG 15:36:44
MattJ: I don't know how many sysops of public servers are active in this MUC
jonasw 15:36:49
or even include a "debug identifier" in Bind 2.0 which is never ever exposed to anything but server logs. although I think a stream header would be nicer because it allows tracking even before authentication succeeded.
jonasw 15:37:03
ha, MattJ beat me to it
MattJ 15:37:05
and with bind1 clients, use their provided resource as a cookie, and then use something else for the actual resource
Zash 15:37:24
What is it with you and writing lots of text while I'm out on a walk?
MattJ 15:37:26
(sorry, cookie == debug tag in my mind)
jonasw 15:37:39
MattJ: makes sense
SamWhited nods 15:37:45
jonasw 15:37:49
sounds like a very useful way forward
MattJ 15:37:49
Zash, you should take your phone, to make sure you never miss a message!
Zash 15:38:13
I did, for photos of all the snow
Ge0rG 15:38:16
MattJ: I want to be able to easily grep my logs for certain things, and to get all traffic exchanged with a given client instance (including re-auth and 0198 resumption)
jonasw 15:38:23
(this discussion also pins me to a chair in a waiting room where I wanted to leave 20 minutes ago, but whatever)
arc 15:38:28
phone? he should always wear Glass so this room is constantly flowing above his eyeball
Ge0rG 15:38:42
MattJ: or to get all traffic exchanged with a certain client software.
Tobias 15:38:54
Zash, how much ❄?
jonasw 15:38:54
Ge0rG: I think you actually want structured logs
MattJ 15:39:17
I want to submit pull requests to all other clients to change their default resource string to "yaximXYZ"
jonasw 15:39:18
cramming all those criteria in a single string isn’t doing any good
Zash 15:39:19
My position on resource selection is that the rules in xmpp-core are fine and don't need changing.
Zash 15:39:54
I agree with SamWhited that something else ought to be used for this kind of tracking and debugging.
Zash 15:40:30
Ge0rG: Would it satisfy you if we returned the log tag in the handshake somehow?
Ge0rG 15:40:33
Zash: the rules in xmpp-core are sufficient indeed. As long as the server doesn't override what the client sends ;)
arc 15:41:05
the more i think about it, the less i think about this as an issue of debugging, but more of the use case where you want your contacts to be able to specifically reach you on your laptop vs phone vs whatever
arc 15:41:23
that was brought up at the summit, i dont remember by who
SamWhited 15:41:24
The rules in xmpp-core would be fine, except that if you let clients "set" a thing, they're going to stop reading the RFC at that point and assume that's the JID they get. In my mind the rules should be "the server sets the resource part, it's opaque to clients, and the clients get no say in it"
Zash 15:41:24
arc: That is doable via disco#info
jonasw 15:41:34
Ge0rG: what about the following: 1. bind 2.0 allows for a "debug tag" 2. servers are strongly encouraged (via UX considerations in the bind 2.0 xep) to include that debug tag to every log message related to that client ?
Zash 15:42:01
SamWhited: The client gets to make a suggestion, but the server decides. Similar to how extensions and stuff work in TLS.
SamWhited 15:42:03
Because it's for *routing* which is strictly a server concern.
SamWhited 15:42:31
Zash: Yah, I wouldn't mind that, except it seems to be a source of bugs because clients don't actually pay attention to the server's decision
SamWhited 15:42:43
Or at least, that's what it sounded like at summit.
arc 15:42:48
SamWhited: most client authors AFAICT don't write to the rfc, they use it as a rough guide and really write to a server
Ge0rG 15:42:51
SamWhited: there is still no consensus on whether that _routing_ info should be persistent for a given client instance or not.
Zash 15:43:11
arc: And that's how we get "but it works in Internet Explorer".
SamWhited 15:43:12
Ge0rG: Sure, but that's orthogonal (and probably up to the server / service)
SamWhited 15:43:21
arc: Indeed :(
Ge0rG 15:43:37
SamWhited: actually it's related, because the client is the only one that knows its identity on a reconnect
jonasw 15:43:39
there should be a way to cause pain to those who do that, arc
Zash 15:44:16
Ge0rG: Have you thought about my suggestion of including a namespaced attribute on the stream header? That's greppable in logs, which gives you the session's log tag, which you can then grep for.
arc 15:44:34
jonasw: a network testing script which tests a client or service for compliance
arc 15:44:54
starting with "fun" things like sending <stream:stream version="2.0">
Zash 15:45:09
Are there any security issues with using the stream ID as tag in logging?
SamWhited 15:45:10
Ge0rG: Ah, yah, fair enough, I guess you can't really separate that from the client's control.
arc 15:45:28
and using custom prefixes.
Ge0rG 15:45:35
Zash: I want to reduce the number of IDs, not increase it.
arc 15:46:36
just basically go through the RFC for every MUST and SHOULD, write a test for that case, and failed MUSTs show up as red, while failed SHOULDs appear in yellow - any client failing to (eg) accept a resource from the server that differs from the one it requested would show up this way
arc 15:46:55
and if you provide it, and its something client authors can find, they will almost certainly use it.
Ge0rG 15:46:55
Sorry, I'm in a meeting currently, and I'm heavily sleep-deprived. Can't focus on the discussion here.
Zash 15:47:16
arc: FWIW I don't think the client needs to know its own resource in that many cases.
arc 15:47:59
sure but can you think of a case where a client not understanding its resource correctly would cause a fault that you could test for on the server side?
Zash 15:49:17
Strip out the 'to' attribute on everything you send, see how the client reacts.
jonasw 15:49:42
as a client, I don’t care about the to a server sends me
arc 15:49:56
yea isnt it legal to do that?
Zash 15:50:27
No 'to' attribute is supposed to be semantically equivalent to to=full JID
arc 15:50:36
i mean i guess you could test an iq ping addressed to nobody, to the client by a random resource, to the client's requested resource, and to the client's given resource
Zash 15:50:43
Or the bare JID in the other direction
arc 15:52:34
replying to a ping that's misaddressed should at least be a warning, tho in that case it'd often be hard to say whether it was understanding its resource correctly or not
arc 15:52:53
but if it only replied to its requested resource but not its given resource..
Zash 15:52:58
Isn't that an error on the servers part?
arc 15:53:23
Zash: test servers must send bad data. thats the point.
Zash 15:53:57
There's been a bunch of security issues related to not validating the 'from' on certain stanzas, like roster requests and such.
arc 15:54:32
the point of a test suite isnt to test whether a client behaves correctly with typical data to a properly functioning xmpp server. the point is to test whether it behaves according to the RFC, so in many cases the client would - i assume - need to close the connection and reconnect.
jonasw 15:54:37
yeah, but from is not to
arc 15:55:02
or send an <iq type='error'> or etc
arc 15:56:04
i mean i above proposed one of the first tests would be <stream:stream version='2.0'> to check that the client is actually parsing the stream version according to the RFC. it should reject the connection, right there
Zash 15:58:13
arc: https://modules.prosody.im/mod_conformance_restricted.html may be of interest to you
arc 16:00:48
Zash: i'll look at it
arc 16:00:57
but does it send intentionally bad data to test?
arc 16:01:19
I have a utf8 test suite I'd *love* to see how both clients and servers respond to
Zash 16:01:24
Yes, sends XML things forbidden by the RFC
jonasw 16:01:24
sending PIs is bad data i guess :-)
jonasw 16:02:21
damn i need tobunload csi
jonasw 16:02:36
*to unload
arc 16:02:57
Zash: have you tested for UTF-8? what happens when NULL is in the middle of a stanza, say in the <message><body>? or ending a <message><body> with a chr(148) followed by </body>
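The two cases arc mentions fail for different reasons, which a small check makes visible: a lone byte 148 (0x94) is not valid UTF-8, while a NUL byte is valid UTF-8 but not a character XML 1.0 allows. The sketch below assumes the raw bytes of the text content are available; the function name is hypothetical and it is not taken from any existing test suite.

```python
def stanza_text_problem(raw: bytes):
    """Return a description of the first problem found, or None if the bytes are fine."""
    try:
        text = raw.decode("utf-8")  # strict decoding rejects malformed sequences
    except UnicodeDecodeError as exc:
        return f"invalid UTF-8 at byte {exc.start}"
    for index, ch in enumerate(text):
        cp = ord(ch)
        # The Char production of XML 1.0.
        allowed = (
            cp in (0x9, 0xA, 0xD)
            or 0x20 <= cp <= 0xD7FF
            or 0xE000 <= cp <= 0xFFFD
            or 0x10000 <= cp <= 0x10FFFF
        )
        if not allowed:
            return f"character U+{cp:04X} at index {index} is not allowed in XML 1.0"
    return None


print(stanza_text_problem(b"hello\x00world"))  # NUL in the middle of a body
print(stanza_text_problem(b"hello" + bytes([148])))  # body ending with chr(148)
print(stanza_text_problem("hello".encode("utf-8")))
```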
Zash 16:03:00
arc: Have we had the conversation about IDNA versions and PRECIS and how the only reasonable thing to do is crawl down under one's desk and cry?
arc 16:03:21
Zash: no but it sounds like a conversation id love to have ;-)
SamWhited 16:03:42
Heh, this is true.
Ge0rG 16:04:02
arc: yay! please tell me if Unicode Robot Face (🤖 U+1F916) is a legal resource character
SamWhited 16:04:04
and Unicode, and UTF-8, and natural languages
arc 16:04:30
Ge0rG: I don't know but i'd love to find out!
SamWhited 16:04:41
I'm almost certain it is; I can go check if you really want.
arc 16:05:02
i discovered that GNU Screen has some deep UTF8 issues, as does Synergy
arc 16:05:15
I started digging in and found lower level libraries were at fault
arc 16:05:39
GNU Screen only handles 1 and 2 byte unicode
arc 16:05:51
internally it was using UCS2
Zash 16:06:38
Like how MySQL has something called "utf8" which only supports up to 3 byte UTF-8 sequences?
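To make the size limits concrete: Robot Face (U+1F916) lies outside the Basic Multilingual Plane, so it takes four bytes in UTF-8 and a surrogate pair in UTF-16. The two helper names below are made up for illustration; they only model the limits described in the chat (a 3-byte UTF-8 cap and a UCS-2 style 16-bit cap), not the actual behaviour of MySQL or GNU Screen.

```python
robot = "\U0001F916"  # Robot Face

utf8 = robot.encode("utf-8")
utf16 = robot.encode("utf-16-be")
print(utf8.hex(), len(utf8))        # the UTF-8 bytes and their count
print(utf16.hex(), len(utf16) // 2)  # the UTF-16 code units and their count


def fits_three_byte_utf8(text):
    """True if every character needs at most three UTF-8 bytes."""
    return all(len(ch.encode("utf-8")) <= 3 for ch in text)


def fits_ucs2(text):
    """True if every character fits in a single 16-bit code unit."""
    return all(ord(ch) <= 0xFFFF for ch in text)


print(fits_three_byte_utf8("snow \u2744"), fits_three_byte_utf8(robot))
print(fits_ucs2("snow \u2744"), fits_ucs2(robot))
```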
arc 16:08:03
heh
SamWhited 16:08:04
Yup, it's valid
arc 16:08:24
i think SamWhited cheated
Zash 16:08:34
arc: GNU libidn and IBM ICU behave differently when given Unicode outside of Unicode 3.something or whatever was state of the art at the time. One accepts. One rejects. Much fun.
SamWhited 16:08:45
https://gist.github.com/SamWhited/cc6fd0a9c0a1559c71f828f6b6c8b729#file-validjid-go
SamWhited 16:09:04
That JID implementation is using a very well tested PRECIS implementation that's built with Unicode 9
arc 16:09:21
Mr Miller *IS* in the DC area, we're setting up a time for coffee
MattJ 16:10:13
^5
Ge0rG 16:22:59
Now I wish I could have Robot Face as a SRVName SAN in a LE cert
Zash 16:26:28
Ge0rG: Nice things, they are unobtainable.
Ge0rG 16:27:21
Zash: like Unobtainium?
SamWhited 16:28:21
Oh no, Unobtanium is much more attainable than nice things.
Ge0rG 16:28:30
Bummer.
Ge0rG 16:28:43
BTW, why is the Board Meeting over now?
Zash 16:29:01
It was the board meeting to end all board meetings
Ge0rG 16:30:08
Zash: I think it only ended three of them.
arc 16:50:46
lol
arc 16:51:40
so today's joy on the FLOSS Foundations mailing list is the announcement of the new Open Fashion Foundation, quote, "to disrupt fashion industry with lessons learned from computing industry."
Zash 16:52:19
Aaaawhat who let this override browser shortcuts?!
SamWhited 16:52:19
So they're going to spend all their time adding new features to clothes and ignoring the fact that the clothes are unraveling and falling off?
Zash throws things at LE's discuss thing 16:52:38
arc 16:52:56
SamWhited: lol
arc 16:54:00
this is one I won't even be cynical about. It's just a pure bundle of joy that someone out there has made FOSS licensed fashion a personal mission in their life
SamWhited 16:54:22
ahem, yes, sorry about that. I mean, "good for them" :)
arc 16:54:54
can you imagine a fashion show hosted by this organization?
Zash 16:55:58
The latest in beard and ribs fashion?
arc 16:57:19
"This piece by Manuel Debrough, available under the Apache 2.0 license from github..."
arc 16:58:02
Zash: oh no, dollars to donuts I'm willing to bet a fabulous gay man is behind this.
SamWhited 16:58:29
heh, I have a bit of a guilty pleasure in that I really enjoy fashion stuff (even though I know nothing about it, which is probably obvious if you've ever seen the way I dress), so that actually sounds pretty nifty
SamWhited 16:58:41
But I do enjoy seeing the things people come up with
arc 16:58:56
actually I can see them trying to QueerEye geek's tshirt and jeans
SamWhited 16:59:13
aww yeah, I'm gonna be fashionable for once
arc 17:00:06
the rugby club I started 4 years ago in DC just raised over $2500 in one night hosting a drag show.
arc 17:01:18
https://goo.gl/photos/XEKE5peqYG2b4gfb7
arc 17:03:37
when I mentioned this on IRC, one of my friends with the Gnome foundation immediately said they needed to run a drag show, and had people volunteering. The thought of that alone is priceless.
arc 17:04:11
so yea I can see a geek fashion show, especially in san francisco
arc 17:04:32
they could raise thousands for charity too
dwd 17:05:42
I can see "designer-stained t-shirts" and "artful crumpling" becoming a thing.
SamWhited 17:06:27
Hah, indeed. I'm going to start a new line: "morning coffee spill"
dwd 17:07:37
"Bob wears jeans (model's own) and a t-shirt (free from some conference)"
arc 17:07:58
dwd: have you ever watched queer eye?
dwd 17:10:39
arc, Can't say I have.
moparisthebest 17:11:02
gah I hate that, I have jeans with holes worn in them by myself by working before that was in fashion, and now I don't want to wear them for fear of people thinking I'm trying to be fashionable...
arc 17:11:54
https://www.youtube.com/watch?v=g5dZ4QG7dW0 most of the men they makeover are shaggy geeks. they turn them metro. in almost every case the man starts with tshirt and jeans, and they end up posh with a new haircut, product, etc - also with their house/office made over.
SamWhited 17:11:57
hipster moparisthebest was into jeans and t-shirts before they got all popular
moparisthebest 17:12:15
:(
dwd 17:12:22
moparisthebest, You're way older than I thought, then. I recall holes in jeans being fashionable, and that was when my mum bought me clothes.
moparisthebest 17:12:54
I seriously still wear the same jeans and t-shirts I wore when I was 18 and stuff, my wife tries to throw them away all the time lol
dwd 17:13:01
arc, See, I don't need that. I *can* dress up. I just usually *don't*.
moparisthebest 17:13:11
dwd, oh maybe it went out of style and back in, or I just didn't know about it, I'm 31 :P
arc 17:13:15
https://youtu.be/g5dZ4QG7dW0?t=11m25s is where they bring this one guy to buy fashionable denim to replace his "jeans"
arc 17:14:38
dwd: nor I. but its a great visual
arc 17:15:26
this is more like my husband and I: https://www.youtube.com/watch?v=kbf_nFtA8YQ
dwd 17:16:05
moparisthebest, Yeah, I'll be 43 soon, and I suspect my mother was telling me ripped jeans should just be replaced at about the time you were born, then...
arc 17:19:38
its funny, i have a tshirt and jeans policy - and have gotten a lot more traction with it than otherwise.
arc 17:20:00
also the beard. the bigger the beard, the more they think you know. John "Maddog" Hall taught me that trick
jonasw 17:46:46
that’s some unexpected backlog
Ge0rG 17:48:22
so much text, so laggy connection.
jonasw 17:48:48
Ge0rG: barely worth it if you’re not into fashion. most likely not worth it on your 30% loss link there.
Ge0rG 17:49:34
the link already feels like 20%. Looks like it's improving. I even have sub-second latency.
Ge0rG 17:51:34
Maybe I should fire up Gajim to see how it behaves with MSN and high-latency links.
arc 18:27:17
heh
arc 18:27:34
If I have the joy of reading about Open Fashion Foundation today so should all of you ;-)
jonasw 18:30:23
is there a section in a usual XEP where I can put notes on alternative variants I considered but eventually decided against? much like PEPs have, for example here: <https://www.python.org/dev/peps/pep-0448/#variations>? otherwise I might add a Design Considerations section…
Ge0rG 18:31:27
jonasw: +1 for Design Considerations
moparisthebest 18:31:32
that sounds right to me
Zash 18:32:22
# requirements
it needs to do the thing
# discussion
we could do something, but that has these problems
we could do something else, which seems pretty good, so the rest of the spec is about this
Ge0rG 18:32:31
I think that every XEP should contain its rationale.
Zash 18:32:35
+1
jonasw 18:32:38
yESSSSss
jonasw 18:33:34
Zash: hm, PEPs do it differently: requirements, then spec, then other variants. I actually like that, because when I implement something, I don’t need to read the other variants. If I want to know why the other variants were rejected, I can skip to that section. thoughts?
Zash 18:34:22
No it should start with the schema! :)
jonasw 18:34:44
ah, I wish one could rely on schemas in XEPs.
moparisthebest 18:34:59
my ideal documentation would just start with already written code :)
jonasw 18:35:09
moparisthebest: no.
moparisthebest 18:35:17
in the language I'm using
moparisthebest 18:35:25
and it has to magically know that beforehand
moparisthebest 18:35:31
yea I'm joking sorry :)
jonasw 18:35:36
:-)
moparisthebest 18:35:45
I agree with you about that PEP order jonasw
Zash 18:35:54
Language Specification: What the code does is correct. EOF
moparisthebest 18:36:09
right :)
jonasw 18:36:10
:D
moparisthebest 18:36:29
if you think you found a bug you are mistaken, it's actually a feature
jonasw 18:36:41
#php
moparisthebest 18:37:18
and it's apparently worked for xep115 for 10 years right?
Zash 18:37:32
Is fine, don't worry
moparisthebest 18:38:04
... why did I automatically read what Zash just said with a russian accent?
jonasw 18:38:31
https://www.youtube.com/watch?v=rp8hvyjZWHs (Trust me, i’m an engineer !)
Ge0rG 18:41:03
Hm. I need to youtube-dl that so I can watch it. ETA: 12:51
jonasw 18:41:33
don’t.
moparisthebest 18:41:39
some of those things are actually awesome
moparisthebest 18:41:46
the backhoe rowing the boat for example
Ge0rG 18:42:59
jonasw: alternatively, you could stream it to the MUC with libcaca and LMC.
jonasw 18:43:15
Ge0rG: my client cannot into LMC
Ge0rG 18:43:53
I'm sure mathieui would be glad to provide a video streaming plugin for poezio :D
jonasw 18:44:51
Tobias: you mentioned earlier that a server could cache xep115 responses for those specific disco#info nodes.
jonasw 18:44:58
I wonder whether that’s a great idea after all.
jonasw 18:45:18
I was wondering whether it has any privacy implications for a client.
jonasw 18:45:29
(on behalf of whom the server is answering)
Zash 18:46:09
jonasw: You may be able to guess that the server has seen a disco#info before through timing
jonasw 18:46:40
Zash: well, yes, but let's assume that the fact that a server has seen that disco isn’t revealing anything, for example because all servers use the capsdb.
jonasw 18:47:59
I wonder whether it would be okay for a server to reply on behalf of a client if the client is not actually online. While that would prevent any unintended presence leaks if the server answers for a resource which would by itself not have answered to that specific asker, it has the downside that stuff may be confused if a server answers a request for a resource which isn’t even online.
Tobias 18:48:50
jonasw, as long as you don't have an extremely user-specific client feature set, that shouldn't be an issue
SamWhited 18:49:27
I don't think it's a problem because it's generally up to the server to enforce permissions / decide who can query what anyways, not the client.
SamWhited 18:49:44
So your server SHOULD be taking precautions to prevent presence from leaking anyhow
SamWhited 18:49:50
(or whatever is being queried)
jonasw 19:08:25
what are the criteria for an xsd to appear here? <https://xmpp.org/schemas/>
Ge0rG 19:08:34
NoooOooOOOooo!
[download] 87.6% of 3.35MiB at 45.18KiB/s ETA 00:09
ERROR: unable to download video data: [Errno 104] Connection reset by peer
jonasw 19:08:41
Ge0rG: youtube-dl can resume :)
MattJ 19:08:45
Ge0rG, it supports resum...
MattJ 19:08:47
:)
MattJ 19:09:02
I should know. Are you using my wifi by any chance? :)
Zash 19:09:19
MattJ: You have wifi?!
MattJ 19:09:40
Too many complaints from "smart"phone users in the house to resist any longer
Ge0rG 19:09:58
MattJ: free WiFi on a crowded train, moving at 200km/h
moparisthebest 19:13:12
kind of amazing that works at all
arc 19:22:43
jonasw: https://youtu.be/rp8hvyjZWHs?t=2m37s has got to be the best hack I've seen in a long time
jonasw 19:23:31
:D
moparisthebest 19:24:17
arc, the rowing backhoe? yea that impressed me the most
arc 19:24:35
yea..
moparisthebest 19:25:01
there is no arguing with that one, boat motor breaks, have a backhoe on board, it's ingenious
arc 19:25:52
i thought my use of a toilet fill valve in a bucket for plant watering was good
Zash 19:25:53
I don't usually have a backhoe on board
arc 19:26:04
this is a whole new level
dwd 19:26:16
Zash, So what do you do if your motor breaks?
moparisthebest 19:26:48
probably something boring like an oar
Zash 19:27:47
I guess I would have to convert it into a putt putt boat
Zash 19:28:37
I would also have to get a boat and a motor...
arc 19:28:51
what if you had a car onboard and could get it up on jacks
moparisthebest 19:29:26
change out the wheels for paddles like an old river boat?
dwd 19:29:28
arc, If he doesn't even have a boat he's got worse problems.
SamWhited 19:32:31
arc: Like this (sort of)? https://www.youtube.com/watch?v=dyBl9vf8Td0
arc 19:32:49
thats true. Zash how will you hack up a boat to start with?
Zash 19:34:04
But why would I have a boat? Not really a water person.
Zash 19:34:52
I'd rather have cabin in the woods and some potatoes. Backhoe would come in handy then.
arc 19:35:14
Oh, I *really* doubt that you want to have cabin in the woods
arc 19:37:00
https://www.youtube.com/watch?v=NsIilFNNmkY
ThurahT 19:37:03
true, there are nicer things than a portal to a demi-god-demon
Zash 19:40:25
Can't be worse than the mosquitoes
arc 19:41:02
i'll take mosquitos over the horrific monsters they send to kill you
arc 19:41:17
and what rises if EVERYONE fails
moparisthebest 19:41:37
I think I'd prefer the things I could kill with guns
arc 19:42:14
I think the scene of the japanese school children circling around and dispelling the demon is the best
arc 19:44:21
https://www.youtube.com/watch?v=IIE8Fq4Zm1E
arc 19:45:10
"The spirit of the demon will now live in the happy frog!" ... "How hard is it to kill a group of 9 year olds?"
moparisthebest 19:45:18
this has been an odd day in the xsf, went from talking about fashion, to boats rowed by backhoes, to demons in cabins in the woods
arc 19:45:32
blame me.
moparisthebest 19:45:37
with some xmpp sprinkled in :)
arc 19:45:59
yea there's XMPP involved, that's all that matters. That means we can charge lunch to the corporate card right?
Ge0rG blames arc. 19:47:21
arc 19:47:24
after all the work I did I realized this morning that the hash function isnt likely all that useful for embedded systems, and in 95%+ of the cases won't even get included in the binary
arc 19:47:40
embedded xmpp is unlikely to include text xml.
Ge0rG 19:47:47
It ain't no fun with the lags.
arc 19:48:42
the hash function is used pretty much, if not entirely exclusively for hashing text strings in order to find a corresponding match on the string table
arc 19:49:11
anyone else have a problem that you dig so deep into a problem that you lose sight of the big picture?
SamWhited 19:49:28
oh yes… frequently.
Ge0rG 19:51:37
When I dig too deep into a problem I always encounter sub problems to which there is no documented solution on the Internets, but often many people having the same issue.
MattJ 19:51:55
Don't get me started, today has been one of those
arc 19:51:56
i hate that.
Zash 19:52:25
I still got some glibc in my eye from yesterday.
arc 19:52:26
or you dig deep enough that you realize its a problem caused by the language you're using that can't be fixed, just.. worked around
MattJ 19:52:31
e.g. the moment when I realised (after putting log statements all over the place) that the testing tool I was using was broken, and connecting to the wrong server
MattJ 19:52:42
(in production)
Zash 19:52:55
Why isn't getrandom() in glibc until like the latest bleeding edge version nobody has?
MattJ 19:53:02
and the rabbit hole just goes deeper
MattJ 19:53:39
and now I'm just looking for some utility that will read lines from stdin and send them somewhere as UDP packets
MattJ 19:54:00
and trying to pretend I don't need to write my own
Zash 19:54:09
netcat
MattJ 19:54:28
netcat failed on the "line" part
arc 19:54:29
my first "in office" job had two charming things; 1) a ban on coffee in the office (only green tea, because of management philosophy hogwash), and 2) "Eat Me" cookies in a sealed container in the break room for when you get trapped too deep in a rabbit hole
arc 19:56:28
it took me far too long to realize the reference
Zash 19:56:42
hah
arc 19:58:46
I also found that for every schema i could think of, bitpacked EXI is better, faster, and smaller binary than compressed EXI
arc 19:59:24
i didnt expect that.
moparisthebest 19:59:32
that's just a type of compression though isn't it?
Zash 19:59:47
What's compressed EXI?
moparisthebest 19:59:48
like it'd probably be equally susceptible to CRIME / BREACH type attacks?
arc 19:59:49
I guess you could call bitpacking a form of compression..
arc 20:00:06
Zash: so there's 4 modes for EXI; bitpacked, simple byte-aligned, pre-compression, and compression.
arc 20:00:34
byte-aligned is essentially the same as bitpacked but always padded to byte alignment, obviously
Zash 20:00:34
Can you explain them in terms of ASN.1 encoding schemes? :)
arc 20:00:57
compression is pre-compression plus DEFLATE
Zash 20:00:57
(that was a fun rabbit hole too)
moparisthebest 20:01:30
so which ones are secure under encryption? only pre-compression?
arc 20:01:49
pre-compression is byte-aligned, but with similar types of data grouped together on the stream. so eg all int values are together, all string values together, etc
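A toy illustration of the difference between the first two modes as arc describes them. It only counts bits for a list of field widths; the widths are invented and this is not the EXI encoding itself.

```python
def bit_packed_size(widths):
    """Bytes needed when fields are written back to back, padded once at the end."""
    total_bits = sum(widths)
    return (total_bits + 7) // 8


def byte_aligned_size(widths):
    """Bytes needed when every field is padded to a byte boundary."""
    return sum((width + 7) // 8 for width in widths)


# Four hypothetical fields of 1, 3, 2 and 5 bits.
widths = [1, 3, 2, 5]
print(bit_packed_size(widths), byte_aligned_size(widths))
```

In this model, pre-compression would keep the byte-aligned sizes but reorder the values so that similar ones sit together, and compression would then run DEFLATE over that output.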
arc 20:02:04
i wouldn't propose to know the answer to that moparisthebest
jonasw 20:02:04
MattJ: socat READLINE: UDP:?
moparisthebest 20:02:34
arc, probably should have someone figure it out before starting to use/promote it though?
arc 20:02:36
but the idea with pre-compression is that some form of compression will be applied on, eg, the TLS layer
MattJ 20:02:39
jonasw, I saw that, but READLINE seems to actually involve the readline library, i.e. it's intended for human input, not piping from another program
jonasw 20:02:50
MattJ: and STDIN doesn’t do the trick? :/
jonasw 20:02:58
*STDIO
moparisthebest 20:03:10
arc, I think most if not all TLS libs removed support for TLS level compression because it's woefully insecure
arc 20:03:25
moparisthebest: i can *barely* hold enough of the EXI specification in my head to work on it. i don't have room for encryption on top of it.
Ge0rG 20:03:37
New personal record. Sigh: 64 bytes from 141.44.1.1: icmp_seq=9 ttl=53 time=377539 ms
MattJ 20:03:40
jonasw, only if they split on lines (which I see no indication of)
jonasw 20:03:46
meh
moparisthebest 20:04:02
arc, and you shouldn't have to consider it at all as long as you don't do anything that makes it insecure, compression being one of those things
Ge0rG 20:04:03
MattJ: I'd write a small loop with scapy.py
MattJ 20:04:10
I found a utility, it just needs the correct command-line arguments
arc 20:04:11
but i would assume if you consider compression insecure, eg DEFLATE, Brotli, etc, then you would prefer bitpacked over all options
MattJ 20:04:13
lua -e'u=require"socket".udp() for line in io.lines() do u:sendto(line, os.getenv"HOST",os.getenv"PORT") end'
jonasw 20:04:27
python3 -c 'import os, socket, sys; s = socket.socket(socket.AF_INET, socket.SOCK_DGRAM); [s.sendto(l.rstrip("\n").encode(), (os.environ["HOST"], int(os.environ["PORT"]))) for l in sys.stdin]'?
jonasw 20:04:30
heh
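The one-liners above can be spelled out as a short script: read lines from stdin and send each one as its own UDP datagram. This is a sketch using only the Python standard library; the function name `send_lines` and the command-line handling are assumptions for illustration.

```python
import socket
import sys


def send_lines(lines, host, port):
    """Send each line (without its trailing newline) as one UDP datagram."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sent = 0
    try:
        for line in lines:
            payload = line.rstrip("\n").encode("utf-8")
            sock.sendto(payload, (host, port))
            sent += 1
    finally:
        sock.close()
    return sent


if __name__ == "__main__" and len(sys.argv) == 3:
    # Hypothetical usage: some-program | python3 udplines.py HOST PORT
    send_lines(sys.stdin, sys.argv[1], int(sys.argv[2]))
```

Iterating over `sys.stdin` splits on newlines, which is the "line" part that netcat and socat did not provide in the discussion.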
MattJ 20:04:39
Lua wins ;)
arc 20:05:07
at this point my primary concerns are the size of the embedded image. cutting text-domain XML out reduces the binary size of the library by about half. removing compression library is a pretty big win too.
jonasw 20:05:14
moparisthebest: CRIME and BREACH are based on the fact that the packet size changes depending on previously sent content, I doubt that this is the case with bit-packed, from the sound of the name :)
jonasw 20:05:19
but I haven’t looked into it, at all
arc 20:05:29
wolfssl is pretty small
jonasw 20:05:50
soo… now I have that xep-ecaps2.xml here, let’s check out xep-0001.xml on what I need to do next.
moparisthebest 20:05:53
arc, well compression is insecure because if an attacker can add the string "ar" to the payload and the size doesn't increase, then add the string "arc" and it still doesn't change, and build up from there, it can figure out what's under the encryption
moparisthebest 20:06:02
so if bit packing works in a similar way, it's equally insecure
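The size side channel moparisthebest describes can be shown with plain DEFLATE. In this sketch the secret and both guesses are made-up values, and `observed_size` stands in for what an eavesdropper could measure if encryption preserved the length of the compressed payload.

```python
import zlib

# Hypothetical secret that travels in the same compressed payload as attacker input.
SECRET = b"secret_token=7f3a9c1e5b2d4086"


def observed_size(attacker_bytes):
    """Length of the compressed payload, i.e. what the attacker gets to see."""
    return len(zlib.compress(SECRET + b";" + attacker_bytes))


right = observed_size(b"secret_token=7f3a9c1e5b2d4086")  # repeats the secret
wrong = observed_size(b"secret_token=e1d0c9b8a7f6e5d4")  # same length, wrong value
print(right, wrong, right < wrong)
```

The correct guess compresses better because the whole string becomes a back-reference to the secret. Real attacks refine a guess piece by piece instead of testing whole values, which is the "ar", then "arc" build-up mentioned above.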
Zash 20:06:22
jonasw: Print it on paper, fold a paper airplane and aim for SamWhited :)
moparisthebest 20:06:31
right jonasw I don't know either, just saying it's probably something that should be determined
arc 20:06:46
moparisthebest: hmm. no i don't think so. so the only way you could reverse engineer it would be exploiting the string table.
moparisthebest 20:06:52
like it'd be another useless thing to work on if it was proven as insecure as compression arc , idk
jonasw 20:06:59
moparisthebest: it also probably does not matter much for IoT-thing <-> gatewaything.
moparisthebest 20:07:22
yea it's pretty obvious security doesn't matter when it comes to IoT haha
jonasw 20:07:35
like Ge0rG quoted yesterday: "The S in IoT is for security"
SamWhited 20:07:47
It would actually be pretty awesome if XEPs were submitted that way…
arc 20:08:23
moparisthebest: ok, so string values are stored in the string table. this refers to whole strings only, but eg a JID you're communicating with would be added to the string table and referenced by id.
SamWhited 20:08:25
Please change the font to OCR-A or something first so I can scan it back in though.
jonasw 20:08:26
SamWhited: because you would not have to do any work, as paper planes don’t travel several thousand km?
SamWhited 20:08:43
jonasw: Says you; that just means you're not building a big enough paper airplane!
jonasw 20:08:55
SamWhited: we could also try XMPP over RFC 1149
SamWhited 20:09:07
heh, indeed
SamWhited 20:09:33
My favorite part is that there are Errata for that one.
jonasw 20:09:40
there was an actual implementation
moparisthebest 20:09:50
ok arc so the full payload size would increase with the strings you added, say "arc" would increase it 3 bytes, *unless* that FULL string was already in there, then it wouldn't increase at all? if I understood you correctly
moparisthebest 20:10:22
that at least would not let you incrementally guess strings like 'a' then 'ar' then 'arc' etc etc
arc 20:10:32
moparisthebest: so if you can send a string value containing a 3rd party JID that you want to know if that agent is already communicating with, AND you know the schema being used, then you can determine whether that agent has communicated with that JID already.
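A toy model of the whole-string oracle arc describes. The class and its byte costs are invented for illustration and do not follow the EXI specification; the only property taken from the discussion is that a string already in the table is replaced by a short reference, and that partial matches do not count.

```python
class ToyStringTable:
    """Whole-string table: a hit costs one byte, a miss costs the literal plus one."""

    def __init__(self):
        self.ids = {}

    def encoded_size(self, value):
        if value in self.ids:
            return 1  # pretend a reference to a known string fits in one byte
        self.ids[value] = len(self.ids)
        return 1 + len(value.encode("utf-8"))  # miss marker plus the literal


table = ToyStringTable()
first = table.encoded_size("alice@example.org")   # first use: sent as a literal
second = table.encoded_size("alice@example.org")  # repeat: only a reference
prefix = table.encoded_size("alice@example")      # a prefix is a miss, not a partial hit
print(first, second, prefix)
```

An observer who can inject a known JID and see the resulting size learns whether that exact string was used before, but cannot extend a guess one character at a time.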
moparisthebest 20:10:45
like you I don't know enough to say without a doubt that makes BREACH or CRIME not a problem, but it seems better to me...
arc 20:10:53
moparisthebest: yes. I do not recall a method for partial or combined strings
jonasw 20:10:57
assuming you can observe the network traffic between those entities, which may only be within the local wifi
Ge0rG 20:11:02
RFC1149 would be faster than my current link.
arc 20:11:08
i'm still loading the spec back into my head. but i remember that as a fault.
arc 20:11:33
one of my criticisms of EXI actually is the lack of a "list" type
moparisthebest 20:11:39
I'd feel better if someone like xnyhps said they'd reviewed it and it looked good to them :)
SamWhited 20:12:02
eeew, I just decided I should actually print EXI and read it… but it goes on forever.
arc 20:12:37
this comes up in some XML schemas such as SVG, where paths are made of collections of floats, ints, and characters separated by spaces
Zash 20:13:08
Hm, I should look at what a printer costs
arc 20:13:11
SamWhited: yea its not light reading. I recommend https://www.w3.org/TR/exi-primer/ to start with
SamWhited 20:13:18
arc: Thanks
jonasw 20:13:25
is there an email address where XEPs are supposed to go? the http://xmpp.org/xmpp-protocols/xmpp-extensions/submitting-a-xep/ page linked in XEP-1 404s
arc 20:13:36
that gives a very nice overview without sucking you into the details
SamWhited 20:13:47
jonasw: You can submit a PR on GitHub
jonasw 20:13:49
Zash: nothing, just "google" for one and ask the owner kindly to send you the printouts :)
Ge0rG 20:14:06
jonasw: you can make a PR of the XEP in inbox/
jonasw 20:14:08
SamWhited: which puts my xep in the inbox/ dir?
jonasw 20:14:09
right
SamWhited 20:14:13
jonasw: Yup
arc 20:14:20
you're younger than me, you might be able to handle it better, but ive had to segmentize the details so i dont get overwhelmed. its a lot to hold in your head at once
SamWhited 20:14:38
oh I doubt that; if you can't hold the entire spec in your head I doubt I have any chance
arc 20:14:54
that's flattering but I doubt its true. age wears down your memory
SamWhited 20:15:08
jonasw: See the other XEPs in there for naming, I *think* you don't want it to start with xep- for reasons that I can't remember… something, something tooling.
arc 20:15:11
I'm turning 38.
jonasw 20:15:39
yeah, figured that much
moparisthebest 20:16:04
speaking of the inbox, some of those things are *ancient*, does or should it ever be cleaned out?
jonasw 20:17:06
am I the only one *always* falling for the delay github has with showing the "you have pushed to branch X n minutes ago, do you want to pull request?", hitting F5, seeing it appear before the page has reloaded, click compare & pull request and then the page reloads and you’re back to square one?
SamWhited 20:17:10
I think the editor readme says it never gets cleaned out. We don't want to break old pages.
SamWhited 20:17:28
Oh yah, I do that all the time
moparisthebest 20:17:54
break pages? do they get rendered?
moparisthebest 20:18:16
or you just mean links to the xml ?
SamWhited 20:18:37
moparisthebest: they get rendered on the site, just like actual XEPs
moparisthebest 20:19:00
I didn't know that
jonasw 20:19:26
SamWhited: https://github.com/xsf/xeps/pull/440 consider yourself paperplaned also: https://www.youtube.com/watch?v=Co452wJ-3Lg (Long Distance Calling - Black Paper Planes) (Music)
moparisthebest 20:19:34
https://xmpp.org/extensions/inbox/
moparisthebest 20:19:36
awesome
SamWhited 20:19:51
moparisthebest: Also, ¿Porque no los dos?
SamWhited 20:20:08
(I couldn't find the adorable little girl gif to send, so you just get text)
arc 20:20:44
given the current status of IoT I think I might actually focus for a few weeks on *just* the schema compiler and get a XEP out for it. the one thing im missing for the XEP is a definition for the schema of the schema
SamWhited 20:21:00
> the schema of the schema
SamWhited 20:21:03
I'm so sorry…
jonasw 20:21:07
that meta
jonasw 20:21:15
arc: schemas like in XML Schemas for XEPs?
jonasw 20:21:27
how are you going to deal with the mostly incorrect or inaccurate schemas out here in XEPs?
jonasw 20:21:40
well, probably not mostly.
jonasw 20:21:50
but they’re not normative, I’ve been told once.
arc 20:21:56
yea, in order for a client to transfer to the server the schema that it wants to use, which the server doesnt already have, it needs to be able to dump the EXI-encoded schema to the server. and that needs to be defined since every client and server needs to be able to understand it
arc 20:23:08
so the EXI schema for the EXI schema needs to be defined in the XEP
jonasw 20:23:15
that’s meta.
arc 20:23:25
its why I havent touched the XEP yet.
arc 20:23:36
but it needs to happen, and sooner the better
jonasw hands arc a large bag of tea. 20:23:39
moparisthebest 20:24:53
sounds like he needs something harder to me
arc 20:25:13
i havent written a line of code in a month. i'm up for it.
moparisthebest 20:25:15
maybe 160+ proof
jonasw 20:25:19
there are too many movies showing that coke doesn’t end well.
jonasw 20:25:20
oh
jonasw 20:25:22
nevermind.
Zash 20:25:23
160+ proof tea?
arc 20:25:28
oh I have a copious amount of cannabis
arc 20:25:57
there's a "Balmer limit" to cannabis too, though.
jonasw 20:26:18
heh
arc 20:26:49
er "Ballmer Peak" https://xkcd.com/323/
jonasw 20:26:53
:D
dwd 20:26:56
jonasw, That your protoxep? ecaps2?
jonasw 20:27:04
dwd yes
arc 20:27:20
though its more a cliff. more is better, to a point, and then rapid degeneration. its around the point that you start feeling like time is on a bungee cord
dwd 20:27:30
jonasw, I think you win the prize for using every obscure separator character in the ASCII subset.
jonasw 20:27:38
dwd: thanks :D
jonasw 20:28:11
they were barely enough, I was worried I’d also need EOT
dwd 20:28:15
jonasw, Can those appear in XML?
jonasw 20:28:22
dwd: no.
jonasw 20:28:36
XML forbids control characters except htab, newline and carriage return
jonasw 20:28:47
(those between 0x00 and 0x20 at least)
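A small sketch of the rule jonasw states, limited to the range below 0x20: XML 1.0 allows only tab, line feed and carriage return there. That the separators in question are the ASCII information separators FS, GS, RS and US is an assumption; the log does not list them.

```python
# Control characters below 0x20 that XML 1.0 allows: tab, LF, CR.
XML10_ALLOWED_C0 = {0x09, 0x0A, 0x0D}

# ASCII information separators (assumed to be the ones meant in the chat).
SEPARATORS = {"FS": 0x1C, "GS": 0x1D, "RS": 0x1E, "US": 0x1F}


def c0_allowed_in_xml10(code_point):
    """True if a code point in 0x00-0x1F may appear in XML 1.0 content."""
    if not 0x00 <= code_point < 0x20:
        raise ValueError("only the range 0x00-0x1F is covered here")
    return code_point in XML10_ALLOWED_C0


# Names of the separators that can never occur inside XML 1.0 content.
forbidden = [name for name, cp in SEPARATORS.items()
             if not c0_allowed_in_xml10(cp)]
```

Because none of these code points can occur in the XML itself, they are safe as delimiters when serialising XML-derived data into a hash input.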
Ge0rG 20:28:57
Hm. There is an IoT thread going on with me in Cc. I wonder who deemed me so important and why.
dwd 20:29:22
jonasw, Perfect. Nicely done.
jonasw 20:29:30
dwd: thanks! :)
arc 20:29:34
Ge0rG: you are the chosen one for IoT. you must lead the way, because everyone knows nobody else knows it
dwd 20:29:51
arc, IoT is different and special from everything else.
Ge0rG 20:30:19
arc: this must be a SCAM.
arc 20:31:16
I'm humored by these IoT "Meetups" full of VCs who think IoT means a standalone device that communicates solely with their service, like a modern wifi-connected thermometer that you can control with your phone through their online service
Zash 20:32:48
jonasw: "Cabability"
arc 20:33:04
in that ideology things like protocol standards don't matter. they mostly use a HTTP ReST API between the device and their service
dwd 20:33:06
arc, The sad thing is that most of these devices are going that way.
jonasw 20:33:19
Zash: that’s only because you cannot use entities in <dt>! thanks, fixed locally, waiting for more of these stupid typos before I push another commit.
arc 20:33:43
dwd: only because of the novelty of it. we need to catch up to steer course
dwd 20:33:46
arc, And worse, those that aren't suffer - my iKettle, for instance, is controlled locally, but people want to integrate - and they have to integrate via cloud services now.
Zash 20:34:07
dwd: Like the e-reader thing requiring an account with some online service to display text?
arc 20:34:18
why does your .. what i assume is a water kettle.. need remote access?
arc 20:35:05
that's my other IoT rant that I won't get into. not everything needs a chip in it. bloody Target selling basketballs with a chip in it to count bounces and report them to your phone via bluetooth
dwd 20:35:11
arc, So I can set it to boil from my desk, and - more importantly - so I get a notification on my smartwatch when it does.
arc 20:35:14
my basketball does not need bluetooth.
dwd 20:35:38
arc, I understand. You're wanting it to use zigbee instead?
arc 20:35:48
dwd: lol
jonasw 20:35:54
+1
arc 20:36:03
dwd: you're doing well roleplaying an IoT VC!
dwd 20:36:28
arc, I'm just like a VC, except without the money.
arc 20:36:40
oh so you're homeless? ;-)
Zash 20:36:42
dwd: My water boiler has this amazing wireless notification protocol called "loud click and the sound of boiling water slowly fading away"
moparisthebest 20:36:55
a basketball with a bounce counting chip?
SamWhited 20:37:02
mine makes a sort of loud whistling noise when the water is ready
dwd 20:37:04
Zash, Well. I can actually hear the kettle from my desk, in fairness.
moparisthebest 20:37:08
I'd think you were joking if I didn't know better
xnyhps 20:37:09
moparisthebest: I didn't read much of the backlog, but the DEFLATE option for EXI very likely is vulnerable, without, probably.
Zash 20:37:13
Weren't there baseballs with accelerometers in them to measure how hard they got hit?
SamWhited 20:37:15
it sounds vaguely like air being forced through a small round opening
moparisthebest 20:37:17
maybe I will move to a cabin in the woods like Zash :)
jonasw 20:37:30
Zash: uh, I once had an oven which had the protocol of "if you don’t take care the water boils over the pot's edge and flows down the sides into the oven tripping the RCD and thus cutting off your power"
arc 20:37:32
we have a bluetooth enabled pressure cooker. it has a bluetooth range of maybe 8 feet, 10 if you're lucky. the app you need to communicate with it has basically a clone of the physical interface on the machine
moparisthebest 20:37:51
xnyhps, yea any compression like deflate/brotli/etc would be, the question was whether the 'bitpacking' optimization without compression would be
moparisthebest 20:38:09
or, without what we normally call compression
moparisthebest 20:38:13
I suck at wording
Zash 20:38:39
moparisthebest: Call it PER
arc 20:39:18
xnyhps: DEFLATE only or newer methods like Brotli too
dwd 20:39:20
moparisthebest, EXI in bitpacking mode doesn't have back-references, which is the basic issue.
arc 20:39:52
but there is the string table, which I would argue could have issues, and that's in all modes.
arc 20:40:54
your own JID, for example, will be on the string table. so if someone could send you a jid as an attribute value, i believe it could under specific conditions, confirm if that is your JID or not.
Zash 20:41:09
jonasw: 'the i;octet' intentional or typo?
arc 20:41:29
or if your device is communicating with a server, and they know which IP you're communicating with but not the specific hostname..
dwd 20:41:31
Zash, ACAP Comparator. Not a typo.
jonasw 20:41:58
Zash: that’s how it’s called. not my idea :/
dwd got his ACAP server compiling again the other day because someone actually wanted to use it. 20:42:00
Lance 20:42:21
jonasw: ^5 on the XEP, this looks awesome
jonasw 20:42:27
Lance: thanks
arc 20:42:58
but given what moparisthebest described earlier I think that's a lot less of a security risk, since you couldn't pull out substrings to progressively reverse engineer, and the specific conditions are more difficult to otherwise achieve
Zash 20:43:16
jonasw, dwd: Well it could also have been an artifact of the conversion to epub I did
moparisthebest 20:43:30
right I think you couldn't progressively build up by guessing 1 character at a time that way arc
dwd 20:43:35
Zash, RFC 4790, now, extracted from ACAP. My mistake, I'm behind the times.
xnyhps 20:43:47
If you're not compressing the password anyway, the threat model becomes rather vague.
moparisthebest 20:43:55
which again sounds better/more secure to me, but probably not as secure as not being able to guess at all? I'm sure someone could come up with an attack
xnyhps 20:44:18
Finding out someone's JID requires quite a lot of access for not that much information.
arc 20:44:22
yea no the string table refers to whole qnames and string values
xnyhps 20:44:43
Or who someone is talking to, etc.
arc 20:45:00
yea its not exposing, say, integer values coming from a sensor
dwd 20:45:16
Surely you'd need to address things to them? So at best, you're able to try to guess if someone who you already know by IP address, who is also in a chatroom with you, is the Jid you think they are?
xnyhps 20:46:09
dwd: Yeah, and you probably have much easier ways to do that.
arc 20:46:10
well it only confirms that they're a JID in your string table. it wouldnt expose, necessarily, if they were that JID vs had talked to that JID
arc 20:46:48
EXI doesn't "understand" XMPP beyond the schema you provide it.
arc 20:47:26
i think it might be possible under certain conditions for an IoT vendor to craft an insecure schema tho
arc 20:47:40
for example sensor data should be fixed length
dwd 20:48:05
TBH, I don't think that the use of deflate in XMPP is a general problem anyway. In extremely high-risk cases, perhaps, and if you're dumb enough to use PLAIN and TLS compression.
arc 20:48:12
in that way EXI is more secure than text xml in that integer should be a fixed length, where a string representing an integer is not
jonasw 20:49:07
Zash: do you have a diff of xep 369 from 0.8 to 0.8.1 from your fancy difftool at hand?
arc 20:49:23
for security any XEP for sensor data, it should be actually put in the security section that float and integer values should be zero-padded to their maximum value to decrease risk of data leakage
jonasw 20:49:48
zero-padded to their maximum value? how does zero-padding to maximum value work?
arc 20:50:32
if all the stanzas from a device are the same except being X length, X+1, X+2, X+3, etc based on the scale of a specific integer value, you can determine whether that value is 0-9, 10-99, 100-999, etc
Zash 20:50:38
Doesn't EXI basically work like if you were to generate optimal C structs for all the things, then send that down the wire?
arc 20:50:49
so if the maximum value is, say, 255, it should send as '001' '002' etc
jonasw 20:51:09
arc: wait, leading zeros are encoded?
arc 20:51:26
jonasw: for text xml
moparisthebest 20:51:29
if it's a string it has to be
arc 20:51:43
what i was saying is this is a weakness in text XML that EXI doesn't have
moparisthebest 20:51:43
but even if not most things send integers in a set number of bytes
arc 20:52:13
sure, but eg, a light sensor could flip between 0 and 100, and that would make it obvious what the state was
jonasw 20:52:26
ah I thought you were talking about EXI already, of which I assumed that it encodes it as binary integer
arc 20:52:38
people do not generally encode 0 as <light value="000"/>
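A short sketch of the length leak in text XML that arc describes, using the `<light value="…"/>` element from the chat. The helper name `stanza` and the sample values are illustrative.

```python
def stanza(value, width=None):
    """Serialise a sensor reading, optionally zero-padded to a fixed width."""
    text = str(value) if width is None else str(value).zfill(width)
    return f'<light value="{text}"/>'


samples = (0, 42, 255)

# Without padding, the stanza length reveals the magnitude of the value.
lengths_plain = {v: len(stanza(v)) for v in samples}

# Padded to three digits (enough for a maximum of 255), all lengths match.
lengths_padded = {v: len(stanza(v, width=3)) for v in samples}
```

An observer who sees only ciphertext sizes can tell 0 from 100 in the unpadded case, but learns nothing from the padded one.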
arc 20:52:51
yea it encodes as a binary integer
jonasw 20:53:03
is it a variable-width encoding?
arc 20:53:20
i would have to look that up again, i havent touched that part in awhile
arc 20:53:34
i know you can constrain the range of most values
jonasw 20:53:34
that would have the same issue then, and it cannot be worked around with leading zeros
arc 20:54:42
well i don't believe its variable width per value, i think its only variable width by schema. if the schema says the integer value of a given attribute is 0 to 127, it'll do the right thing.
arc 20:55:04
i havent touched that since november tho, id have to read up on it again
jonasw 20:55:17
no worries
arc 20:55:46
but im like 98% certain that an integer, float, etc value is fixed width from stanza to stanza
moparisthebest 20:55:55
the question is if an integer can be 0 to 65535, it obviously encodes 60000 as 2 bytes, but does it encode 120 as 1 byte or 2 ?
moparisthebest 20:56:06
that'd be a type of compression too
arc 20:56:12
i believe that if an integer is a short it will always be a short.
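To make moparisthebest's question concrete, here is a sketch comparing a fixed 16-bit encoding with a generic 7-bit variable-length encoding. Which of the two EXI applies to a given schema type is left open here, as it is in the discussion; both helpers are illustrative and not taken from any EXI implementation.

```python
import struct


def fixed_u16(value):
    """Always two bytes, whatever the value (big-endian unsigned short)."""
    return struct.pack(">H", value)


def varint(value):
    """Generic 7-bits-per-byte variable-length encoding, low group first."""
    out = bytearray()
    while True:
        byte = value & 0x7F
        value >>= 7
        if value:
            out.append(byte | 0x80)  # more groups follow
        else:
            out.append(byte)
            return bytes(out)


sizes = {v: (len(fixed_u16(v)), len(varint(v))) for v in (120, 60000)}
```

With the fixed encoding both 120 and 60000 take two bytes; with this variable-length scheme 120 takes one byte and 60000 takes three, so the size on the wire again leaks the magnitude and leading zeros cannot hide it.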
moparisthebest 20:56:18
could leak something, idk
arc 20:56:28
you're right it could. but i dont think it does that.
moparisthebest 20:56:33
that's how everything I can remember seeing works yea
arc 20:57:03
and when we draft EXI 2.0 that is something that should be definitely put on the table as a concern
moparisthebest 20:57:50
in general it seems like most things pre-2013 kind of took security as an after thought and might need to be revisited today
arc 20:58:31
so far the only thing I would like to add to EXI is being able to encode a delineator-separated sequence like is used in SVG
arc 20:58:38
if we had that, the SVG world would be all over it
arc 20:59:00
being able to encode paths more efficiently would be a major breakthrough.
Zash 20:59:06
jonasw: You happen to know which revisions that correspond to?
jonasw 20:59:45
Zash: nevermind, I diffed it locally
arc 20:59:56
my initial interest in EXI came from getting tired of hearing about why X chat system doesn't use XMPP, but a binary protocol, for efficiency on mobile / etc
arc 21:00:08
and the same is true for SVG vs proprietary vector formats
jonasw 21:00:15
I’m going to bring up the <feature xmlns="…" /> stuff on standards@ again.
moparisthebest 21:01:23
my complaint about SVG is that most things just arbitrarily execute javascript from them
moparisthebest 21:01:28
not a great security feature
Ge0rG 21:02:24
I wish I'd get some more insight from The Elders on carbonated body-less normal messages...
arc 21:02:33
moparisthebest: the same is true for XHTML-IM
moparisthebest 21:02:52
yep arc
jonasw 21:03:23
script content is not allowed in XHTML-IM…
moparisthebest 21:03:39
but like on my discourse instance I enabled common image format uploads, for example png, jpg, gif, and svg
jonasw 21:03:48
(reminds me, I wanted to polish up my XSLT which strips off anything not allowed as per xep 71)
moparisthebest 21:03:57
then, luckily it was a friend, uploaded an svg with some XSS javascript to steal cookies and showed me :)
jonasw 21:04:20
are there any xslt/xhtml wizards here?
moparisthebest 21:04:43
I'd assume this is where the xslt wizards live :) not me though
Lance 21:05:10
jonasw: stuff like <a href="javascript:alert(1)"> can still exist even without allowing <script> elements
jonasw 21:05:24
Lance: haven’t thought of hrefs, good point
jonasw 21:05:49
but that is usually easily filtered depending on the webview used
dwd 21:05:52
Lance, Depends on CSP.
moparisthebest 21:05:55
a blacklist would be a never ending hole
jonasw 21:06:06
moparisthebest: that’s why I’m using the whitelist from the XEP.
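A sketch of the whitelist approach applied to the `@href` issue Lance raised: accept a link only if its URL scheme is on an explicit list. The set of allowed schemes below is an assumption for illustration, not the list from XEP-0071, and `href_is_safe` is a hypothetical helper.

```python
from urllib.parse import urlsplit

# Assumed whitelist for illustration only.
ALLOWED_SCHEMES = {"http", "https", "mailto", "xmpp"}


def href_is_safe(href):
    """Accept an href only if it carries an explicitly allowed scheme."""
    scheme = urlsplit(href.strip()).scheme.lower()
    # Relative references (empty scheme) are rejected as well.
    return scheme in ALLOWED_SCHEMES


results = {
    href: href_is_safe(href)
    for href in ("https://xmpp.org/", "javascript:alert(1)", "JaVaScRiPt:alert(1)")
}
```

Anything not recognised is dropped, so new dangerous schemes fail closed instead of needing a new blacklist entry.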
dwd 21:06:28
moparisthebest, No, I mean Content Security Policy stuff would prevent inline javascript from working.
moparisthebest 21:06:30
I'm not positive you can do that kind of thing with xslt
moparisthebest 21:07:07
yea dwd, not sure how you get/set that with something like xhtml-im
moparisthebest 21:07:19
surely if there was a handy .noJavascript() method they would have called it
arc 21:07:46
XSLT could do it. You shouldn't do this with XSLT.
arc 21:07:59
no matter how hard you try it will always leave a hole
jonasw 21:08:02
arc: what exactly?
arc 21:08:11
jonasw: filtering XML/HTML
jonasw 21:08:17
hm
jonasw 21:08:23
how else are you going to do it?
jonasw 21:08:44
also, I think that this should be pretty sound: https://github.com/horazont/aioxmpp/blob/devel/data/xhtml-im-sanitise.xsl (leaving aside the @href issue)
arc 21:09:01
I'm in the camp for saying XHTML-IM shouldn't be supported
arc 21:09:07
I wasn't. now I am.
moparisthebest 21:09:12
I agree
jonasw 21:09:15
arc: I also do not like XHTML-IM.
jonasw 21:09:28
but then again, there are people who want rich text in their IM clients.
Zash 21:09:39
BBcode
moparisthebest 21:09:41
you can have rich text without html
jonasw 21:09:49
moparisthebest: is there a XEP for that?
moparisthebest 21:09:57
not that I know of :)
arc 21:09:58
https://plus.google.com/+ArcRiley/posts/BXpPxYRcRim
moparisthebest 21:10:09
someone was advocating markdown somewhat recently
jonasw 21:10:16
(actually, a body type="text/markdown" or type="text/rst" would be great; just make sure your markdown/rst doesn’t pass through HTML…)
moparisthebest 21:10:59
right :) or it starts all over
Zash 21:11:08
Wasn't Markdown defined as an HTML superset?
jonasw 21:11:13
yes, Zash
arc 21:11:19
i dont think thats still a complete solution.
Zash 21:11:25
Nice things, you can't have them
arc 21:11:35
the <a href="javascript:"> links will leak through
moparisthebest 21:11:40
well as Zash said bbcode it is then
Lance 21:11:43
plus the issues with multiple flavors of markdown, etc
moparisthebest 21:11:47
I'm sure there are plenty of libraries already ready to use
moparisthebest 21:11:49
in php...
jonasw 21:11:56
gah, bbcode is annoying too.
Zash 21:12:00
There can be only one! (And it is pandoc)
Zash <3 pandoc 21:12:03
moparisthebest 21:12:24
as the saying goes annoying or insecure pick one
moparisthebest 21:12:32
I probably just made that saying up
arc 21:12:39
Lance: btw one thing i love is the stream framing from websockets? the added overhead for jabber:client namespaces is completely eliminated in EXI
Lance 21:13:06
yes!
arc 21:13:30
if back then when that was being vexed over, if someone had said "in 5 years that won't be an issue anyway because EXI" it would have made the decision much easier
Flow 21:13:39
jonasw: I do think that xep115 has hash agility, and signalling the caps using a second hash algo wouldn't require a ns bump
moparisthebest 21:13:50
re: markdown only one markdown I know has a defined spec, http://commonmark.org/
arc 21:14:06
good lord, i cant even use libxml2 anymore. its just painful.
jonasw 21:14:18
Flow: there was some mailing list post where people discussed otherwise, in the thread Tobias linked I think
arc 21:15:09
schema-based xml coding makes so much more sense
moparisthebest 21:15:49
so I think if you mandated commonmark with the exception of no support for http://spec.commonmark.org/0.27/#html-blocks it might be easier, would need more thought
Flow 21:15:50
nothing prevents clients from using a second hash mech, as long as they still send the mandatory to implement one
Zash 21:16:20
Flow: You mean sending multiple <c> elements?
Flow 21:16:50
Zash: yep
Zash 21:17:39
Flow: Doesn't fix the algorithm for producing the hash tho
Flow 21:17:50
Zash: Right
Flow 21:19:03
But I don't agree with the statement that the change of the hash function of xep115 requires a namespace bump in ecaps2
Flow 21:19:31
jonasw: Any particular reason for going with a new xep instead of updating xep115?
jonasw 21:19:54
Flow: I asked here, and people suggested that a clean new xep is the better way to go.
Flow 21:20:48
jonasw: i see
Lance 21:20:50
IIRC, it was so we could flag 115 as obsoleted by the new one
Kev 21:21:11
jonasw: Well, I think I suggested that a new XEP was the wrong way to go, and updating 115 was preferable :)
Lance 21:21:14
as an encouragement to devs to upgrade
jonasw 21:21:18
Flow: to be clear, I’m happy to drop -xxxx and merge the changes into 115 if council prefers that.
Ge0rG 21:21:31
also to prevent people from doing some compat with the old stuff badly.
jonasw 21:22:18
but considering that it were council people who suggested to go with a new xep, I followed that suggestion.
Flow 21:23:46
pfff, council people are not always right ;)
Lance 21:24:04
not even close :)
jonasw 21:24:28
Flow: they’re, from my understanding, those who decide whether a patch to XEP-115 will be accepted though.
Kev 21:24:29
I think it led to the wrong outcome in this case, but I can't fault the logic of taking advice from Council in general.
Flow 21:25:45
Sure, asking for feedback is always a good idea.
SamWhited 21:26:07
It seems like a good idea to me to go with a new XEP in this case just to encourage people not to try and have backwards compatibility with the old one (which rather defeats the purpose of having a new one), but I don't feel strongly about it and could be convinced either way.
jonasw 21:26:37
in any case, I’m off for tonight. may read the backlog if highlighted
SamWhited 21:26:40
defeats the purpose in this case, I mean, since it's a security issue. Backwards compatibility is sometimes a good idea.
Kev 21:26:46
SamWhited: 115 is a core dependency of a *lot* of XEPs. I don't think replacing it is warranted in this case.
SamWhited 21:27:23
yah, that is tricky, not sure what to do about that. Either way though we'd have to solve that problem and I suspect the two will have to coexist for a while.
Flow 21:27:33
Kev: The question is: Is xep115 the dependency, or xep115 *and* the current namespace of xep115?
Kev 21:27:47
Well, at least for the dependency, it's straightforward, as the dependency is just on the latest version of 115.
Kev 21:28:01
Whether it should be or not is another matter, of course.
Flow 21:29:09
This is a fundamental question as we will find ourselves in the situation more and more in the future. For example with the XEPs depending on xep300
Lance 21:29:19
Yeah, aside from PEP, most of the "dependency" for these XEPs is just the fact that it optimizes the true dependency on disco#info
Zash 21:29:30
Do we need a BCP kind of thing?
Flow 21:29:50
Do we want to update all consumers of xep300 if it receives an incompatible update?
Flow 21:30:25
Or do we want to specify a dependency as xep number *and* "namespace", and update the consumers one after another?
Flow 21:31:45
Lance: Well said. I hate that some XEPs give you the impression that xep115 is an alternative to xep30
Flow 21:32:06
Zash: BCP?
Zash 21:32:26
Flow: IETF thing, like a pointer to the latest RFC on some specific topic.
Lance 21:32:37
Best Current Practices
Flow 21:32:41
ahh best current practice
Zash 21:32:52
Flow: RFCs never change, but a BCP may be changed to point to a new RFC
Flow 21:33:06
isn't that the opposite of what XEPs do?
Flow 21:33:29
i.e., they do change, so we need a pointer to a fixed revision of a xep
Flow 21:33:48
(which we have in our attic btw)
Zash 21:34:20
Final XEPs are probably the closest to how RFCs work
Flow 21:34:29
true
Flow 21:35:27
ahh, enough DNSSEC fun for today. I follow jonasw to the realm of sweet dreams where everything is like it should be
arc 21:36:00
its too bad SRV records don't allow additional information
Ge0rG 21:36:02
Flow: and dream of jumping and colliding SHA1eep?
Lance 21:37:11
arc: what kind of additional info?
arc 21:37:30
i havent touched DNS resolution in awhile, can you send a single request for multiple SRV records?
arc 21:37:41
Lance: for example, the server capability, protocol version, etc
Zash 21:37:54
arc: Multiple how?
moparisthebest 21:37:55
with the same name sure
Lance 21:37:56
arc: whether or not to start with EXI, hrm?
arc 21:38:07
Lance: yes, or TLS, or etc
moparisthebest 21:38:13
I suppose that'd be what TXT records are for arc
arc 21:38:15
yes I know there's a XEP for TLS
moparisthebest 21:38:22
or encode TLS or not TLS in the name like I did haha
moparisthebest 21:38:34
that would easily explode though if you try to encode more
arc 21:38:47
moparisthebest: yes, but doesnt that require multiple lookups? or can the two alternative names be requested at once?
moparisthebest 21:39:08
now we have _xmpp-client, and _xmpps-client, we don't want _xmppse-client and _xmppe-client for exi for example too, probably
Zash 21:39:11
_xmpp{s,}-{client,server}{,-exi}._tcp
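Zash's shell-style pattern can be expanded to see how quickly the SRV names multiply, reading the second group as the alternatives client and server. The `-exi` variants are hypothetical names from the chat, not registered services.

```python
from itertools import product


def srv_names():
    """Expand the pattern into every SRV name it stands for."""
    return [
        f"_xmpp{tls}-{role}{exi}._tcp"
        for tls, role, exi in product(("s", ""), ("client", "server"), ("", "-exi"))
    ]


names = srv_names()
```

Three two-way choices already give eight names, and each further capability encoded in the name would double the number of lookups again.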
arc 21:39:22
yea. so.. part of EXI is the first byte of an EXI stream is never a valid text unicode string by any encoding
moparisthebest 21:39:26
yea arc that's 2 separate lookups
arc 21:40:02
one way is SRV records. the other way is to just punch EXI at the server, and it either responds with EXI or not
moparisthebest 21:40:09
arc, uh what about ALPN I think that neatly solves your problem?
arc 21:40:16
ALPN?
moparisthebest 21:40:31
tls extension, tells it the protocol(s) you'd like to speak
moparisthebest 21:40:38
Application Layer Protocol Negotiation ?
moparisthebest 21:40:43
http2 uses it
arc 21:40:52
oh, yes that could work
arc 21:41:10
ive seen this before, just forgot about it
moparisthebest 21:41:24
xep-0368 uses it too, but optionally
arc 21:42:37
yea i saw this mentioned somewhere about http2 awhile ago. so, what does the payload look like
Zash 21:43:04
A text string in a TLS extension
Ge0rG 21:43:21
a byte array.
Ge0rG 21:43:44
because text strings are imPRECISe
arc 21:44:08
ok so we could define a meaning for that which is extensible to other things
moparisthebest 21:44:13
yea Ge0rG is more correct it's a precisely defined sequence of bytes
arc 21:44:32
the key is it must be possible to use EXI without support for text XML
moparisthebest 21:44:50
so basically an EXI xep could depend on xmpps-* records from xep-0368, and send its own custom ALPN protocol sequence
moparisthebest 21:45:09
or optionally, both xmpp-client and xmpp-exi-client or whatever
moparisthebest 21:45:17
and server would say I can speak X
moparisthebest 21:45:31
at which point you'd proceed or try next SRV record
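A minimal client-side sketch of the ALPN idea using Python's standard `ssl` module. `xmpp-client` is the ALPN name used by XEP-0368; `xmpp-exi-client` is the made-up name from the chat and is not registered anywhere. No connection is opened here.

```python
import ssl

# Offered in order of preference; the EXI name is hypothetical.
OFFERED = ["xmpp-exi-client", "xmpp-client"]


def make_context(protocols=OFFERED):
    """Build a TLS client context that advertises the given ALPN names."""
    ctx = ssl.create_default_context()
    ctx.set_alpn_protocols(list(protocols))
    return ctx


def next_step(selected):
    """Decide what to do with the protocol the server picked.

    `selected` is what SSLSocket.selected_alpn_protocol() returns after
    the handshake: one of the offered names, or None.
    """
    if selected == "xmpp-exi-client":
        return "exi"
    if selected == "xmpp-client":
        return "xml"
    return "next-srv"  # nothing usable here: try the next SRV record


ctx = make_context()
```

After wrapping a socket with `ctx.wrap_socket(sock, server_hostname=host)`, the client would call `next_step(tls_sock.selected_alpn_protocol())` and either start an EXI stream, fall back to text XML, or move on.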
arc 21:46:04
i *hope* that server support would be well deployed before its an issue
arc 21:47:07
oh interesting. it doesnt look like Contiki OS supports ALPN
Lance 21:47:08
arc: also, once the EXI XEP is decent, I'd be happy to help with making a proper xmpp-exi websocket binary subprotocol
arc 21:47:37
Lance: absolutely. but lets get a javascript library for it first ;-)
arc 21:48:21
from the times its been brought up i think the right path is to kill 0322 and start fresh. the one up there is utter nonsense from an implementers point of view
arc 21:49:21
50% of the document is re-implementing EXI header format in a less compact form
arc 21:50:07
and it doesn't even really get into how to handle a "pure" EXI stream (not starting with text XML)
arc 21:52:41
the mechanism I think is best is this: 1) Client sends EXI header with <open> framing. in the header, the schemaId field contains a hash identifier for the schema it wants to use, generally in sha256: URI format, but this allows future hash values to be used
arc 21:53:32
2) if server doesn't already have that schema, it responds with EXI header for a "default" stream using the schema-schema, and gives an error that the requested schema must be provided
arc 21:55:36
3) if client receives such an error, it will restart its EXI stream with the same schema and transfer that schema 4) server responds with the hash as it understands it wishes the client to use in the future (generally, sha256: URI) 5) stream restarts (or continues after step 1, if server responded with the EXI header for the same schema) normally
arc 21:56:41
the error-restart method should only be needed after a server is wiped, upgraded, or the first time a client of a specific version connects to it. sha256 is suggested to minimize this (large servers will already have the schema on file) but can be boosted in the future
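The flow arc outlines can be sketched as a small in-memory model. Everything here is an assumption for illustration: the `sha256:<hex>` identifier is a placeholder for whatever URI format is eventually chosen, and the class and status names are invented. EXI framing, the schema-schema and stream restarts are not modelled.

```python
import hashlib


def schema_id(schema_bytes):
    """Placeholder identifier: 'sha256:' plus the hex digest."""
    return "sha256:" + hashlib.sha256(schema_bytes).hexdigest()


class SchemaServer:
    """Toy server that knows schemas by identifier."""

    def __init__(self):
        self.schemas = {}

    def open(self, requested_id):
        # Steps 1 and 2: accept a known schema, otherwise ask for it.
        if requested_id in self.schemas:
            return ("ok", requested_id)
        return ("schema-required", None)

    def upload(self, schema_bytes):
        # Steps 3 and 4: store the schema and return the identifier
        # the server wants the client to use from now on.
        agreed = schema_id(schema_bytes)
        self.schemas[agreed] = schema_bytes
        return ("ok", agreed)


def connect(server, schema_bytes):
    """Client side: try the hash first, upload only if asked to."""
    status, agreed = server.open(schema_id(schema_bytes))
    uploaded = False
    if status == "schema-required":
        status, agreed = server.upload(schema_bytes)
        uploaded = True
    return agreed, uploaded


server = SchemaServer()
first = connect(server, b"<toy schema>")
second = connect(server, b"<toy schema>")
```

The first connection has to upload the schema; every later connection with the same schema succeeds on the hash alone, which is the common case arc expects on large servers.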
arc 21:57:53
it otherwise uses the same framing as websocket.
arc 22:00:10
vs XEP 0322 it removes the issues with asking the server to download schemas from a HTTP resource (eg, using XMPP servers to multiply ddos attacks on webservices), removes the need for a text XML parser, reduces handshakes to initiate a typical connection, and removes redundant negotiation
moparisthebest 22:01:59
so it just sends a hash of the schema it wants to use?
moparisthebest 22:02:08
no other info about it?
arc 22:02:33
yes in the EXI header field for schemaId. i believe the hash URI standard allows for length too
moparisthebest 22:03:18
I was going to ask what stops a malicious client from uploading a 10gb schema
arc 22:03:30
if the hash isnt known by the server, it asks the client to transfer the whole thing, and then the server gives the client a URI to refer to that schema in the future - which might be a newer hash
arc 22:03:52
moparisthebest: the server should cut it off at some point obviously. schema should never be anywhere near that big, especially EXI encoded.
arc 22:04:20
i mean you could make the same claim for what stops a client from sending a 10g <stream:stream opening element with a gazillion attributes
moparisthebest 22:04:26
that is true
moparisthebest 22:04:55
I wonder what current servers do with that hehe
moparisthebest 22:05:02
or clients
arc 22:05:26
with EXI? the few experimental ones use XEP 0322
arc 22:05:42
i am not aware of EXI being used in production anywhere tho
arc 22:05:53
the only complete implementation of EXI I'm aware of is written in Java
arc 22:06:08
my libexi will be #2.
moparisthebest 22:06:11
oh I meant I wonder what current servers or clients do with 10 gigabyte <stream:stream xml
arc 22:06:20
oh, that's a good question
moparisthebest 22:06:30
evil me wants to try it out
arc 22:06:36
I'm willing to bet at least one will catch on fire
moparisthebest 22:06:38
not at a production server of course other than mine :)
moparisthebest 22:07:23
I'm guessing some are protected by a naive "no xml will contain > 10m so that's my buffer size"
moparisthebest 22:07:34
or similar, but yea, testing time
arc 22:08:05
well id bet actually that expat or libxml2 will dutifully attempt to parse it regardless.
SamWhited 22:08:14
What is realistically the biggest packet size a server should expect? Not more than a couple of kilobytes surely?
arc 22:08:39
SamWhited: with HTTP over XMPP it could be more. isnt there a way for a MTU to be set?
Kev 22:08:45
Given the minimum maximum stanza size is 10k, no, a bit more than that.
Kev 22:08:51
Depending what you mean by 'packet'.
arc 22:09:07
i assume stanza
SamWhited 22:10:11
yah, I don't know what I meant by packet… "start stream tag or any second level element" I suppose
arc 22:10:51
amount of data in the XML parser which is not yet returned to the client?
arc 22:10:51
er, application
arc 22:11:13
moparisthebest: this is a good security case to note
arc 22:12:44
another issue servers might want to look out for is flooding it with new schemas. an LRU cache should be used to keep the number of schemas from being pushed out of control by an attacker
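
A rough sketch of the LRU idea, assuming schemas are keyed by whatever schemaId the server handed out. `SchemaCache` and its `max_entries` default are made up for illustration; the point is only that the least recently used schema is dropped once the cap is hit.

```python
from collections import OrderedDict


class SchemaCache:
    """Hypothetical LRU cache mapping schemaId -> encoded schema bytes."""

    def __init__(self, max_entries: int = 128):
        self.max_entries = max_entries
        self._entries = OrderedDict()

    def get(self, schema_id: str):
        if schema_id not in self._entries:
            return None
        # A hit makes this entry the most recently used one.
        self._entries.move_to_end(schema_id)
        return self._entries[schema_id]

    def put(self, schema_id: str, schema_bytes: bytes) -> None:
        self._entries[schema_id] = schema_bytes
        self._entries.move_to_end(schema_id)
        # Evict from the least recently used end until under the cap.
        while len(self._entries) > self.max_entries:
            self._entries.popitem(last=False)

    def __len__(self) -> int:
        return len(self._entries)
```

An attacker flooding new schemas can still push legitimate ones out of such a cache, so this bounds storage but not re-upload traffic.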
moparisthebest 22:13:38
it might or might not matter, but it could be a bit racy
moparisthebest 22:13:53
like if 10000 iot devices all connect at the same time, request the same hash, server doesn't have it
arc 22:13:54
yea disk size. but you can flood that with logs too
moparisthebest 22:13:59
I guess they all simultaneously upload it?
arc 22:14:31
that sounds like a crazy race condition
arc 22:14:53
actually no, that'd almost never happen because each one has to be provisioned right?
moparisthebest 22:15:21
it seems like it'd happen when you reboot the server or something though
arc 22:15:24
i mean almost never happen that two try to send in the same schema at once. and one would hope the server can handle that well
arc 22:15:40
oh, true. or upgrade it such that it wants to wipe the cache
moparisthebest 22:15:56
maybe something like that
moparisthebest 22:16:18
maybe you block the others while a few are uploading or something?
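
One way the "block the others while one is uploading" idea could look, as a sketch only: the first requester for an unknown schemaId performs the upload and everyone else asking for the same id waits for it. `UploadCoordinator` and the `upload` callback are hypothetical names, and threads stand in for whatever concurrency model a real server uses.

```python
import threading


class UploadCoordinator:
    """Hypothetical helper: at most one upload in flight per schemaId."""

    def __init__(self):
        self._lock = threading.Lock()
        self._store = {}    # schemaId -> schema bytes already received
        self._pending = {}  # schemaId -> Event set when the upload ends

    def obtain(self, schema_id, upload):
        with self._lock:
            if schema_id in self._store:
                return self._store[schema_id]
            event = self._pending.get(schema_id)
            leader = event is None
            if leader:
                event = threading.Event()
                self._pending[schema_id] = event

        if leader:
            try:
                data = upload()  # only the first requester transfers the schema
                with self._lock:
                    self._store[schema_id] = data
            finally:
                with self._lock:
                    del self._pending[schema_id]
                event.set()
            return data

        # Followers wait for the leader; they get None if its upload failed.
        event.wait()
        with self._lock:
            return self._store.get(schema_id)
```

If the leader's upload fails, waiting clients see `None` and would have to retry, which is one of several policies a server could choose.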
moparisthebest 22:16:25
servers might be able to do something smartly
arc 22:16:39
if a server policy is to, eg, use a SHA512 for added security because the operator considers SHA256 weak, even if it "has" the schema on disk it would need clients to transmit it in order to give it the hash that it wants
arc 22:17:02
the schema shouldn't be large. that's why it's EXI-encoded too.
moparisthebest 22:17:10
I kind of assumed once a schema is uploaded the server would store it along with *all* the hashes
arc 22:17:21
it could do that too.
moparisthebest 22:17:33
anyway I'm off here for the day :) have a good one
arc 22:18:04
so if a newer client asks for a sha512: right off the bat the server can respond "correctly"
arc 22:19:24
all the server MUST do is return the schemaId it would like the client to refer to this schema with in the future. it SHOULD return with a hash URL, and it SHOULD record and handle any hash URL by any method the server considers secure
arc 22:20:10
so that clients connecting to the server for the first time using the same schema as another client of the same model, can do so without having to send the schema first.
moparisthebest 22:20:34
any reason it just wouldn't always use the hash?
arc 22:20:46
#futurehash
moparisthebest 22:20:46
that seems like the only way you could be safe knowing you were both talking about the same thing
arc 22:21:25
allow the server to support future hash mechanisms without clients needing to understand them
arc 22:22:54
a client sends a sha256: URI. the server responds to uploading it with a sha512: uri. client records and uses what the server gave it. the sha256: URI the client started with was a guess. if sha512: were to become a new standard every client could use it.
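
The exchange described above can be sketched roughly as follows, combined with the earlier suggestion that the server store an uploaded schema under all the hashes it supports. `SchemaRegistry` is a hypothetical name, and the `alg:hexdigest` identifier format is a stand-in for illustration, not the actual hash URI syntax.

```python
import hashlib


class SchemaRegistry:
    """Hypothetical server-side store indexing each schema by several hashes."""

    def __init__(self, algorithms=("sha256", "sha512"), preferred="sha512"):
        if preferred not in algorithms:
            raise ValueError("preferred algorithm must be one of algorithms")
        self.algorithms = algorithms
        self.preferred = preferred
        self._by_id = {}

    @staticmethod
    def make_id(algorithm: str, schema_bytes: bytes) -> str:
        digest = hashlib.new(algorithm, schema_bytes).hexdigest()
        return f"{algorithm}:{digest}"

    def lookup(self, schema_id: str):
        # None means "unknown, please upload the whole schema".
        return self._by_id.get(schema_id)

    def upload(self, schema_bytes: bytes) -> str:
        # Index under every supported algorithm so later clients guessing
        # with any of them get a hit without re-uploading.
        for algorithm in self.algorithms:
            self._by_id[self.make_id(algorithm, schema_bytes)] = schema_bytes
        # Return the identifier the server wants the client to use from now on.
        return self.make_id(self.preferred, schema_bytes)
```

A client that guessed with sha256 and got a miss would upload once, then record the returned identifier without needing to understand the algorithm behind it.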
arc 22:23:33
otherwise a client connecting to a server for the first time would just start with the default schema and send the schema in order to get the identifier. which could become a bit much.
arc 22:24:00
in 2017 i think we all consider sha256 strong. 2020 who knows
arc 22:24:34
this is just me spitballing though.
moparisthebest 22:25:14
so maybe a server MUST respond with a hash, it MUST respond with the hash in the same algorithm the client sent unless it doesn't understand that algorithm, in which case it MUST respond with the hash in the 'strongest' algorithm the server supports as decided by the server
arc 22:27:01
that has some odd implications too. the hash itself is added weight for every connection. if 256 is considered enough, it should use 256.
arc 22:28:27
802.15.4 devices have an effective MTU of around 100 bytes, and over 6lowpan packet fragmentation can cause real connectivity issues. its best to keep the EXI-encoded stanza payload under 100 bytes
arc 22:29:59
the exi header with a sha256 uri consumes almost 100 bytes by itself, iirc
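
A back-of-the-envelope sketch of why the choice of hash matters on such links. It assumes a hex-encoded `alg:digest` identifier (an illustrative format, not necessarily what the EXI header would carry) and the roughly 100-byte effective MTU quoted above, and it ignores per-fragment header overhead.

```python
import hashlib
import math


def id_length(algorithm: str) -> int:
    """Length in bytes of a hypothetical 'alg:hexdigest' identifier."""
    hex_chars = hashlib.new(algorithm).digest_size * 2
    return len(algorithm) + 1 + hex_chars


def fragments_needed(payload_bytes: int, mtu_bytes: int = 100) -> int:
    """Frames needed for a payload, ignoring fragmentation headers."""
    return max(1, math.ceil(payload_bytes / mtu_bytes))


if __name__ == "__main__":
    for alg in ("sha256", "sha512"):
        size = id_length(alg)
        print(alg, size, "bytes,", fragments_needed(size), "frame(s)")
```

Under these assumptions the sha512 identifier alone no longer fits in a single frame, before any other header or payload bytes are counted.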
arc 22:30:10
if its just <open> though its fine
arc 22:31:10
i imagine #futurehash is more likely to be used over 802.11ah or similar newer, low-power protocol though which isnt necessarily subjected to the same constraints
arc 22:38:04
in some cities right now, every bus is driving around with a 802.15.4 transceiver in a weather-proof plastic shell and a tiny solar cell glued to the top of the bus, rechargeable battery, recording and sending realtime air quality data through a makeshift mesh network using, IIRC, some MQTT-based protocol
arc 22:40:09
since they use 2.4 GHz the buses are regularly delinked from the mesh network due to excessive frame collisions and inability to return pings, so restarting a stream on reconnect while under pressure is a real thing
arc 22:40:42
fragmentation multiplies the problem in those cases.