XSF Discussion - 2021-04-14

moparisthebest 06:23:21
when confronted with a too-big stanza as a server, what would the implications be if the server just silently dropped the stanza and carried on instead of terminating the stream with an error ?
moparisthebest 06:23:45
maybe this answer is different for s2s vs c2s
jonas’ 06:23:50
moparisthebest, if it can do that, it should instead bounce the stanza properly
jonas’ 06:24:22
then there are no (new) implications, beyond what you already get when servers make decisions about which stanzas to deliver
moparisthebest 06:24:25
what if it can drop it, but can't parse it
jonas’ 06:24:34
unlikely, I think
jonas’ 06:24:49
to bounce a stanza, you only need the startElement SAX event
jonas’ 06:24:57
(or equivalent)
jonas’ 06:25:07
I think you already need that + endElement + magic to determine the stanza boundaries anyway
jonas’ 06:26:33
I guess dropping would be similar to s2s-closing (without stream management), so there’s not much of a difference there, except that deliverability of other stanzas will be improved (as they don’t get lost in s2s TCP buffers when the stream is killed)
moparisthebest 06:27:24
I'm determining stanza boundaries by counting tags and don't have an XML parser
jonas’ 06:27:29
how are you counting tags?
jonas’ 06:27:53
(to count tags, you already need something like an XML parser ;))
jonas’ 06:28:18
enough of an XML parser that only a shim layer is needed to extract the attributes needed for bouncing anyway
moparisthebest 06:30:26
roughly, incrementing on < and decrementing on /> or </ so a stanza boundary is when the counter hits 0
jonas’ 06:30:56
<![CDATA[ < ]]>?
moparisthebest 06:31:29
hopefully no one sends that :D seems to have worked so far
jonas’ 06:31:35
it is valid though
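To illustrate the exchange above, here is a minimal Python sketch of the counting heuristic as moparisthebest describes it (increment on `<`, decrement on `/>` or `</`). It is not the actual xmpp-proxy code, and the function name `naive_depth` is made up for this illustration. It shows why the CDATA example matters: the `<` characters inside the CDATA section are counted as tags, so the counter never returns to 0.

```python
def naive_depth(data: str) -> int:
    """Return the tag counter after scanning data with the naive heuristic."""
    depth = 0
    i = 0
    while i < len(data):
        if data.startswith("</", i) or data.startswith("/>", i):
            # closing tag or self-closing tag: one level up
            depth -= 1
            i += 2
        elif data[i] == "<":
            # anything else starting with '<' is treated as an opening tag
            depth += 1
            i += 1
        else:
            i += 1
    return depth


plain = "<message><body>hi</body></message>"
cdata = "<message><body><![CDATA[ < ]]></body></message>"

print(naive_depth(plain))  # 0: a stanza boundary is detected
print(naive_depth(cdata))  # 2: the boundary is missed
```

A splitter that wants to stay parser-free would at least need to recognise `<![CDATA[` ... `]]>` (and comments or processing instructions, if it wants to reject them) before counting.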
jonas’ 06:31:59
and: what stops you from catching the slice from the first < to the first > and parsing that as attributes? you can drop all namespace-related stuff as that’s not needed on stanzas anyway, just plain key/value
moparisthebest 06:33:54
hrm maybe, so you think that'd be valid? sending an error response on too-big-stanzas and just skipping them otherwise? but not terminating the stream?
jonas’ 06:34:00
yes
jonas’ 06:34:10
<policy-violation/> would be an appropriate stanza error I think
jonas’ 06:34:13
or maybe <resource-constraint/>
jonas’ 06:34:32
https://tools.ietf.org/html/rfc6120#section-4.9.3.14
jonas’ 06:34:40
look at that, policy-violation has stanza size limit as an example
jonas’ 06:34:48
oh, that’s the stream error, sorry
moparisthebest 06:35:30
antranigv in this MUC has an avatar that makes a stanza bigger than 262,144 bytes for instance
jonas’ 06:35:32
but I think the stanza error is as valid
jonas’ 06:36:02
and bouncing individual stanzas with whatever errors you feel like is the right of a server; clients need to cope with that
jonas’ 06:36:23
dropping on the other hand is tricky because it may be seen as violation of the "MUST reply to IQ requests" clause of RFC 6120
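A rough Python sketch of the bounce jonas’ suggests: take the slice from the first `<` to the first `>`, read the attributes as plain key/value pairs, and answer with a stanza error instead of killing the stream. The helper names, the default `modify` error type and the choice of `policy-violation` are assumptions for illustration. The slice heuristic also assumes that no attribute value contains a literal `>`, which XML does allow.

```python
import re

# name = quoted value, either quote style; values are kept raw (still escaped)
ATTR_RE = re.compile(r"""([\w:.-]+)\s*=\s*(["'])(.*?)\2""", re.S)
STANZA_ERR_NS = "urn:ietf:params:xml:ns:xmpp-stanzas"


def parse_opening_tag(buf):
    """Return (element name, {attr: (quote, raw value)}) for the first tag in buf."""
    end = buf.index(">")  # heuristic: assumes no '>' inside attribute values
    inner = buf[1:end].rstrip("/")
    name = re.match(r"[^\s/]+", inner).group(0)
    attrs = {m.group(1): (m.group(2), m.group(3)) for m in ATTR_RE.finditer(inner)}
    return name, attrs


def bounce(buf, condition="policy-violation", error_type="modify"):
    """Build an error reply from the start of an oversized stanza."""
    name, attrs = parse_opening_tag(buf)
    if attrs.get("type", ("", ""))[1] == "error":
        return None  # never answer an error with another error
    out = [f"<{name}"]
    # swap the addresses and keep the id so the sender can match the reply
    for new_key, old_key in (("to", "from"), ("from", "to"), ("id", "id")):
        if old_key in attrs:
            quote, value = attrs[old_key]
            out.append(f" {new_key}={quote}{value}{quote}")
    out.append(f' type="error"><error type="{error_type}">')
    out.append(f'<{condition} xmlns="{STANZA_ERR_NS}"/></error></{name}>')
    return "".join(out)


start = "<iq from='a@x.example/r' to='b@y.example' id='42' type='get'><query"
print(bounce(start))
```

Because the reply is built only from the opening tag, the rest of the oversized stanza can be skipped without buffering it, and IQ requests still get the answer RFC 6120 requires.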
flow 06:38:57
> moparisthebest> antranigv in this MUC has an avatar that makes a stanza bigger than 262,144 bytes for instance
One of the reasons that make me believe that servers should be very conservative about the sizes of stanzas they accept. Something in the range of 10 KiB to 64 KiB seems appropriate
flow 06:40:18
I also think that with larger stanza sizes you get easily into scheduling issues, as in, a single stanza can dominate the link while it is transferred
flow 06:41:28
This in turn means that you should probably bounce too-big stanzas at the first hop, and close the stream on all following hops as soon as the limit is reached
jonas’ 06:41:49
good point about hogging the link
flow 06:42:05
Unfortunately, I believe that the major parts of the ecosystem would break if we do that
flow 06:42:22
But I really hope that we get the ecosystem towards a reasonable sized stanza limit in the future
flow 06:43:00
Of course, this requires things like RSM on disco#(info|item) responses
jonas’ 06:43:51
servers already do that :(
dwd 06:43:52
I wonder if MAM-like responses on disco might make more sense. Or additional sense.
jonas’ 06:43:53
;)
flow 06:44:35
Furthermore, if we talk about techniques that split large content into multiple stanzas, then we have to be aware that it's not easy for the generating entity to assess the resulting stanza size
jonas’ 06:45:11
just found this https://github.com/horazont/muchopper/commit/7affc581a8539eebc190371d95539bed1fb8bd7c#diff-6d5dca6e6e9b9db00d48d0bb6b748f7302f6b941e4bcee48427a54ff74122c46R53-R59
flow 06:45:15
But probably a pragmatic approach like "aim for 10 KiB stanzas, but limit at 32 KiB" is good enough here
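flow's policy (bounce at the first hop, close the stream on later hops, aim for 10 KiB but limit at 32 KiB) can be written down as a small decision function. This is only a sketch of the idea as discussed here; the constant and function names are invented, and no server is known to implement exactly this.

```python
SOFT_TARGET = 10 * 1024  # size that generating entities should aim for
HARD_LIMIT = 32 * 1024   # size above which a stanza is refused


def on_stanza_size(size: int, first_hop: bool) -> str:
    """Decide what to do with an incoming stanza of the given size in bytes."""
    if size <= HARD_LIMIT:
        return "accept"
    # The first hop can still tell the sender what went wrong; later hops
    # should never see such a stanza, so they treat it as a stream-level problem.
    return "bounce" if first_hop else "close-stream"


def should_split(size: int) -> bool:
    """Hint for generating entities: split content once past the soft target."""
    return size > SOFT_TARGET


print(on_stanza_size(262_144, first_hop=True))   # bounce
print(on_stanza_size(262_144, first_hop=False))  # close-stream
```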
flow 06:46:15
jonas’, is https://github.com/horazont/muchopper/commit/7affc581a8539eebc190371d95539bed1fb8bd7c#diff-6d5dca6e6e9b9db00d48d0bb6b748f7302f6b941e4bcee48427a54ff74122c46R53 fixed today?
jonas’ 06:46:51
I did not check again since that commit
moparisthebest 06:49:38
it's ok I'm 99% sure I found an ejabberd bug while looking at this too, are `<features xmlns="http://etherx.jabber.org/streams"><starttls xmlns="urn:ietf:params:xml:ns:xmpp-tls"><required/></starttls></features>` and `<stream:features><starttls xmlns="urn:ietf:params:xml:ns:xmpp-tls"><required/></starttls></stream:features>` the same or not?
moparisthebest 06:50:45
prosody, dino, gajim, and conversations are ok with either, ejabberd requires the second
jonas’ 06:51:15
they are the same in XML 1.0 with Namespaces
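That the two spellings are the same under Namespaces in XML can be checked with any namespace-aware parser. Below is a small sketch using Python's standard `xml.etree.ElementTree`. The `xmlns:stream` declaration on the second fragment is added only so that it parses standalone (in a real stream it is inherited from the `<stream:stream>` header), and `qualified_names` is a helper name invented for this example.

```python
import xml.etree.ElementTree as ET

DEFAULT_NS_FORM = (
    '<features xmlns="http://etherx.jabber.org/streams">'
    '<starttls xmlns="urn:ietf:params:xml:ns:xmpp-tls"><required/></starttls>'
    '</features>'
)
PREFIXED_FORM = (
    '<stream:features xmlns:stream="http://etherx.jabber.org/streams">'
    '<starttls xmlns="urn:ietf:params:xml:ns:xmpp-tls"><required/></starttls>'
    '</stream:features>'
)


def qualified_names(xml_text):
    """Return the {namespace}localname of every element, in document order."""
    return [el.tag for el in ET.fromstring(xml_text).iter()]


print(qualified_names(DEFAULT_NS_FORM))
print(qualified_names(DEFAULT_NS_FORM) == qualified_names(PREFIXED_FORM))  # True
```

A parser that compares the literal string `stream:features` instead of the namespace plus local name will treat the first form as an unknown element.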
jonas’ 06:51:21
but yeah, ejabberd is picky
moparisthebest 06:51:57
odd failure mode too, it just hangs for 90 seconds and then sends `<stream:error><connection-timeout xmlns='urn:ietf:params:xml:ns:xmpp-streams'/><text xml:lang='en' xmlns='urn:ietf:params:xml:ns:xmpp-streams'>Idle connection</text></stream:error>` and hangs up
jonas’ 06:52:16
yep, probably the parser doesn’t read it as belonging to the stream namespace and then drops it
jonas’ 06:53:07
moparisthebest, but note the interop note in https://tools.ietf.org/html/rfc6120#section-4.8.5
jonas’ 06:53:30
> Implementations are advised that using a prefix other than 'stream' for the stream namespace might result in interoperability problems.
moparisthebest 06:54:45
ah, nice
flow 06:57:43
moparisthebest, sounds like you have a new(?) project, what is it? :)
moparisthebest 07:05:54
flow, reverse proxy https://github.com/moparisthebest/xmpp-proxy still super early, just started dogfooding it on my server recently :)
flow 07:38:51
moparisthebest, so you are competing with dwd now? :)
dwd 07:39:41
Competition always welcome.
dwd 07:39:53
Especially as I've not had any time for Metre recently.
jonas’ 07:39:57
I think metre doesn’t do c2s, does it?
dwd 07:40:03
It does not, no.
dwd 07:40:33
Going to go out on a limb and suggest that xmpp-proxy doesn't do S2S, it's just a connection proxy, too.
moparisthebest 07:41:18
https://github.com/surevine/Metre ? quite a bit different I guess
moparisthebest 07:43:24
xmpp-proxy does c2s and s2s, and multiplexes both along with starttls + direct tls all on the same port(s), but also limits stanza sizes
dwd 07:43:49
But inbound only if I follow this right?
moparisthebest 07:44:02
correct, inbound only, and provides TLS
dwd 07:44:19
And that StanzaFilter looks scary. :-)
Kev 07:44:34
How does it cope with presenting the client cert to the server?
Kev 07:45:01
Or is there a ‘I have verified trust, just do external’ flag being passed to the server-proper somehow?
dwd 07:45:21
Kev, Yeah, doesn't seem to be any support for that.
Kev 07:46:11
Probably more applicable to C2S where client certs are uncommon than to S2S then, I guess?
moparisthebest 07:46:23
no support for client cert auth, the server just implicitly trusts that connections from it are encrypted (also for the PROXY header)
Kev 07:46:25
Although I guess it probably gets in the way of -PLUS too?
dwd 07:46:55
It'd break channel bindings, yes. Also I think the start-tls handling makes me cry.
moparisthebest 07:46:58
yes I agree the StanzaFilter is scary haha
moparisthebest 07:47:16
these are all correct reactions I think
Kev 07:47:21
I keep getting tempted to write one of these for M-Link, but I keep deciding it needs to be more than a naive proxy because of the interactions between TLS and auth.
Kev 07:48:05
(Also because I’d like to use it as a mechanism for seamless upgrades).
moparisthebest 07:48:30
(personal opinion for my use-cases only incoming) interactions between TLS and auth are dead, let them lie
dwd 07:48:32
Funnily enough, I thought about adapting the Metre code into an Openfire C2S connection manager.
dwd 07:48:53
moparisthebest, I think that's not true given channel bindings especially.
moparisthebest 07:49:10
do they work with TLS 1.3 yet ?
Kev 07:49:10
moparisthebest: You might not like channel binding, but surely you’re not opposed to S2S strong auth?
dwd 07:49:17
moparisthebest, Thanks to Sam, yes.
moparisthebest 07:49:23
will they work with QUIC
dwd 07:49:52
moparisthebest, Your xmpp-proxy doesn't work with QUIC, so somewhat irrelevant, but I think they should given they're simply based on an exported key.
moparisthebest 07:50:56
unfortunately if the web doesn't use them, we can't rely on them being supported anywhere
dwd 07:51:30
moparisthebest, Except key exports *are* used and seem well supported, so Sam's approach seems much more likely to be generally available.
moparisthebest 07:54:24
and QUIC is the future so I wouldn't call it irrelevant, still a hair early though, we'll see
dwd 07:55:23
moparisthebest, Sure. I expect we'll have XMPP over QUIC at some point. But I don't think key exporting is affected by QUIC versus TLS/TCP, so everything should "just work".
dwd 07:55:59
moparisthebest, Obviously there's very much a place for xmpp-proxy as a bridge to uplift existing servers and services with QUIC etc support though.
jonas’ 07:58:10
moparisthebest, so that effectively forces use of dialback? or what?
dwd 07:58:12
moparisthebest, I think your StanzaFilter works for idiomatic XMPP XML, but I'm not entirely sure it'll work correctly in some edge cases. I suspect if I'm sufficiently abusive I can at least sneak a closing stream tag past it.
jonas’ 07:58:37
I’ll just say CDATA
dwd 07:59:05
jonas’, Oh, yeah, good point, that too.
moparisthebest 08:02:02
also '< stream:stream' and a few other things, probably ok, it accomplishes its goal
dwd 08:03:58
moparisthebest, And '<str:stream', and all sorts. Likely it'll work in all non-abusive cases. But yeah, I can get it to terminate the session without the XMPP server thinking it's over, and I can get the XMPP server to close it without xmpp-proxy to work. And any server advertising -PLUS basically won't allow auth. Hopefully. But as I say, it'll mostly work and it's a useful tool for a number of cases.
Kev 08:05:36
dwd: The server needs to know there’s a TLS terminator in the way anyway, so presumably should not present any auth mechs that don’t proxy in any case.
moparisthebest 08:06:57
yep I agree, it was written in a bit of a hurry for one specific use-case which it accomplishes, I'll elaborate more later, and in the meanwhile think about if it can support CDATA sanely without bringing in an XML parser :/
Kev 08:07:59
I would be inclined to bring in an XML parser, personally.
Kev 08:08:32
Well, let me rephrase that.
Kev 08:08:40
If I wanted this to be generally useful, I would bring in an XML parser.
dwd 08:09:19
Kev, Oh, I think it *is* generally useful, but possibly limited to testing out deployment strategies you'd then incorporate into server mainline code.
Kev 08:09:22
If it just needs to meet a use case and it meets it, *shrug*.
moparisthebest 08:09:38
I only hesitate because historically XML parsers have been known to have worse bugs than "closes connection wrongly sometimes"
Kev 08:09:48
That is not unfair.
dwd 08:10:25
moparisthebest, This is true, but I can smell the faint odour of a DoS attack there somewhere.
moparisthebest 08:11:15
I think yes, but only in terms of killing s2s connections ?
Kev 08:11:38
Killing S2S is probably less bad than *not* killing S2S
Kev 08:11:58
(I have no idea if such an issue can present)
flow 08:23:19
moparisthebest, jxmpp has an XmlSplitter, which probably does something similar to yours. It does not contain a full-blown XML parser, but is a little bit more robust than yours, I think
flow 08:25:21
I guess what I am trying to say is, that there exists probably a middle ground between having a full blown XML parser and too trivial parsing for things like that
moparisthebest 08:25:35
flow, thanks, I'll have a look https://github.com/igniterealtime/jxmpp/blob/master/jxmpp-core/src/main/java/org/jxmpp/xml/splitter/XmlSplitter.java
mathieui 08:49:23
So… it has come to my attention that profanity is essentially using 140-chars ids for every stanza
mathieui 08:49:53
was there any rationale for not limiting the length of the id attributes? we do cap the size of JID, afair
flow 08:56:37
mathieui, sadly the length restriction of JID parts is the
flow 08:56:42
exception, not the rule
flow 08:56:48
we do not limit pubsub IDs either
mathieui 08:59:40
indeed :/
mathieui 09:00:31
but as far as I understand it, the only properties we want for identifiers are "we can fit something non-guessable and non-bruteforceable in it, and all the better if we can write some arbitrary data for testing"
mathieui 09:00:44
but going over 30 or 50 characters seems overkill in every case
jonas’ 09:01:28
https://modules.prosody.im/mod_client_proxy.html would like to have a word with you ;)
flow 09:02:58
mathieui, yes, I think you are mixing two aspects here: the missing length restriction for many ID things in XMPP *and* the question what a sufficiently long ID is
flow 09:03:07
that said, 140 chars is way too long
mathieui 09:03:44
flow: true, but one could influence the other
flow 09:03:52
not really
flow 09:04:17
just because a random ID can be shorter, there may be very well use cases for long IDs which carry some semantics
flow 09:04:32
arguably, this may not be likely for stanza IDs, but very true for e.g. pubsub IDs
flow 09:05:17
and I am not sure if it isn't true in some case for stanza IDs
flow 09:06:28
in any way, we should point the profanity guys to https://www.grc.com/haystack.htm
dwd 09:32:30
Every now and then, I wonder about translation to a UUID in all cases. Mostly for the database efficiency.
edhelas 09:33:25
dwd +1
Zash 09:35:17
36 octets for 122(?) bits of entropy feels inefficient, plus they don't help with sort order
dwd 09:40:53
Yes, you need both a sequential id and a uuid in the database. But cheap to lookup a 128 bit number in an index.
larma 09:41:49
Also UUIDs can be compressed on wire if octet optimization is what you aim for
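The size argument can be made concrete with Python's standard `uuid` module. A version 4 UUID carries 122 random bits (6 of the 128 bits are fixed version and variant bits), takes 36 characters in its usual text form, 16 bytes in binary, and 22 characters as unpadded base64url. The function name `uuid_sizes` is made up for this sketch.

```python
import base64
import uuid


def uuid_sizes(u: uuid.UUID) -> dict:
    """Return the length of a UUID in three common representations."""
    compact = base64.urlsafe_b64encode(u.bytes).rstrip(b"=").decode()
    return {
        "text": len(str(u)),      # canonical 8-4-4-4-12 hex form
        "binary": len(u.bytes),   # what a database index would store
        "base64url": len(compact),
    }


print(uuid_sizes(uuid.uuid4()))
# {'text': 36, 'binary': 16, 'base64url': 22}
```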
jonas’ 09:43:41
something about https://github.com/ulid/spec
dwd 09:44:42
I mean, if we were talking XMPP 2.0, I'd be saying stanza ids are always UUIDv4 and any duplications detected cause immediate session termination.
Kev 09:46:43
Of course, even a 128bit integer in string form is less than 140 characters :D
Zash 09:48:01
Profanity with their signed IDs?
mathieui 09:48:02
the worst part of the profanity thing is that the id is a base64 of some form of humongous uuid in string form
mathieui 09:48:10
so it’s like thrice the size for no entropy gain
Zash 09:49:33
Uuid + hmac(key, uuid) IIRC
mathieui 09:50:05
but why
dwd 09:50:36
So you can detect forged responses and bounces, I imagine.
Ge0rG 09:50:50
Except nobody does it.
Ge0rG 09:51:09
I've looked into that source code before. But then I erased my memories with alcohol.
dwd 09:51:25
But it seems to me that the most interesting attacks based around forging responses would be based around witnessing an existing outbound and just copying the ID, which isn't affected by whether or not the id is signed.
Ge0rG 09:52:25
Yeah.
Ge0rG 09:53:08
IIRC I did a benchmark of "who does the longest message IDs" on my server, and #1 was bifrost, which stuck whole XMPP messages into the ID, for reasons nobody can anticipate, and #2 was profanity
jonas’ 10:05:14
what
edhelas 10:07:32
time to to XMPP stanza over XMPP messages ids
edhelas 10:07:42
*do
dwd 10:09:10
Years ago, a security audit on some stuff I did decided the id space was a side-channel attack. (As were arbitrary XML attributes etc).
dwd 10:09:37
I mean, it's fair enough, but in cases where that matters you need a protocol break anyway.
mdosch 12:23:06
https://github.com/profanity-im/profanity/issues/1520
mdosch 12:23:20
> Our brilliant plan to make Profanity famous among the XMPP community was a huge success.
> Profanity became a constant hit for discussions in the community. Everybody was and is talking about its huge IDs.
> Now that we have achieved making people aware of Profanitys existence it's time to make them love Profanity.
> So let's have shorter IDs.
Zash 12:24:58
"If you want an answer on the Internet, post the wrong answer first."
edhelas 12:25:28
the community manager of Profanity Inc. is a genius
edhelas 12:26:04
i propose that the XSF hire him
DebXWoody 12:32:16
#1520 \o/ We will be seen by movim :-)
Kev 12:47:21
Certainly the best issue I’ve read today.
deuill 13:04:39
Another one for XMPP2
jubalh 13:52:56
Hi guys
Zash 13:53:16
👋️
jubalh 13:54:51
Profanity actually used UUIDs before. We later added an identifier (instance id + barejid) and hashed that together with an uuid and take a base64 from it. The reason for this was that we didn't have any database and we used this to filter messages in MUCs.
Ge0rG 13:55:15
that sounds like insanity
jubalh 13:55:26
And yeah I just used a UUID instead of a shorter value to hash together because I was lazy and the XEP didn't forbid it :)
Zash 13:55:48
The XEP‽
jubalh 13:56:07
I mean no rfc or XEP said anything (to my knowledge) about the length of an ID
Zash 13:56:58
Maybe every RFC and XEP should have the text "Use your common sense." somewhere.
mathieui 13:57:00
jubalh, I am curious though, apart from the hack with the hmac, what is the base64 for?
Ge0rG 13:57:35
mathieui: because base-16 is not sufficiently inefficient
Ge0rG 13:57:43
so it needs to be wrapped
jubalh 13:58:24
Zash: it was not about common sense. It was about having few time and solving an issue quickly ;)
jubalh 13:59:14
mathieui: I don't remember right now. Maybe it had to do with using barejid or something and the allowed values? Not sure anymore.
jubalh 13:59:49
I'll read the code in the next days and change that. We also have a DB now so we could actually check in there if we send the message ourselves.
jubalh 14:00:42
And the part about making Profanity famous is half-true too :) Because I knew (and more people brought it to my attention) pretty long value and devs will notice ;)
Ge0rG 14:00:53
I noticed.
jubalh 14:00:58
Hahaha
Ge0rG 14:01:01
Other people noticed too.
jubalh 14:01:05
I love you too Ge0rG :)
mathieui 14:01:07
Ge0rG, you did not complain hard enough!
Ge0rG 14:01:21
I even spent ~half an hour trying to understand what devil has ridden you for doing it this way
moparisthebest 14:01:25
> Zash: it was not about common sense. It was about having few time and solving an issue quickly ;)
amen brother
Ge0rG 14:01:31
mathieui: I did, but probably in the wrong place
mathieui 14:01:32
But having the messages not display in movim because the DB field for the ID is too short was certainly more fun
Zash 14:01:56
Don't trust remote IDs?
jubalh 14:02:20
Isn't having a limit length for IDs in the DB even more wrong? :) There is no max length defined AFAIK
moparisthebest 14:02:21
a way to send messages that only display in certain clients you say mathieui ? nice attack
mathieui 14:02:46
jubalh, edhelas trusted "common sense", and having an index on a TEXT column is meh
Ge0rG 14:03:21
also relevant here: https://dev.mysql.com/doc/refman/8.0/en/innodb-limits.html#:~:text=The%20index%20key%20prefix%20length,REDUNDANT%20or%20COMPACT%20row%20format.
jubalh 14:03:53
i never trust common sense. I have seen all kinds of IDs so our DB doesnt limit it for example. There are some clients that start from 1 and increase their IDs upon each message. If you restart they start from 1 again. I think Pidgin did something funny too (at one point)
mathieui 14:04:24
jubalh, is it still the case though? (for the increasing ones) I thought every client switched to some kind of uuid
Ge0rG 14:04:27
jubalh: yeah, that was funny.
Ge0rG 14:04:32
mathieui: BWAHAHAHA
Ge0rG 14:04:34
sorry.
mathieui 14:04:43
let me dream Ge0rG
mathieui 14:04:45
you evil man
Zash 14:05:30
Why did I even click a link to MySQL docs‽
jubalh 14:05:30
mathieui: I'm not sure. I just remember seeing it. And I think Profanity was using the increasing IDs too. And one of my first contributions was to switch it to UUIDs if I remember correctly.
Ge0rG 14:05:32
there was a "nice" bug in bifröst, where its xmpp backend de-duplicated stanzas based on their ID, and presence stanzas that didn't have an ID got into the same deduplication slot and were dropped
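The failure mode Ge0rG describes is easy to reproduce: if a missing `id` is used as a deduplication key, every stanza without an ID lands in the same slot. The sketch below is a hypothetical deduplicator (not bifröst's code; class and method names are invented) that never treats ID-less stanzas as duplicates and scopes IDs per sender.

```python
from collections import OrderedDict


class StanzaDeduplicator:
    """Remember recently seen (sender, id) pairs, bounded in size."""

    def __init__(self, max_entries=1024):
        self._seen = OrderedDict()
        self._max = max_entries

    def is_duplicate(self, sender, stanza_id):
        if not stanza_id:
            # No id (None or empty): nothing to compare, so never drop it.
            return False
        key = (sender, stanza_id)  # ids are only meaningful per sender
        if key in self._seen:
            self._seen.move_to_end(key)
            return True
        self._seen[key] = None
        if len(self._seen) > self._max:
            self._seen.popitem(last=False)  # evict the oldest entry
        return False


dedup = StanzaDeduplicator()
print(dedup.is_duplicate("a@x.example", None))  # False
print(dedup.is_duplicate("b@y.example", None))  # False, not a "duplicate"
```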
Ge0rG 14:05:53
smack will use $random_prefix-$autoincrement
Kev 14:06:05
ids do have a max length, BTW.
Zash 14:06:17
Kev, ~10MB?
Ge0rG 14:06:18
Kev: where is it defined? In XML?
jubalh 14:06:20
Kev: which is? And where is it written?
Kev 14:06:25
Because a stanza is allowed to have a max length of anything down to 10k, it means an id can’t be longer than that, less the rest of the stanza ;)
jubalh 14:06:48
you see guys. So Profanity is all good. Maybe we should stay as is then :)
Kev 14:07:07
I’d say you had a bit more room to play with, even.
jubalh 14:07:18
I'll think of something
Zash 14:07:38
(unique stream-id + counter), Ge0rG? Like the thing we keep mentioning as The Best Thing? 🙂
Ge0rG 14:07:46
Kev: there is no upper limit on the upper limit for stanza sizes, so technically it's unbounded.
Kev 14:07:54
hmac(base64(stanza))?
Ge0rG 14:07:54
Zash: not quite.
mathieui 14:08:01
jubalh, tbh if you get the binary uuid & hash and encode it in ascii85 (instead of picking the ascii repr and growing it with base64) you would fit easily into most lower limits without losing information :p
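mathieui's suggestion can be compared against a "textual" construction in a few lines of Python. This is a sketch under loud assumptions: the exact Profanity scheme is not shown in this log, so the UUID plus HMAC-SHA256 layout, the key, and both function names are invented for illustration. Also note that Ascii85 output can contain `<`, `&` and quote characters, which would need escaping inside an XML attribute.

```python
import base64
import hashlib
import hmac
import uuid


def text_style_id(key: bytes, u: uuid.UUID) -> str:
    """base64 over the *string* forms of the UUID and its HMAC (assumed layout)."""
    mac = hmac.new(key, str(u).encode(), hashlib.sha256).hexdigest()
    return base64.b64encode((str(u) + mac).encode()).decode()


def binary_style_id(key: bytes, u: uuid.UUID) -> str:
    """Ascii85 over the *binary* UUID and HMAC, same information content."""
    mac = hmac.new(key, u.bytes, hashlib.sha256).digest()
    return base64.a85encode(u.bytes + mac).decode()


key = b"hypothetical-instance-key"
u = uuid.uuid4()
print(len(text_style_id(key, u)))    # 136 characters
print(len(binary_style_id(key, u)))  # at most 60 characters
```

Truncating the HMAC would shrink the binary form further, at the cost of a weaker forgery check.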
Zash 14:08:02
Almost
Ge0rG 14:09:14
Zash: https://github.com/igniterealtime/Smack/blob/48f5e349b9a318ba2a1d82aef9fa069e62da10bb/smack-core/src/main/java/org/jivesoftware/smack/packet/id/StandardStanzaIdSource.java#L30
Zash 14:09:44
Ge0rG, so, per process?
Ge0rG 14:10:39
Zash: yes
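The "random prefix plus counter" shape discussed here can be sketched as follows. This is not Smack's implementation (that is the Java file linked above); the class name, prefix length and separator are assumptions. The prefix is drawn once per instance, so IDs are unique within that instance and only probabilistically distinct across instances.

```python
import itertools
import secrets


class PrefixCounterIdSource:
    """Generate ids of the form <random prefix>-<counter>."""

    def __init__(self, prefix_bytes=5):
        self.prefix = secrets.token_hex(prefix_bytes)  # 10 hex chars by default
        self._counter = itertools.count()

    def next_id(self) -> str:
        return f"{self.prefix}-{next(self._counter)}"


ids = PrefixCounterIdSource()
print(ids.next_id())  # e.g. '3f9c0a1b2d-0'
print(ids.next_id())  # same prefix, counter 1
```

Creating one source per stream rather than per process would give the "unique stream-id + counter" variant Zash mentions.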
deuill 14:25:43
Perhaps there can/should be an XEP that provides some sort of best practice for ID generation? There's been a number of advances over UUID algorithms that avoid clashes while retaining sortability.
Zash 14:26:12
deuill, are you volunteering? 😉
deuill 14:26:37
Twitter Snowflake IDs were, I think, the first of their kind, but others have emerged since.
Ge0rG 14:27:14
deuill: yes, and that XEP should be RFC6120'
Zash 14:27:18
I looked at those. Also LUID and various variants of concat(timestamp, random)
jonas’ 14:27:40
ITYM ULID
Zash 14:27:58
[IDLU]{4} 🤷️
mathieui 14:28:06
IIII
deuill 14:28:09
Actually, maybe that's a good way for me to contribute. What I can't tell is why the original RFC says stanzas SHOULD have a unique ID assigned (not MUST?). I need to re-read those passages.
Zash 14:28:36
For fire-and-forget, who cares what the ID is
deuill 14:29:16
I think some of these algorithms assume Infinite computational power as well, or at least cycles to spare.
deuill 14:30:51
Conversations apparently tries to de-dup based on ID (even empty IDs)? People's definition of fire-and-forget may vary lol
Zash 14:31:11
Here's a start:
```
xeps$ pandoc -t ./tools/2xep.lua >inbox/best-id.xml <<.
> % Best practices for stanza IDs
>
> Use UUIDv4.
> .
```
Zash runs away and hides `base64url(random.bytes(12))` 14:32:16
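Zash's `base64url(random.bytes(12))` is pseudocode; one way to spell it in Python is shown below. The function name is invented. Twelve random bytes give 96 bits of entropy in exactly 16 URL-safe characters, with no padding needed because 12 is a multiple of 3.

```python
import base64
import secrets


def short_random_id(nbytes: int = 12) -> str:
    """Return nbytes of CSPRNG output as unpadded base64url text."""
    raw = secrets.token_bytes(nbytes)
    return base64.urlsafe_b64encode(raw).rstrip(b"=").decode()


print(short_random_id())       # e.g. 'q0V3mJt1x_8ZbD2k'
print(len(short_random_id()))  # 16
```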
deuill 14:32:48
Co-Authored-By: Alex P
deuill 14:33:08
Thanks Zash
Ge0rG 14:37:14
jonas’: I didn't make a github PR for the CVE formatting back then, but I'd still love to move forward with it.
Ge0rG 14:37:29
jonas’: with your Editor hat on, what do you suggest me to do next?
jonas’ 14:37:51
Ge0rG, remove the background color, make a PR?
Ge0rG 14:38:08
jonas’: but I like the background color!
Zash 14:38:19
Does anyone know offhand where it says that 128 bit IDs should be enough until the end of time, for use as reference?
Zash 14:38:46
(UUID is less because version numbers and stuff tho)
jonas’ 14:38:55
Ge0rG, I don’t, it makes for a lack of contrast. And if even *I* find the contrast lacking, any a11y tool will throw it in your face.
jonas’ 14:39:27
Zash, we use 128 bit strength for cryptography, suggesting that you cannot reasonably brute-force a 128 bit thing
Zash 14:40:18
jonas’, cargo cult?
deuill 14:40:21
(Tangentially related but this was an interesting comparison of GUID algorithms, albeit all implemented in Go: https://blog.kowalczyk.info/article/JyRZ/generating-good-unique-ids-in-go.html)
jonas’ 14:41:07
Zash, https://crypto.stackexchange.com/a/48669/16902
Ge0rG 14:42:06
jonas’: you only disliked the contrast to the links, right? Can I make the links red-colored in the cve box to satisfy you?
jonas’ 14:42:19
Ge0rG, no
jonas’ 14:42:26
also the text<->background was lacking IMO
jonas’ 14:42:42
but I may be misremembering
jonas’ 14:43:01
you could refresh my memory with a screenshot
Ge0rG 14:45:23
jonas’: https://op-co.de/tmp/xep-template.html#example-1
jonas’ 14:45:41
so red-on-red would definitely make this worse
jonas’ 14:45:57
and yeah, weave is contrast-error marking all of that
jonas’ 14:46:09
ok, the black-on-red not, but green-on-red it doesn’t like
Ge0rG 14:46:28
jonas’: I've reduced contrast of the red, can you shift+reload
Ge0rG 14:46:33
maybe you still have my old version cached
jonas’ 14:46:38
Ge0rG, same
Zash 14:48:59
jonas’, IIRC even counting to 2⁶⁴ is boil-the-oceans levels, but for IDs it's more about the chances of accidentally generating the same ID twice during the lifetime of the scope.
jonas’ 14:49:01
Zash, https://en.wikipedia.org/wiki/Birthday_attack#Mathematics
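Zash's concern (accidental collisions rather than brute force) is the birthday problem. For n IDs drawn uniformly from a space of N = 2^bits values, the collision probability is approximately 1 - exp(-n(n-1)/(2N)). A small Python sketch of that approximation follows; the function name is made up, and the approximation is only good while n is much smaller than N.

```python
import math


def collision_probability(n: int, bits: int) -> float:
    """Approximate chance that n uniformly random ids of `bits` bits collide."""
    space = 2 ** bits
    # -expm1(-x) equals 1 - exp(-x) but stays accurate for very small x
    return -math.expm1(-n * (n - 1) / (2 * space))


# About 77,000 ids are enough for a 50% collision chance at 32 bits ...
print(round(collision_probability(77_163, 32), 3))  # 0.5
# ... while a trillion UUIDv4-sized (122 random bits) ids stay negligible.
print(collision_probability(10**12, 122) < 1e-12)   # True
```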
Ge0rG 14:49:42
jonas’: maybe it's because the default text color is #444
Zash 14:49:54
Oh look, it has a table
jonas’ 14:50:01
Ge0rG, it’s complaining about the coloured text, not about the grayscale text
Ge0rG 14:51:13
jonas’: another shift-reload?
jonas’ 14:52:19
I don’t see the link as link anymore
jonas’ 14:52:22
looks black to me
jonas’ 14:52:39
(but weave is happy about the contrast… which is pointless though)
jonas’ 14:53:16
welcome to the world of making an accessible thing.
Ge0rG 14:53:37
jonas’: is your monitor calibrated?
jonas’ 14:53:54
do I need a calibrated monitor to read XEPs?
Ge0rG 14:54:10
jonas’: you need a calibrated monitor to complain about lack of contrast ;)
jonas’ 14:54:19
no, I don’t
Ge0rG 14:54:23
jonas’: I *really* want that reddish background, because without it just looks naked
jonas’ 14:54:30
sorry for your loss
jonas’ 14:54:55
I won’t accept a reddish background just because you prefer it if it renders XEPs less readable for people with visual deficiencies or under bad lighting conditions or whatever
jonas’ 14:55:09
I took great care during the XEP redesign to avoid such pitfalls, I’m not going to let you ruin that ;P
Ge0rG 14:55:40
said the person who uses #444 instead of black for text.
Ge0rG 14:56:04
jonas’: shift-reload again for the naked box.
Daniel 14:56:09
> said the person who uses #444 instead of black for text.
That's better than #666
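The contrast complaints can be quantified with the WCAG 2.x contrast ratio, which is what accessibility checkers typically report. The sketch below implements that formula in Python; the function names are invented, and a white page background is an assumption since the log does not say which background the text sits on.

```python
def _channels(hex_color: str):
    """Parse '#rgb' or '#rrggbb' into three floats in 0..1."""
    h = hex_color.lstrip("#")
    if len(h) == 3:
        h = "".join(ch * 2 for ch in h)
    return [int(h[i:i + 2], 16) / 255 for i in (0, 2, 4)]


def relative_luminance(hex_color: str) -> float:
    """WCAG 2.x relative luminance of an sRGB colour."""
    def linear(c):
        return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4
    r, g, b = (linear(c) for c in _channels(hex_color))
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


def contrast_ratio(fg: str, bg: str) -> float:
    """WCAG contrast ratio, from 1 (identical) to 21 (black on white)."""
    hi, lo = sorted((relative_luminance(fg), relative_luminance(bg)), reverse=True)
    return (hi + 0.05) / (lo + 0.05)


for colour in ("#000", "#444", "#666"):
    print(colour, round(contrast_ratio(colour, "#fff"), 2))
```

On white, black text reaches the maximum of 21:1, `#444` comes out at roughly 9.7:1 and `#666` at roughly 5.7:1, so grey text is still readable but has less headroom before a coloured background drops it too low.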
jonas’ 14:56:16
Ge0rG, LGTM
Ge0rG 14:56:20
Le Sigh.
Ge0rG 14:56:31
now how do I add a huge red warning sign there?
jonas’ 14:56:32
but I knew that already, I tested your design with background-color disabled and was immediately happy
Ge0rG 14:56:52
so we can't both be happy, right?
jonas’ 14:57:17
yes
Ge0rG 14:57:56
well, I can be happy if that box has a huge red ⚠️ in the left, but I don't know how to make it
jonas’ 14:58:02
Ge0rG, you need to do something with <img/> or background-image
Kev 14:58:06
Why should we be making these things prominent anyway?
jonas’ 14:58:09
to get a warning sign without disturbing a11y tools
Kev 14:58:14
Security considerations aren’t, for example.
Sam 14:59:32
Yah, this seems like something to be aware of but not something that you absolutely definitely must see at all costs (as opposed to security considerations which are that)
Sam 14:59:38
These are just non-normative examples of what can go wrong.
jonas’ 15:00:16
and examples of where the security considerations have been ignored, so placing them really prominently is something I’d deem useful
Ge0rG 15:00:36
What jonas’ said
Sam 15:01:06
Prominent placement is fine, but it's not important to specifically draw the users attention to them over other things like the security considerations
Ge0rG 15:01:35
let's add a red background to the security considerations then!
jonas’ 15:02:51
and make them blinking /s
Ge0rG 15:03:26
jonas’: at least it won't complain about contrast then!
Zash 15:03:36
`<blink><marquee>🚨️ ⚠️ ACHTUNG!</blink></marquee>`
jonas’ 15:03:53
so my opinion is roughly: If you don’t read the security considerations, you are a bad developer. No matter how emphasized they are compared to other sections. But there’s nothing wrong with us pointing out and highlighting exceptional cases where there are documented, wide-spread exploits because of such neglect with extra emphasis.
Kev 15:11:23
This is not a hill for me to die on, but I don’t understand why we would say the most important thing about a XEP is a section saying someone once got it wrong.
Sam 15:12:00
+1 ^
jonas’ 15:12:36
do you have other proposals to improve the situation that people clearly ignore security considerations?
Kev 15:13:17
Accept that it’s not the spec author’s responsibility, nor is it within their ability, to make implementors read the spec?
Ge0rG 15:13:42
write shorter specs.
Ge0rG 15:13:51
also: write specs that are harder to implement in insecure ways.
Kev 15:13:52
Focus on writing clear specs that are easy to get right.
jonas’ 15:13:53
Kev, that’s not an improvement to the situation, only to your (our) perception of it
Sam 15:14:01
What Ge0rG said.
jonas’ 15:14:13
but we can’t fix e.g. carbons and such
Ge0rG 15:14:20
XMPP 2.0!
Sam 15:15:33
I don't think this will do what you think you're doing either though. This is just a distraction that may or may not actually have anything to do with the security considerations and may or may not actually contain anything of value that's likely to be repeated.
Sam 15:16:01
It's a nice thing to have, it just doesn't seem like the thing we want to drag users eyes to (and possibly away from other important normative things like security considerations)
Sam 15:16:18
Everyone only reads the examples and not the normative text, let's not also make them only read the CVEs and not the security considerations.
jonas’ 15:16:32
Sam, but they’re *already* not reading the security considerations.
Sam 15:16:37
So don't make it worse.
Sam 15:16:55
It's a separate problem is what I'm saying
MattJ 15:17:14
I don't have full context here, and don't have time to review the entire conversation, but if this is about relevant CVEs being highlighted in XEPs, that's definitely a thing we should do
Sam 15:17:28
Add CVEs, that's a good idea I think. If you want to make people read the security considerations, highlight that, not the new non-normative may or may not exist or be relevant examples.
Ge0rG 15:17:54
MattJ: it's about the format of that: https://op-co.de/tmp/xep-0280.html#security
Sam 15:18:38
I have no strong feelings about how this should be formatted, FWIW, I just want to suggest that adding more and more stuff isn't doing what you think it's doing.
Sam 15:19:08
On first impressions the one Ge0rG just linked looks great to me and is enough extra formatting.
Kev 15:19:10
Especially without context.
MattJ 15:20:53
Ok, if it's just about bikeshed formatting, I definitely have other things to do... :)
MattJ 15:21:53
Oh no, but I can't help it... FWIW I would group all the CVEs into a single box with "The following security vulnerabilities have previously been found in some implementations of this specification:"
Ge0rG 15:22:22
MattJ: that can be achieved by different XSLT, I'm sure.
Kev 15:23:38
MattJ: This isn’t all CVEs though, Jonas’ said it’s only those that were widely applicable.
jonas’ 15:23:59
"worth highlighting" for whatever definition of that
MattJ 15:24:02
I didn't say "all", did I?
Kev 15:24:03
Oh, and exploited widely, in fact.
Kev 15:24:36
You didn’t, but just saying “here are some CVEs” implies it’s somewhat more exhaustive than CVEs that have been widely exploited.
MattJ 15:25:59
Then insert "notable" somewhere, or something. Or don't :)
Kev 15:26:51
Anyway, if we’re only talking about vulnerabilities that were widely exploited (which has been precisely 0 vulnerabilities that I can remember), my concerns about them being overemphasised are probably overblown.
jonas’ 15:27:25
Kev, sorry, I didn’t mean to say "exploited"
jonas’ 15:27:42
but widely exploitable with simple-enough PoCs
jonas’ 15:28:04
but I am also fine with less strict constraints
flow 15:35:32
listing related CVEs is probably fine, but not like https://op-co.de/tmp/xep-0280.html#security please
flow 15:36:29
the text of the security considerations is more important than the related CVEs. but the currently proposed visualization puts the focus on the CVEs
jonas’ 15:40:25
flow, see above for my rationale for that
flow 15:41:40
jonas’, not sure if I looked at the right rationale, at least I fail to see how this counters my argument
jonas’ 15:42:02
> so my opinion is roughly: If you don’t read the security considerations, you are a bad developer. No matter how emphasized they are compared to other sections. But there’s nothing wrong with us pointing out and highlighting exceptional cases where there are documented, wide-spread exploits because of such neglect with extra emphasis.
flow 15:44:54
that is what I read. But I don't interpret this as something that argues in favor of making the CVEs visually more prominent than the security considerations
flow 15:45:12
if anything, be sparse with visual effects
jonas’ 15:45:28
-ENOTIME
flow 15:45:33
It feels like Ge0rG had the changelog of a software in mind when he designed this
Sam 15:46:39
What flow says. I don't have strong feelings about the current formatting, but we definitely shouldn't do much more than this (stop signs and big red backgrounds and whatever was being discussed before). And even as is now I still think it will be just like examples where users automatically read the shiny things and ignore the text assuming the examples (or CVEs) cover all the things they need to know.
Ge0rG 15:47:41
Sam: if the people read the CVE, that's a huge win already.
Sam 15:48:08
I kind of doubt it in most cases, especially if it comes at the cost of ignoring the text, but maybe.
Ge0rG 15:49:05
Sam: as jonas’ said before, the text is already being ignored
flow 15:49:39
I am not sure if adding big red boxes under the text being ignored helps with that
Sam 15:49:53
Ge0rG: right, and this won't fix it and could make it worse.
Sam 15:50:39
And now the only thing they maybe look at is some random examples that may or may not actually be a good representation of the actual problems and weren't actually a part of the XEP process.
flow 15:51:23
If anything, a short, potentially visually featured, sentence at the beginning of the xep that states "This XEP has Security Considerations, make sure to read them", would be better IMHO
Ge0rG 15:53:18
well, if they read the CVE text, the chances are higher that they'll come to the same conclusion that's implied in the security considerations
Sam 15:54:22
I doubt it. I'd bet most of them end up with one or two things that are mostly unrelated, or only cover one tiny part of the security considerations.
Ge0rG 15:55:52
Well, the list of CVEs is evidence that the current approach does *not* work. Let's try the alternative suggestion as an A/B test then
Sam 15:56:55
If that's actually the problem you're trying to solve, flow's alternative seems better.
Zash 15:57:28
A/B tests \o/
Sam 16:01:25
(FWIW if we had a way to actually A/B test this that would be great, but I don't think we do)
Ge0rG 16:03:11
let's bikeshed A/B testing infrastructure then!
Daniel 16:05:43
> Well, the list of CVEs is evidence that the current approach does *not* work. Let's try the alternative suggestion as an A/B test then
Do we have evidence that the clients affected by the CVE ever read the XEPs?
jonas’ 16:06:08
how would they know how to enable carbons if not?
Daniel 16:06:12
The developers of those clients
Ge0rG 16:06:15
Daniel: are you saying that you can implement a XEP without reading it?
Daniel 16:06:19
jonas’: gajim XML console
jonas’ 16:06:24
Daniel, please no
Daniel 16:06:28
Looking at other implementations
jonas’ 16:06:33
please no.
jonas’ 16:06:41
I’m going to get dinner now
moparisthebest 16:06:43
reading the xeps doesn't even matter, what clients/servers send/accept in practice is what matters :D
jonas’ 16:06:45
because I can’t take that
Daniel 16:06:50
I'm fairly convinced that this is how a lot of those clients were developed
flow 16:08:03
ahh, the thought of XMPP devs not even looking briefly at XEPs when implementing them
flow 16:08:07
enough xsf@ for today
jonas’ cries 16:08:18
Kev 16:08:20
I *know* there have been implementations of features done by looking at sent/received protocol, and not the XEPs.
Kev 16:08:32
That’s not even hypothetical :)
Daniel 16:09:22
That's how I implemented <session/>
moparisthebest 16:09:28
aren't there plenty of things that are well known not to even be implementable looking at XEPs ? just "shared knowledge" stuff ?
Daniel 16:09:40
Didn't know what it was. Just knew that sending this magic made things work
Sam 16:10:44
I'd be willing to be that between what Daniel said, looking at the XEP but only at the examples, and copy/paste from Stack Overflow you'd cover over 99% of all XMPP development. I don't think I'm joking when I say that I'd put money on this.
moparisthebest 16:10:51
if a client dev implemented OMEMO from the XEP today they'd probably be pretty sad when they couldn't interop with a single other client ?
Sam 16:10:51
*bet
Sam 16:11:18
And I'd bet that the other <1% are 99% people in this room :)
moparisthebest 16:12:10
there is a massive difference between what client/servers *should* accept/handle and what they actually *will* accept/handle in practice, you can't get that from looking at XEPs
Ge0rG 16:12:18
moparisthebest: that's actually also a statement about the horrible state of the OMEMO XEP
moparisthebest 16:12:37
same situation with RFCs, and everything in web-land too
moparisthebest 16:12:44
computers: they bad
mathieui 16:13:35
Sam, knowing ex-coworkers who have worked on XMPP for a client website project, that probably sums up 99% of all private-sector XMPP development for people not familiar with standards, yes
mathieui 16:13:53
"I just loaded xmpp.js, why doesn’t it work!"
moparisthebest 16:14:15
when I implemented XEP-0368 in Conversations I didn't know the XSF or XEPs existed
Sam 16:14:45
mathieui: indeed, I was specifically thinking of a colleague at HipChat who asked me a question about the protocol and I wasn't sure off the top of my head so I said "Here, pull up RFC 6120" and their response was "wait, you actually read these things?"
Sam 16:15:36
Although as far as libraries go, "I just loaded <library>, it should work" seems reasonable.
moparisthebest 16:15:55
my reasoning was roughly: email clients are fine doing direct-tls instead of starttls without any formal specification, why not XMPP
Sam 16:15:57
I mean, assuming they actually called it somehow, you know what I mean.
mathieui 16:16:03
Sam, well, except you need to at least know some part of the semantics and how they relate with what you want to do
Kev 16:16:28
I think there’s multiple things here.
* People understanding how they should learn what to do
* People being willing to do stuff properly
* People being competent to do stuff properly
* Doing stuff properly being possible to do
* Doing stuff properly being easy to do
Kev 16:16:30
And probably others.
mathieui 16:16:41
Most certainly others, yes.
Sam 16:16:57
mathieui: yah, I feel like that's a library design problem though. For most people I suspect they want to call conn = xmpp.Connect("domain.com"); conn.SendMessage("Hi") or something. I really want to eventually get my own stuff to that point
Zash 16:17:02
Do we need to stick "The Official XMPP $Language SDK" sticker on some libraries?
Kev 16:17:07
I think it’s unduly naive to think we can have much influence over some of those, and showing that people aren’t getting it Right doesn’t, in itself, mean that if we just shout at them louder it’ll get better.
Sam 16:17:27
What Kev said.
Kev 16:17:34
OTOH, some of them (particularly the last two, but also the first) are entirely within our remit.
mathieui 16:18:02
Sam, well, if I remember correctly it was something like they did not expose bosh or websocket in their ejabberd container
Sam 16:18:27
Fair; ops is hard no matter what.
moparisthebest 16:18:36
and don't get me wrong XEPs are super valuable and we should cram all the relevant info needed in them, if only for a place to point to other than "look what client X does" which sucks
moparisthebest 16:19:16
still, many times you have to dig into what X does, and assume many/most people use this instead of XEPs on how to learn it
moparisthebest 16:19:42
TLS had the same problem with people hardcoding version numbers right?
moparisthebest 16:20:03
"it works for the current input" not realizing it'd break on 1.3, 1.3 ended up working around it entirely
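(Editor's sketch, not part of the conversation.) The version-hardcoding problem described above can be illustrated roughly as follows. The function names and the `SERVER_MAX` constant are hypothetical; only the numeric wire codes are the real TLS version values. The "workaround" refers to TLS 1.3 keeping the legacy version field at the TLS 1.2 value and advertising 1.3 through a separate extension.

```python
# Real TLS wire version codes; everything else here is illustrative.
TLS_1_0, TLS_1_1, TLS_1_2, TLS_1_3 = 0x0301, 0x0302, 0x0303, 0x0304

# Assumption: this hypothetical server supports up to TLS 1.2.
SERVER_MAX = TLS_1_2


def brittle_accept(client_version):
    """Hardcoded check: accepts only the versions known when it was written."""
    return client_version in (TLS_1_0, TLS_1_1, TLS_1_2)


def tolerant_negotiate(client_version):
    """Pick the highest version both sides support, or None if too old."""
    if client_version < TLS_1_0:
        return None
    return min(client_version, SERVER_MAX)


if __name__ == "__main__":
    # A newer client is rejected outright by the hardcoded check...
    print(brittle_accept(TLS_1_3))
    # ...while proper negotiation simply falls back to the server's maximum.
    print(hex(tolerant_negotiate(TLS_1_3)))
```

The brittle variant "works for the current input" and fails as soon as a client offers a newer version, which is the failure mode the legacy version field in TLS 1.3 was designed to sidestep.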
moparisthebest 16:54:36
https://mastodon.xyz/@nextcloud/106058947562901204 :)