XSF Discussion - 2018-02-12

Seve 06:23:24
daniel, Matrix already does that. http://mirror.onet.pl/pub/mirrors/video.fosdem.org/2018/H.1309%20(Van%20Rijn)/matrix_webvr.webm
rion 06:52:00
looks cool
zinid 07:01:56
daniel: Psi goes crazy when it receives muc self-presence: it renders an "empty" participant in the muc roster
rion 07:06:23
O_o
zinid 07:09:25
yeah
rion 07:11:09
What server should I install to have such a self-presence? ejabberd?
zinid 07:12:55
rion, no, it's on local machine
rion 07:13:00
in any case, over the last two days I rewrote the muc roster, so it won't happen anymore. not committed yet
zinid 07:13:08
good
zinid 07:13:26
the problem is that now I'm not sure if other clients don't break though
zinid 07:14:06
rion, Psi requests MUC vCard on every join?
rion 07:14:14
yep
rion 07:14:46
I think with self-presence and caps I'll change this too
rion 07:18:15
and I have to improve avatar caching for mucs too, as well as the algorithm for requesting them. otherwise it's a lot of traffic on join.
zinid 07:21:27
yeah...
zinid 07:21:38
poezio has the same problem wrt avatars 😉
zinid 07:46:27
gajim seems to be doing fine, doesn't render anything strange
Seve 10:21:29
By the way, regarding the email I sent to JUser, is there a better mailing list for that, please?
jonasw 10:21:48
I still haven’t subscribed to juser :/
Seve 10:22:20
That's why I would like to send it somewhere else too, seems not many people are subscribed there
jonasw 10:22:40
Seve, I think operators would be more fitting, maybe
jonasw 10:22:45
possibly with a cross-post to members@
jonasw 10:23:19
I’m also not sure if a separate MUC for this is a good idea
Zash 10:23:35
Posted something to some KDE list?
jonasw 10:25:52
frankly, I’m not sure XMPP can fulfil the requirements at this instant in time
jonasw 10:27:07
> Stickers are available
jonasw 10:27:09
we doomed
Zash 10:27:29
Find out which are requirements and which are wishlist
jonasw 10:27:45
Zash, they don’t even know that yet
jonasw 10:27:50
> Requirements are not prioritized (incl. must-have vs. nice-to-have), that comes at a later step.
Zash 10:32:58
jonasw: All are wishlist. There's only one requirement, that everyone you care about is already using it. :)
Seve 10:38:26
jonasw, the point of the MUC is to discuss this topic with people interested in helping out. Almost everybody here would find this discussion annoying here.
jonasw 10:40:05
Seve, I’m running out of screen space for more MUCs.
marc 10:47:04
Ge0rG, jonasw can you review https://git.zapb.de/xeps.git/commit/?h=fix_xep_0401&id=52725e993987f205dc253cc5a4e6937fe3955d81 and merge please?
jonasw 10:48:57
marc, EBUSY right now
jonasw 10:49:28
marc, if you refuse to make a github PR (which would really be the most useful thing for me), please send an email with the first line of the commit message as subject to editor@xmpp.org
marc 10:49:49
jonasw, I can do a GitHub PR if you like
jonasw 10:49:57
marc, that would be much appreciated, thanks
marc 10:50:03
jonasw, you're welcome
jonasw 10:50:12
gotta run now, see you
marc 10:50:20
bye
marc 10:55:52
done
jonasw 11:47:10
zinid, regarding the server capabilities invalidation, how about a message?
jonasw 11:47:57
server changes caps -> server sends <message/> to local clients. for server-to-server links, it could simply close the link so that the peer re-opens it, thus getting new stream features
zinid 11:56:16
jonasw, closing and opening thousands of s2s links will create an avalanche effect
jonasw 11:56:25
right
zinid 11:56:27
we see this on jabber.ru for example
zinid 11:57:00
so I'm just thinking about a nonza
zinid 11:57:06
stream-level element
jonasw 11:57:30
zinid, thanks for your input, I’ll post a few suggestions to the mailing list and I hope you’ll comment on them :)
zinid 11:57:41
sure 😉
SaltyBones 12:18:17
Is there a good resource on the problems with message ids?
jonasw 12:18:45
zinid, replied, happy to hear from you :)
jonasw 12:18:48
SaltyBones, I’m not sure
SaltyBones 12:18:53
I'm looking at XEP-0359 but the problems mentioned at the summit are not in there.
jonasw 12:19:05
maybe the summit notes as first starting point
SaltyBones 12:23:21
1.2 Stable IDs Discussion on entropy ensues. dwd: <stanza-ids> don't work for unique addressing because we don't trust clients to do them properly. Kev: lots of possible attacks with spoofable stanza IDs.
SaltyBones 12:23:29
Not verbose enough.
jonasw 12:24:26
SaltyBones, I think the gist of that is: we need to generate stanza IDs on the server because weak/constructed stanza IDs are a problem. but the client needs to know the stanza ID for archive operations and possibly other things
zinid 12:26:38
jonasw, well, dunno, to me a message looks too complex
zinid 12:27:13
can't we just send <c/> as a nonza?
jonasw 12:27:56
zinid, I think RFC 6120 would slap you in the face for that ;-)
jonasw 12:28:02
really, it needs negotiation
jonasw 12:28:20
it’s a shame that we have no way for a client to send stream features :(
zinid 12:28:39
yeah, because some implementations may close a stream if they receive unrecognized nonzas
jonasw 12:28:45
yupp
jonasw 12:28:47
I know at least one.
zinid 12:29:00
however, I tried to implement such behaviour (closing a stream) and got lots of problems and left the idea
jonasw 12:29:26
never had issues so far; did you encounter things which sent unsolicited nonzas?
zinid 12:29:54
alas, I've already forgotten what that was exactly 😕
jonasw 12:31:02
zinid, the best we could do is probably say "okay, if the client sent a presence update with caps, we can send a nonza with a caps update"
jonasw 12:31:13
and similarly for servers ("offered caps in stream feature -> send nonza")
jonasw 12:31:34
but I don’t know enough about s2s to be sure that this works
jonasw 12:32:23
this would definitely need a namespace bump though
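A minimal sketch of the gating rule jonasw describes above: only send a caps-update nonza to a peer that has already shown it understands caps. The `Peer` record, its field names, and `may_send_caps_nonza` are hypothetical; no XEP defines this nonza, and the discussion itself leaves open whether the rule is sound for s2s.

```python
from dataclasses import dataclass


@dataclass
class Peer:
    """Hypothetical record of what a stream peer has shown us so far."""
    is_server: bool
    sent_caps_in_presence: bool = False     # c2s: client included caps in a presence update
    offered_caps_in_features: bool = False  # s2s: peer offered caps in its stream features


def may_send_caps_nonza(peer: Peer) -> bool:
    """Return True if an unsolicited caps-update nonza should not surprise the peer."""
    if peer.is_server:
        return peer.offered_caps_in_features
    return peer.sent_caps_in_presence
```

The point of the check is to avoid an extra negotiation round-trip while still never sending the nonza to an implementation that might close the stream on unrecognized elements.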
zinid 12:33:17
well, we're talking about new caps xep?
zinid 12:33:21
we can do it there
jonasw 12:33:29
yeah, it requires a namespace bump there too ;-)
zinid 12:33:50
can't we just negotiate this feature?
zinid 12:33:56
not sure how
jonasw 12:34:03
negotiation will add a round-trip
jonasw 12:34:21
which people will react very adversely to
zinid 12:34:28
ah
jonasw 12:34:35
but please comment on-list, hoping to get more input on that
zinid 12:34:43
I don't know what to say
jonasw 12:35:05
what you said here, essentially?
SaltyBones 12:49:32
There are stanza-id and origin-id in XEP-0359. Somebody at the summit also mentioned "message-id" is that defined somewhere?
MattJ 12:50:05
They were probably referring to the id attribute, I guess?
SaltyBones 12:51:05
MattJ, this https://tools.ietf.org/html/rfc6120#section-8.1.3 ?
MattJ 12:51:26
Yes
SaltyBones 12:51:46
thx
MattJ 12:52:21
The reason XEP-0359 exists is that the id attribute is controlled by (and can only be trusted by) the original sender of the stanza
MattJ 12:53:18
It's really just designed for tracking errors, though a couple of XEPs have re-used it for other purposes
SaltyBones 13:00:09
The id-attribute you mean?
jonasw 13:00:17
yes
SaltyBones 13:04:30
So, I guess, stanza-id is used by MAM and origin-id is used to detect MUC reflections (that's what 0359 says anyway). And id-attribute is used for errors but the error will have the same id-attribute iiuc, right?
jonasw 13:04:45
yes, pretty much
jonasw 13:04:53
except that origin-id won’t work with all MUCs
SaltyBones 13:05:00
Why?
jonasw 13:05:14
because not all MUCs may be able to reflect that ID for whatever implementation specific reason
jonasw 13:05:25
(just like not all mucs can reflect the message @id)
MattJ 13:07:09
There is only one implementation I'm aware of that doesn't (didn't?) reflect @id, and I'm not even sure of the current status of that
SaltyBones 13:07:15
Okay, so why does the standard not just mandate that it be reflected?
MattJ 13:07:41
Simple oversight I suspect
Dave Cridland 13:07:45
SaltyBones, Non-MUC things pretending to be MUC, in part.
Dave Cridland 13:08:04
SaltyBones, Also by the time people had noticed this was a problem, enough implementations didn't.
SaltyBones 13:08:51
So, could this be fixed by changing the standard and requiring that origin-ids be reflected?
SaltyBones 13:09:18
Dave Cridland, what do you mean by "non-muc things pretending to be muc"?
jonasw 13:09:24
SaltyBones, changing the standard doesn’t fix the implementations magically
MattJ 13:09:26
Dave Cridland, enough = 1? Also in the case of bridging to non-MUC, isn't it as simple as 1) if the non-MUC supports acks, wait for the ack and reflect the id or 2) reflect the id immediately?
jonasw 13:09:33
SaltyBones, IRC transports are "non-muc things pretending to be MUC" for example
SaltyBones 13:09:34
jonasw, not magically but ...
Dave Cridland 13:09:51
MattJ, More than one, I think. One classic MUC implementation, but quite a few transports.
Dave Cridland 13:10:46
FWIW, I've gone back and forth over whether MUC ought to reflect ids. It's a special case of maintaining the id on broadcast, which is itself both useful in some cases and bad in others.
SaltyBones 13:11:01
Why is this reflection necessary?
Dave Cridland 13:11:33
SaltyBones, Reflection isn't - we could add in a subelement which indicated the original id.
Dave Cridland 13:12:04
SaltyBones, Broadcast ids are harder to work around - there's a few protocols which use id as reference.
Dave Cridland 13:12:37
This all said I *really* dislike having a zillion ids as subelements of stanzas when we already have an id attribute.
SaltyBones 13:13:56
Well, what is the subelement necessary for? Is this just as an ACK to the client?
SaltyBones 13:14:03
What do you mean by broadcast-id?
MattJ 13:14:35
SaltyBones, if you're asking why MUC reflects messages to the sender in the first place (some systems, e.g. IRC don't) - it has a few benefits
MattJ 13:14:47
Such as ensuring everyone in the room sees the same messages in the same order
jonasw 13:15:32
it is also natural as soon as multiple clients are in play, somewhat like message carbons
Dave Cridland 13:15:32
SaltyBones, So the id attribute in the message stanza for a groupchat message sent to the MUC can be reused in the reflected message, and/or the broadcast messages to other participants in the group.
MattJ 13:15:33
In IRC if two people send a message at the "same time", the messages will be shown in a different order on both their clients
MattJ 13:15:46
and yes, multiple clients from one user in the room
jonasw 13:16:11
it allows MUCs to do fancy things to messages and still have everyone see the same thing
Dave Cridland 13:16:19
(As a side-note, Matrix achieves a similar goal by a complex hash chain)
MattJ 13:16:39
SaltyBones, and also servers modifying messages (e.g. in the Prosody MUC we automatically convert long multi-line messages to a pastebin link). The reflection allows the sending client to see what everyone else sees.
jonasw 13:16:59
(in contrast to IRC, where you don’t see that your message was truncated)
SaltyBones 13:17:03
Okay, pretty convincing...
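The benefits of reflection listed above can be illustrated with a toy room, assuming a single process that fans every groupchat message out to all occupants, the sender included. `Room`, `MAX_LINES`, and the replacement text are hypothetical and only stand in for real server behaviour such as pastebin conversion.

```python
MAX_LINES = 3  # hypothetical limit; real servers choose their own policy


class Room:
    """Toy room: the room fixes the order and every occupant gets the same copy."""

    def __init__(self, occupants):
        self.inboxes = {nick: [] for nick in occupants}

    def groupchat(self, sender, msg_id, body):
        # Hypothetical room-side rewrite of long multi-line messages.
        if body.count("\n") + 1 > MAX_LINES:
            body = "[long message replaced by the room]"
        # Fan out to everyone, sender included, in the order the room processed it.
        for inbox in self.inboxes.values():
            inbox.append((sender, msg_id, body))
```

Because the sender only trusts the reflected copy, all inboxes end up identical, and the sender sees the rewritten body rather than what it originally typed.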
MattJ 13:17:03
Dave Cridland, and as many people at FOSDEM noted, much RAM
Dave Cridland 13:17:32
MattJ, Yes but BLOCKCHAIN.
jonasw 13:17:41
#bingo #triggered
SaltyBones 13:17:49
🔲‍⛓️
Dave Cridland 13:18:15
I hope there's a ZWJ there.
SaltyBones 13:20:34
*Yes, there is.*
SaltyBones 13:21:20
Okay, so we want reflected messages. Sounds fair. Let's suppose for a second that we can just make them mandatory in the standard and people would add them.
jonasw 13:21:37
they *are* mandatory
jonasw 13:21:44
the issue is with message IDs
jonasw 13:21:54
clients need to be able to recognize the reflection
SaltyBones 13:22:04
You mean id-attribute?
jonasw 13:22:12
or any ID really
jonasw 13:22:16
id attribute would be easiest
Dave Cridland 13:22:28
We could indicate in disco if ids were stable.
jonasw 13:22:52
wasn’t that proposal rejected-ish?
SaltyBones 13:22:57
Okay, 1. is the origin-id used for anything else and 2. is the whole point of origin-id not to detect reflection?
jonasw 13:23:33
speaking of MUC, I think the vote on https://github.com/xsf/xeps/pull/559 expired, didn’t it?
Ge0rG 13:24:16
I attempted to fix MUC reflected IDs some years back, not even in the normative language but merely by fixing the examples
Ge0rG 13:24:23
And got significant flak for it.
Ge0rG 13:25:26
I also attempted to make origin-id == message-id, but it was refused by the XEP author.
SaltyBones groans. 13:25:54
Ge0rG 13:26:16
Now I'll just sit and wait, and occasionally proclaim: *I told you so!*
SaltyBones 13:26:19
So, is the point of origin-id to be used for reflection or is there something else?
Flow 13:26:30
SaltyBones, that is one use case
SaltyBones 13:26:35
Ge0rG, great, if you could also occasionally chime in with reasons for things and explanations that would be awesome.
SaltyBones 13:26:43
Flow, like?
Ge0rG 13:26:46
SaltyBones: it's mainly for reflection, except that MUCs are not guaranteed to keep non-body elements in the reflection
Flow 13:27:07
other use cases include finding your own messages in an archive
Ge0rG 13:27:18
SaltyBones: https://wiki.xmpp.org/web/XEP-Remarks/XEP-0045:_Multi-User_Chat#Matching_Your_Reflected_Message
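One way a client might recognize its own reflected message, given that a MUC may drop non-body elements or rewrite the id attribute, is to try the strongest identifier first and fall back to weaker ones. This is a sketch under stated assumptions, not necessarily the procedure on the linked wiki page; the dict keys and the function name are hypothetical.

```python
def is_my_reflection(sent, reflected, my_nick):
    """Guess whether `reflected` is the room's copy of the message we `sent`.

    Both arguments are plain dicts with the hypothetical keys
    'id', 'origin_id' and 'body'; `reflected` also carries 'from_nick'.
    """
    if reflected.get("from_nick") != my_nick:
        return False
    # 1. origin-id survived the round trip: strongest signal.
    if reflected.get("origin_id") and reflected["origin_id"] == sent.get("origin_id"):
        return True
    # 2. The room preserved the id attribute.
    if reflected.get("id") and reflected["id"] == sent.get("id"):
        return True
    # 3. Last resort: compare bodies. This fails if the room altered the text.
    return reflected.get("body") == sent.get("body")
```

The body fallback is the fragile part: a room or transport that splits or rewrites the text defeats it, which is why a preserved identifier is preferable.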
SaltyBones 13:27:19
Flow, MAM uses stanza-id, right?
Flow 13:27:34
SaltyBones, it does
Ge0rG 13:27:36
SaltyBones: yes, but you don't know the stanza-id for the messages you sent
SaltyBones 13:27:45
Ge0rG, would you say this is a requirement by nature or just a problem with implementations?
daniel 13:28:03
i agree that the xep should specify that the sender should set origin-id=message-id
Ge0rG 13:28:05
SaltyBones: what?
Ge0rG 13:28:14
daniel: MUST set.
SaltyBones 13:28:21
Ge0rG, the fact that MUCs don't reflect non-body attributes.
daniel 13:28:24
yes
Ge0rG 13:28:25
because anything different is just a path into insanity
daniel 13:28:35
(i didn't mean SHOULD)
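A sketch of what "origin-id = message-id" could look like on the sending side, using only the Python standard library. The function name and the example room address are hypothetical, and the `<message/>` element is left without the `jabber:client` namespace for brevity; `urn:xmpp:sid:0` is the XEP-0359 namespace.

```python
import uuid
import xml.etree.ElementTree as ET

SID_NS = "urn:xmpp:sid:0"  # XEP-0359 (Unique and Stable Stanza IDs)


def build_groupchat_message(to, body):
    """Build a groupchat message whose id attribute and origin-id are the same value."""
    msg_id = str(uuid.uuid4())  # one random identifier, reused in both places
    msg = ET.Element("message", {"to": to, "type": "groupchat", "id": msg_id})
    ET.SubElement(msg, "body").text = body
    ET.SubElement(msg, f"{{{SID_NS}}}origin-id", {"id": msg_id})
    return msg
```

With a single identifier, anything that references the message later (a receipt, a correction) resolves to the same value no matter which of the two the other side chose to quote.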
SaltyBones 13:28:54
Ge0rG, also, why do I need the stanza-id for a message I sent?
Ge0rG 13:29:02
SaltyBones: have a look at biboumi, a modern IRC transport. It doesn't reflect your message ID, it doesn't reflect any non-body elements and it mangles multi-line messages
Ge0rG 13:29:06
Welcome to message tracking hell.
daniel 13:29:29
i think things will already break if some client doesn't do that and then expects a delivery receipt or something
Ge0rG 13:29:35
SaltyBones: because when you ask for a MAM sync, you'll get your sent messages copied to you as well
daniel 13:29:37
when sending messages to Conversations i mean
Ge0rG 13:30:09
Yes, message correction, delivery receipts and anything else that references IDs is a mess already.
SaltyBones 13:30:15
Ge0rG, and why is that a problem? They should have their origin-id and be recognizable, right?
Ge0rG 13:30:44
SaltyBones: you can't rely on origin-id being there, and you can't rely on it matching the message-id.
SaltyBones 13:30:49
daniel, what breaks when a client does what?
Ge0rG 13:31:15
SaltyBones: but yes, you can match MAM archives based on origin-id
daniel 13:31:21
if an origin id is set Conversations will use that as a reference in the receipt
daniel 13:31:41
so if your client doesn't do id=origin-id and expects the receipt for the id it won't work
daniel 13:32:11
same with everything else that references ids
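If a sender does use two different identifiers, it can still cope with peers that quote either one by indexing its outstanding messages under both. The class below is a hypothetical illustration of that bookkeeping, not code from any client.

```python
class PendingReceipts:
    """Track sent messages so a receipt quoting the id OR the origin-id resolves."""

    def __init__(self):
        self._by_ref = {}  # any identifier we sent -> the message's id attribute

    def sent(self, msg_id, origin_id=None):
        self._by_ref[msg_id] = msg_id
        if origin_id is not None:
            self._by_ref[origin_id] = msg_id

    def resolve(self, ref):
        """Return the id of the acknowledged message, or None if `ref` is unknown."""
        msg_id = self._by_ref.get(ref)
        if msg_id is None:
            return None
        # Forget every alias of this message so a duplicate receipt is ignored.
        self._by_ref = {k: v for k, v in self._by_ref.items() if v != msg_id}
        return msg_id
```

This only works if identifiers never collide across messages, which is one more reason to make the two values equal in the first place.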
SaltyBones 13:32:29
So, there seems to be consensus at least in this MUC right now that attribute-id should be the same as origin-id?
daniel 13:33:00
i'd argue there is no good reason not to make this a MUST
daniel 13:33:10
i mean most clients would naturally do this anyway
SaltyBones 13:33:16
Ge0rG, who was the xep-author who was against this and do you remember why?
jonasw 13:33:21
SaltyBones, Flow
daniel 13:33:24
because why would you generate a second id if you can reuse the existing one
zinid 13:33:31
Who is generating origin-id? A client?
Ge0rG 13:33:33
SaltyBones: https://mail.jabber.org/pipermail/standards/2017-September/033415.html
jonasw 13:33:36
zinid, the original sender
daniel 13:33:38
zinid, yes
jonasw 13:33:40
i.e. most of the times a client
daniel 13:34:34
note that muc rewrites are irrelevant to the scenario described above
Dave Cridland 13:34:46
jonasw, #559 expired, yes.
zinid 13:34:51
This is to track its own messages?
daniel 13:34:59
zinid, yes
daniel 13:35:05
or references to that
Dave Cridland 13:35:25
jonasw, But with no veto and the rest are +1, so apply it.
SaltyBones 13:35:30
jonasw, what is this ID-rewriting MUC shit? :)
daniel 13:36:01
some mucs are known to change the 'attribute' id
Ge0rG 13:36:15
SaltyBones: have a look at examples 44 and 45 in https://xmpp.org/extensions/xep-0045.html#message
daniel 13:36:21
so tracking your own messages or parsing references doesn't work
zinid 13:37:03
daniel: why, you can rely on origin-id in this case
Ge0rG 13:37:18
If we have a MUC that rewrites message IDs, can we mandate that it also MUST rewrite references in all XEP payloads that reference message IDs, please?
Ge0rG 13:37:47
zinid: https://wiki.xmpp.org/web/XEP-Remarks/XEP-0045:_Multi-User_Chat#Matching_Your_Reflected_Message
daniel 13:38:23
zinid, in the case of muc? because most mucs won't remove child elements from the stanza
zinid 13:38:47
I'm lost
zinid 13:39:05
daniel: but that's exactly what's needed
daniel 13:39:19
now i'm lost :-)
zinid 13:39:36
So you can fetch origin-id and check if this is your id
zinid 13:40:37
Why do you need to rely on message-id if you inject origin-id
daniel 13:41:11
i just said the sender should set id=origin-id
SaltyBones 13:41:12
zinid, I think there was a sub-discussion about forcing message-id = origin-id
daniel 13:41:45
and/or deal with delivery receipts that reference the origin-id instead of the id
daniel 13:41:59
which is made easier if you id=origin-id
zinid 13:41:59
Ok, I don't get it 🤔
daniel 13:42:22
doesn't matter. that's client to client stuff anyway :-)
zinid 13:45:09
From what I understand we just need to ditch IRC transports
zinid 13:45:17
😃
zinid 13:45:36
That's really their problem
daniel 13:48:06
oh yeah. or make irc transports track the messages themselves and re-add origin-id and stuff on reflection
daniel 13:48:38
it's probably better if the irc transport does the right thing(tm) than letting each and every client figure it out
zinid 13:49:08
daniel: agreed, nobody said writing a transport is simple
daniel 13:49:15
irc transport should also reassemble the message if they previously split it and stuff like that
daniel 13:49:22
because they know best if they did
SaltyBones 13:49:39
Hm.
SaltyBones 13:50:22
Okay, are there any actual problems with the current state of three IDs except that we don't like having so many?
daniel 13:50:31
yes
daniel 13:50:36
read what i said before
SaltyBones 13:51:02
Uh, assuming a well-behaved server?
daniel 13:51:33
yes
daniel 13:51:42
my argumentation has nothing to do with servers
jonasw 13:52:34
daniel, IRC doesn’t even reflect
jonasw 13:52:40
so the transport generates the reflections on its own
SaltyBones 13:52:43
Okay, can you please point out what you are referring to then?
jonasw 13:52:55
biboumi however decided to reflect the split version of the message, and there’s some argument to doing that
daniel 13:53:07
assume i sent: <message id="1"><body>hi</body><request/><origin-id id="2"/></message> and then i will receive from conversations: <message><received id="2"/></message>
zinid 13:53:44
who cares about IRC, really, are we designing a protocol for old nerds?
daniel 13:54:26
but we are talking about two different things. how irc transports should behave has nothing to do with the problem i'm describing
daniel 13:54:36
this applies to 1:1 chats as well
jonasw 13:54:41
daniel, yes, I don’t argue that
jonasw 13:54:54
I’m just replying to your message from "13:49:15Z" because I was away from keyboard.
daniel 13:57:01
the irc transport with the name i can not pronounce or remember does a few things that i don't like :-)
zinid 13:57:27
daniel: if we don't care about IRC then we probably don't need origin-id and thus the problem you just described doesn't exist
daniel 13:58:06
there are non transport mucs which also rewrite the id
Ge0rG 13:58:09
daniel: https://lab.louiz.org/louiz/biboumi/issues/3283
daniel 13:58:31
zinid, if it were only transports i wouldn't care
daniel 13:58:45
Ge0rG, is that an issue to change the name to something i can remember?
daniel 13:58:51
or pronounce?
zinid 13:58:52
daniel: if they are abandoned, then I personally give zero fucks
Ge0rG 13:59:01
daniel: no, it's about maintaining IDs on reflection
Ge0rG 13:59:50
zinid: everything in XMPP is abandoned. Now stop giving fucks.
daniel 14:00:29
i should probably write my own irc transport
edhelas 14:00:40
biboumi is not good ?
Ge0rG 14:00:49
daniel: you should just patch biboumi's two or three warts.
daniel 14:00:50
i haven't used irc though ever since counterstrike 1.6 came out
zinid 14:00:52
Ge0rG: not everything, so I still have a few fucks to give
daniel 14:02:17
there are a number of things i don't like that seem to be design decisions rather than bugs
daniel 14:02:34
so i'm not sure if they even want me to change them
SaltyBones 14:04:13
daniel, so regarding your example, what you are saying is that the read receipt might reference the message-id or the origin-id and it is not properly specified which?
zinid 14:04:14
Ok, whatever, so what do you guys think about muc subscriptions?
SaltyBones 14:04:30
Or are you getting at the fact that origin-id requires "by=" and is therefore sometimes not applicable?
daniel 14:04:48
> daniel, so regarding your example, what you are saying is that the read receipt might reference the message-id or the origin-id and it is not properly specified which? This
daniel 14:05:23
not just read receipts but everything that references something
daniel 14:05:27
but yes
zinid 14:05:49
daniel: isn't there a place in the xep which says you should copy the message-id?
daniel 14:05:59
zinid, no.
zinid 14:06:06
Really?
daniel 14:06:09
that's what Ge0rG and I want to add to the XEP
daniel 14:06:19
that's pretty much what the entire discussion is about :-)
zinid 14:06:26
So add it 😀
daniel 14:06:40
i can't without permission of the author
zinid 14:06:59
No shit?
SaltyBones 14:07:01
So, there is some problem that I am still not aware of....
SaltyBones 14:07:10
Flow, why is the by attribute in the origin-id?
zinid 14:07:11
Who's the author?
SaltyBones 14:07:15
The one that causes the privacy problems...
Kev 14:07:31
Not entirely true, BTW, that it's impossible to do things without the author. But it's the path of least resistance.
daniel 14:07:49
i don't think there is a by attribute in the origin id
SaltyBones 14:08:08
Ah, sorry that is only for stanza IDs. origin-id does not have by.
zinid 14:08:20
Peter is the author
SaltyBones 14:09:10
That might almost always be correct but 0359 is Florian Schmaus
zinid 14:10:15
> In addition, it SHOULD include an 'id' attribute that echoes the 'id' attribute of the content message.
zinid 14:11:25
And you want this SHOULD to be a MUST?
SaltyBones 14:12:13
zinid, what some people here want is that origin-id = message-id
SaltyBones 14:12:36
Ge0rG, you had an objection to that in https://mail.jabber.org/pipermail/standards/2017-September/033415.html but I don't quite understand it. Why do you want to know if somebody creates strong message-id?
zinid 14:13:10
SaltyBones: but the sender controls this itself, so what's the problem with setting them equal?
daniel 14:13:39
zinid, we want wording that tells the client developer to do this
daniel 14:13:46
there is no 'problem'
Ge0rG 14:14:10
SaltyBones: I thought it would matter, but it doesn't, because there always can be malicious entities
zinid 14:14:53
daniel: if they don't then they only harm themselves, I guess
Ge0rG 14:15:19
zinid: no, they harm the other participants
zinid 14:15:29
Ge0rG: ah
SaltyBones 14:15:36
How?
zinid 14:15:46
Yeah, how?
Ge0rG 14:15:57
zinid: also, it's well possible that stanza ids are generated by the xmpp library, and origin ids by the client, causing a mismatch
Ge0rG 14:16:28
By making all message references break
Ge0rG 14:16:43
We had that above already.
SaltyBones 14:16:45
Before we move to stanza-ids....
SaltyBones 14:16:59
There is nothing wrong with requiring origin-id = message-id?
zinid 14:18:05
If we require this then why is origin-id needed, wtf?
SaltyBones 14:19:14
zinid, it's possible that it was a mistake
SaltyBones 14:19:44
daniel, does requiring origin-id = message-id actually solve your problem or are there cases when stanza-id might be used to refer to a message instead??
daniel 14:20:05
It solves my problem
SaltyBones 14:22:44
daniel, wouldn't it be better to just mandate that clients always use the origin-id to avoid the problem of dealing with id-rewriting MUCs?
SaltyBones 14:26:27
I mean, certainly removing an unnecessary id would also be desirable but this might be a good intermediate step.
SaltyBones 14:41:49
Would everything be better if clients would generate proper stanza-ids?
SaltyBones 14:42:15
For ...uh...some definition of proper that makes them UUIDish.
zinid 14:43:05
they should be strictly monotonically increasing, so we don't need XEP-0198
SaltyBones 14:43:11
And by "would everything be better" I specifically also mean, wouldn't that allow us to ditch the other IDs?
zinid 14:43:12
and not UUIDs
jonasw 14:43:45
zinid, "strictly monotonically increasing" is not gonna happen
zinid 14:44:50
because?
SaltyBones 14:44:53
zinid, would increasing IDs really solve all the same problems as stream management? Seems unlikely?
jonasw 14:45:06
zinid, also, increasing IDs have security implications
jonasw 14:45:13
or rather: predictable IDs
zinid 14:45:15
SaltyBones, it does in data replication, but of course XMPP is too unique
zinid 14:45:30
version vectors are based on such ids
zinid 14:46:03
jonasw, what security implications?
jonasw 14:46:18
zinid, I think there were some IQ response injection attacks based on predictable IDs
jonasw 14:46:32
even though in that case you probably already made the mistake of not verifying the sender of the response
jonasw 14:46:48
error injections would most likely work though because errors can come from entities different than the original recipient
zinid 14:47:23
jonasw, this can be fixed by adding routing information
SaltyBones 14:47:27
jonasw, maybe this is too much of an assumption for all of XMPP but don't we have transport encryption?
zinid 14:47:45
we anyway need this functionality already
jonasw 14:48:15
SaltyBones, that doesn’t stop me (jonas@zombofant.net) from sending an iq type="error" id="whateverIguessed" to="you" to break whatever you were doing
SaltyBones 14:48:20
zinid, I agree, if this is a problem I don't see why it cannot be exploited right now by somebody who randomly sees the message first
jonasw 14:48:39
SaltyBones, if I can guess the ID, I can attack you from off-path
jonasw 14:48:47
I can’t do that when I can’t guess the ID
jonasw 14:48:54
if I’m on path, you’re right, everything is lost already in XMPP.
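The off-path concern can be made concrete by comparing a counter-based id scheme with random ids. All names below are hypothetical; the sketch only shows that one observed counter id is enough to predict the next, while a random id gives an attacker nothing to extrapolate from.

```python
import itertools
import secrets


class CounterIds:
    """Predictable scheme: a prefix plus a strictly increasing counter."""

    def __init__(self, prefix="iq"):
        self._prefix = prefix
        self._counter = itertools.count(1)

    def next(self):
        return f"{self._prefix}-{next(self._counter)}"


def guess_next(observed):
    """What an off-path attacker can do after seeing a single counter-based id."""
    prefix, number = observed.rsplit("-", 1)
    return f"{prefix}-{int(number) + 1}"


def random_id():
    """Unpredictable alternative: 16 random bytes, URL-safe encoded."""
    return secrets.token_urlsafe(16)
```

An attacker who once received a stanza carrying `iq-1` can address a forged error at `iq-2`; with `random_id()` the same attacker has to guess 128 bits.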
zinid 14:49:04
but if you cannot match incoming errors against requests you sent then you should really consider changing jobs
jonasw 14:49:25
zinid, how’d you match incoming errors against requests other than the ID?
jonasw 14:49:36
you can’t really use from, because as I said, it might come from an entity you didn’t know about yet
MattJ 14:50:11
jonasw, no?
zinid 14:50:18
jonasw, you can use (from, ID) I think
MattJ 14:50:27
errors come from the original recipient that you addressed
jonasw 14:50:35
MattJ, even if an s2s error causes an error?
MattJ 14:50:37
Yes
jonasw 14:50:43
oh okay
MattJ 14:50:48
Otherwise it would be a nightmare
Dave Cridland 14:53:07
jonasw, There's a "by" attribute, if I remember right, that tells you the generator/reporter of the error.
SaltyBones 14:54:14
So, I assume that if I run a h4xx0r server and try to send replies to messages that were not directed to me to some other server they will be discarded?
SaltyBones 14:54:30
And the same happens between client and server...?
MattJ 14:54:41
SaltyBones, by "replies", you mean error replies?
SaltyBones 14:55:11
Whatever magical interfering replies that jonasw was referring to :)
SaltyBones 14:55:15
yes, errors ;)
MattJ 14:55:39
For that to work you would have to know the original sent message id, and the original sender's full JID
SaltyBones 14:55:55
But if I did, I could?
MattJ 14:56:05
and then yes, you could send them whatever you wanted - it would be up to their client to match it up (or not) to one of its outgoing messages
SaltyBones 14:56:34
So, the statement that predictable IDs would allow me to spoof responses remains correct?
MattJ 14:56:35
so as said above, the client should use (from, id) to identify error responses, not just the id
SaltyBones 14:56:58
Because I cannot spoof "from"?
MattJ 14:57:02
Correct
SaltyBones 14:57:08
Yes, okay...
SaltyBones 14:57:12
That's what I was looking for. :)
MattJ 14:57:36
id "abc123" from userA@domainA is different from "abc123" from userB@domainB
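Matching on the pair (from, id) rather than on the id alone can be sketched as a small lookup table keyed by the address the request was sent to. `RequestTracker` and its methods are hypothetical, and JIDs are compared as exact strings here; a real client would normalize them first.

```python
class RequestTracker:
    """Match incoming responses and errors by (from, id), not by id alone."""

    def __init__(self):
        self._pending = {}  # (recipient JID, stanza id) -> caller-supplied context

    def sent(self, to, stanza_id, context):
        self._pending[(to, stanza_id)] = context

    def incoming(self, from_, stanza_id):
        """Return the context of the matching request, or None if nothing matches."""
        return self._pending.pop((from_, stanza_id), None)
```

Since the receiving server is expected to verify the `from` domain on s2s links, a third party that guesses the id still cannot produce a stanza whose `from` matches the original recipient.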
SaltyBones 14:58:19
And servers only accept s2s with from=...@domainA if they know that the other server is in charge of domainA?
Dave Cridland 14:59:34
SaltyBones, That's what they're supposed to do, yes.
SaltyBones 14:59:43
^^
MattJ 14:59:56
SaltyBones, yes, they do
Dave Cridland 15:00:07
MattJ, No, they don't, but I love your optimism.
MattJ 15:00:10
If they don't, it's a bad bug or not an XMPP server
Dave Cridland 15:01:24
SaltyBones, So Metre (my S2S proxy mad thing) does this very strictly, and as a result I can't join this chatroom from my personal server.
Dave Cridland 15:01:55
SaltyBones, Not sure if MattJ is saying that's a bad bug in this server, or if he's saying it's not an XMPP server. :-)
MattJ 15:02:04
Dave Cridland, for what reason?
SaltyBones 15:03:28
Dave Cridland, so which messages do you get that don't pass?
SaltyBones 15:04:15
Bunneh, version xmpp.org
Bunneh 15:04:15
SaltyBones: xmpp.org is running Prosody version 0.9.12 on Linux
Dave Cridland 15:04:17
MattJ, It's sending a response down the stream obtained by reversing the stream to/from, and not by reversing the stanza to/from. I forget the details, Zash knows.
Dave Cridland 15:05:08
MattJ, I don't know if Prosody would accept that or not. Metre doesn't, Openfire does (but because it knows it's multiplexed on the outbound, so weirdness.)
Dave Cridland 15:06:09
MattJ, For maximum fun, Openfire used to do similar "implicit" auth on outbound, but I stopped that as from 4.2.0.
MattJ 15:07:21
I don't really understand, but maybe Zash will enlighten me
Dave Cridland 15:09:49
MattJ, Metre sends traffic for d.c.n -> muc.xmpp.org over a stream for d.c.n -> xmpp.org (after adding that domain via dialback).
Zash 15:09:55
How far back should I read to have any idea what you are talking about?
SaltyBones 15:10:18
15:58 should suffice
MattJ 15:10:30
SaltyBones, and a time machine
Dave Cridland 15:10:39
MattJ, Prosody (0.9) responds to the traffic on the reverse stream, xmpp.org -> d.c.n - which it hasn't requested authorization for yet.
SaltyBones 15:11:03
Do you mean a timezone machine or a MAM machine?
Zash 15:11:35
No MAM here
Dave Cridland 15:11:36
SaltyBones, For your amusement and mild confusion, XMPP authorizes streams by a 3-tuple of (from-domain, to-domain, direction).
Zash 15:11:40
Page up button tho
Dave Cridland 15:12:03
SaltyBones, Well. One or more of those 3-tuples anyway.
MattJ 15:13:11
Dave Cridland, who hasn't requested authorization for what? You mean Prosody connects to d.c.n as 'xmpp.org' and sends stanzas from 'muc.xmpp.org'?
Dave Cridland 15:13:23
MattJ, Yup, that.
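The strict check Dave Cridland describes can be modelled as a set of authorized (from-domain, to-domain, direction) tuples per stream. The class below is a hypothetical illustration, not Metre's or Prosody's code; the domain names are the ones from the discussion.

```python
from dataclasses import dataclass, field


@dataclass
class S2SStream:
    """Toy s2s stream holding the domain pairs that have been authorized on it."""
    authorized: set = field(default_factory=set)

    def authorize(self, from_domain, to_domain, direction):
        # `direction` is "in" or "out" from the local server's point of view.
        self.authorized.add((from_domain, to_domain, direction))

    def accepts(self, stanza_from_domain, stanza_to_domain, direction):
        """A strict server checks the stanza's own domains, not the stream header's."""
        return (stanza_from_domain, stanza_to_domain, direction) in self.authorized
```

On a stream authorized only for `("xmpp.org", "dave.cridland.net", "in")`, a stanza from `muc.xmpp.org` is rejected until that pair has been authorized as well, which is the failure mode described above.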
Zash 15:16:17
The thing
Dave Cridland 15:16:43
MattJ, Like this: DEBUG 2018-02-12T15:14:35 /home/dwd/src/Metre/src/xmlstream.cc:95 : NS296914 - Got [399] : <?xml version='1.0'?><stream:stream xmlns:db='jabber:server:dialback' xmlns:stream='http://etherx.jabber.org/streams' version='1.0' from='xmpp.org' to='dave.cridland.net' xml:lang='en' xmlns='jabber:server'><iq id='472-452048' type='error' to='dave.cridland.net' from='xsf@muc.xmpp.org/tim@boese-ban.de'><error type='cancel'><not-acceptable xmlns='urn:ietf:params:xml:ns:xmpp-stanzas'/></error></iq>
dwd 15:17:15
This time I'm joined. Timing, I guess.
Zash 15:19:58
Uh, what was the thing with that again?
Dave Cridland 15:20:35
Zash, The bug itself? Responding to inbound traffic on a multiplexed stream by reversing the wrong domain pair.
MattJ 15:20:57
dwd, I don't see how timing would play a part
Zash 15:21:15
Dave Cridland: Do you remember where we discussed this?
Dave Cridland 15:21:43
MattJ, I think whether I can join is dependent on what streams are open, which is dependent on the session lifetime and activity.
Dave Cridland 15:21:55
MattJ, Not timing as in a race condition, mind.
Dave Cridland 15:22:17
Zash, Erm. 1:1 messages, I think, until we figured out it definitely wasn't a security issue.
Dave Cridland 15:22:30
Zash, Possibly some discussion in jdev.
dwd 15:23:28
FWIW, I have to put in something similar to an implicit auth for X2X, anyway. So this may provide a workaround.
Zash 15:24:43
MattJ: https://hg.prosody.im/trunk/file/0de0018bdf91/plugins/mod_s2s/mod_s2s.lua#l203
MattJ 15:25:04
Zash, exactly where I ended up
Zash 15:25:31
stanza from/to may differ from the stream to/from in case of dialback multiplexing
Dave Cridland 15:44:38
MattJ, You switching to FreeBSD?
Dave Cridland 15:44:47
MattJ, https://svnweb.freebsd.org/base?view=revision&revision=329166
zinid 15:50:08
svn o_O
edhelas 15:52:40
git is too mainstream
zinid 15:53:07
capitalist pigs use git!
Zash 15:53:30
gititis
Holger 15:53:51
NetBSD has Lua in the kernel for years!
Zash 15:53:52
Dave Cridland: Heh, nice
Holger 15:54:08
https://www.phoronix.com/scan.php?page=news_item&px=MTMwMTU
Holger 15:54:11
And it has CVS :-)
zinid 15:54:24
damn, I only just got used to svn...
Dave Cridland 15:56:01
Holger, Can you download NetBSD yet, or is it still only available on tape?
zinid 15:56:44
Holger, cool stuff btw
SamWhited 15:57:09
More importantly, once I download it can I run it on any machines made after 1995? (this is always the problem I had trying to run NetBSD; a few of the ops people at work run it, but they all use *very* old machines)
Dave Cridland gets prepared to explain "tape" to the younger readers. 15:57:15
Dave Cridland 15:57:45
SamWhited, Sure! It runs pretty well even on modern machines like the Amiga 4000.
Dave Cridland 15:58:11
(Sorry, that was 1993).
SamWhited 15:58:29
Exactly.
Holger 15:58:37
Well yes it's about as dead as XMPP :-P
SamWhited 15:58:52
I would like to use at *least* a P3.
Holger 15:58:53
But it always worked okay-ish for me on non-recent ThinkPads.
Holger 15:59:15
I no longer do that though.
Holger 16:00:02
cpu0 at mainbus0 apid 0: Intel(R) Core(TM) i3-2120T CPU
Holger 16:00:15
... is the one box I still run NetBSD on.
Zash 16:00:46
Uh, was NetBSD the one being difficult, or OpenBSD?
Holger 16:00:57
Difficult?
SamWhited 16:00:57
Actually, jokes aside, that might be a nice thing to do with my broken old thinkpad; doesn't work very well as a laptop anymore, but it could be a {Free,Net,Open}BSD or Illumos "desktop".
Holger 16:01:15
Zash: Aren't all computers difficult except for Apple?
SamWhited 16:01:39
This is true… ⤴
Zash 16:02:34
Et Apple, Holger
Dave Cridland 16:02:57
I find Apple incomprehensibly difficult to use, I must admit.
Dave Cridland 16:03:12
I've yet to figure out how to go up a directory consistently in file dialogs.
Kev 16:06:11
cd ..
Kev 16:06:19
Happy to help.
Holger 16:07:05
Managing FreeBSD feels more or less like Debian to me, the two others feel a bit more basic but not really harder to use. All dead simple compared to the dances you had to go through when partitioning hard disks or getting X11 to run 20 years ago with any Linux or BSD.
Dave Cridland 16:08:11
Holger, Ah, X11. Kids these days don't know they're born.
Dave Cridland 16:08:39
Kev, Don't get me started on the antiquated command line tools Apple foists upon you.
Holger 16:08:43
Yes, we're horribly old.
Dave Cridland 16:09:02
Holger, We're very experienced. I'm sure that's what you meant.
Zash 16:09:04
I'm sure wayland will bring back the good old times for those who miss configuring Xorg
Holger 16:09:20
Precisely.
SamWhited 16:09:25
The only thing I want out of a machine is to sync an old ipod without Rhythmbox crashing…
Dave Cridland 16:09:52
SamWhited, Rhythmbox never crashes.
Dave Cridland 16:09:59
SamWhited, At that speed it's called "parking".
SamWhited 16:11:12
Rhythmbox is one of those people who parallel parks by backing up until they hit the car behind them, then driving forward until they hit the car in front.
SaltyBones 16:14:23
So, why is it that a client cannot create the stanza-id?
SaltyBones 16:15:07
Is this a thing about malicious clients or what's going on there?
jonasw 16:15:35
SaltyBones, no, unaware clients are sufficient; colliding IDs would lead to interesting issues with MAM
SaltyBones 16:16:19
jonasw, but if the client is only unaware the server could just return an error informing the client that this ID has been used.
Dave Cridland 16:16:54
SaltyBones, But we could use, say, stream-id + attribute-id to form a stable, non-colliding reference identifier.
jonasw 16:17:14
SaltyBones, that’d be annoying to handle in the client, and the server would need an O(1) (or similar) way to determine that the ID has been used already...
SaltyBones 16:17:17
Yeah, or we could hash things and god knows what else...it seems very solvable...
Dave Cridland 16:17:37
SaltyBones, I mean, unless the client isn't generating unique ids within its stream, in which case it's presumably not going to be fixed to use some other identifier.
jonasw 16:17:39
Dave Cridland, stream id + sm-counter?
jonasw 16:17:46
that’s verifiable by the server
jonasw 16:17:48
and predictable for both
Dave Cridland 16:18:11
jonasw, True. But I think we had some ideas on predictability outside the stream?
SaltyBones 16:18:16
throw a hash on top to avoid privacy issues, done
SaltyBones 16:18:33
Dave Cridland, I thought we discussed those into oblivion earlier? :)
Dave Cridland 16:18:47
SaltyBones, Really I've lost track.
jonasw 16:19:00
Dave Cridland, hmmm ... hmac(stream-id, sm-counter)?
Dave Cridland 16:19:03
SaltyBones, I have the feeling that nobody quite knows the entire picture here.
SaltyBones 16:19:11
I almost lost track and I basically didn't work at all today and just discussed here. :p
Dave Cridland 16:19:25
jonasw, HMAC() requires a secret to be of any use.
jonasw 16:19:47
Dave Cridland, stream-id.
SaltyBones 16:20:01
Seems reasonable on first sight...
jonasw 16:20:02
(post-TLS obviously...)
Dave Cridland 16:20:05
jonasw, Is that secret? And why HMAC over a hash?
SaltyBones 16:20:21
Oh, that would leave out the hash...
jonasw 16:20:24
Dave Cridland, I’m fine with hash(stream-id || sm-counter) too, but that’s nearly an hmac ;-)
SaltyBones 16:20:28
hmac basically IS a hash
SaltyBones 16:20:32
yeah :)
Dave Cridland 16:20:49
jonasw, I'm pretty sure it's not. :-)
SaltyBones 16:20:56
I am very sure it is. :)
jonasw 16:20:58
that’s why I said "nearly"
jonasw 16:21:03
I think the concat is slightly different
SaltyBones 16:21:06
(yes, nearly)
SaltyBones 16:21:21
https://wikimedia.org/api/rest_v1/media/math/render/svg/4edcf0bd8b403c93564b8d7ea91338b3208dea03
jonasw 16:21:28
doesn’t matter anyways
Dave Cridland 16:21:38
jonasw, An HMAC is two nested hashes with concats and masks, yes.
SaltyBones 16:21:51
Okay, so we only disagree on our definition of nearly. ;)
Dave Cridland 16:21:59
jonasw, But the security properties are distinct.
jonasw 16:22:04
Dave Cridland, I don’t argue that
jonasw 16:22:16
(and I’m aware)
SaltyBones 16:22:56
Anyway, the suggestion of HMAC(key=stream-id, msg=sm-counter) seems good, doesn't it?
Dave Cridland 16:23:04
In any case, given a stanza with a known id on a given stream, we clearly want to be able to predict the MAM id.
SaltyBones 16:23:27
I mean, I am also fine with SHA-*(stream-id || sm-counter)
SaltyBones 16:24:04
Dave Cridland, but mam-id = stanza-id which is what I am proposing to set to this...
Dave Cridland 16:24:13
Question is, do we think the MAM id is likely to become public, and if so, can someone relatively easily figure out the next (or previous) value, and then do Something Bad?
SaltyBones 16:25:15
Dave Cridland, that might be the question but if it only costs us a hash per message to not answer it, it might also not be...
Dave Cridland 16:25:43
What I'm wondering, you see, is whether we give clients an algorithm to generate attribute-id and then let them signal to the server that they're going to use globally-unique attribute-ids and the server is allowed to call them out if they don't.
jonasw 16:25:48
Dave Cridland, okay, right, this is about the MAM ID. this *probably* doesn’t matter, but if the goal is to consolidate all IDs at some point, choosing a way where the ID can become public is safer
Dave Cridland 16:26:07
jonasw, That's roughly my thought, yes.
jonasw 16:26:48
and at that point, HMAC(…) seems like a sane choice.
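The HMAC(key=stream-id, msg=sm-counter) suggestion can be sketched as follows. This is only an illustration under assumptions: the function names, the choice of SHA-256, and the decimal encoding of the counter are all hypothetical, and whether the stream-id is secret enough to serve as a key is exactly the point questioned above.

```python
import hashlib
import hmac


def derive_stanza_id(stream_id: str, sm_counter: int) -> str:
    """Hypothetical ID derivation: HMAC-SHA256 keyed by the stream id."""
    key = stream_id.encode("utf-8")
    msg = str(sm_counter).encode("ascii")  # assumed counter encoding
    return hmac.new(key, msg, hashlib.sha256).hexdigest()


def server_accepts(stream_id: str, sm_counter: int, claimed_id: str) -> bool:
    """The server re-runs the derivation and compares, as suggested in the chat."""
    expected = derive_stanza_id(stream_id, sm_counter)
    return hmac.compare_digest(expected, claimed_id)


# Both sides know the stream id and the current counter, so both can
# predict the ID without a round trip.
example_id = derive_stanza_id("example-stream-id", 42)
```

Because both ends can compute the same value, the ID is predictable for client and server while remaining opaque to anyone who does not know the stream id.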
Dave Cridland 16:26:58
jonasw, Well, that and "But we *HAVE* stanza ids - right there in the stanza!"
Kev 16:27:14
How would the server know that they didn't use a globally unique id?
SaltyBones 16:27:33
Kev, if the server can simply re-run the generation algorithm and compare results..?
jonasw 16:27:39
Kev, if the ID doesn’t follow the algorithm for a globally (predictable, for the server and client only) unique ID
Dave Cridland 16:27:41
Kev, Well, it'd know if they used no id at all, and if it saw a collision within a stream.
Kev 16:28:28
If we only cared about collisions within a stream, we could trivially solve all issues already.
SaltyBones 16:28:28
And that is how you can tell that Kev is a VIP
SaltyBones 16:28:36
when he says something three people reply!
Dave Cridland 16:29:03
Kev, Well, we care about collisions for a user's account for MAM, and no further.
Kev 16:29:08
But having a useful id to refer to a message would be jolly useful, and I don't see how we can get there yet.
SaltyBones 16:29:52
Okay, so you're saying when two servers federate and you assume one of them is malicious how can the other make sure he doesn't get duplicate IDs?
Kev 16:30:07
Or non-malicious server with a malicious client.
Dave Cridland 16:30:10
SaltyBones, Do they care? And if not, should they?
jonasw 16:30:12
SaltyBones, you don’t use other servers stanza-ids in your own MAM
SaltyBones 16:30:22
Well, the malicious client would be detected by the server.
Kev 16:30:23
jonasw: Bloody useful if you could, though, no?
Kev 16:30:37
SaltyBones: How would the server know it's malicious, if it doesn't know all globally generated ids?
SaltyBones 16:30:52
Kev, it can simply check if the client generates their IDs correctly...
Dave Cridland 16:30:54
Kev, Again, why would a server care?
Kev 16:30:55
Without exploding the size of IDs, at least.
jonasw 16:30:56
(we should mandate the use of shakespeare-inspired adjectives in each bloody sentence in this room, by the way)
jonasw 16:31:12
Kev, how would it be useful?
Kev 16:31:36
I'd like to send a reference to a stanza, where the reference makes sense in my archive, the MUC/MIX archive, and the user's archive.
Dave Cridland 16:31:42
jonasw, I don't think "bloody" is Shakespeare. "Submarine" is, though, as far as I remember.
SaltyBones 16:31:45
Kev, you simply make it a hash or hmac of something that both the client and the server know and the server can check the calculation.
Kev 16:32:16
"simply" :)
SaltyBones 16:32:28
Kev, but doesn't that only involve you, the muc/mix server and $other_client? i.e. only one server...?
Dave Cridland 16:32:31
Kev, Are the (from, id) attributes of the stanza sufficient? If not, why?
Zash 16:32:43
If this is for MAM purposes, then I will cry if I can't stick some time based bits into the ID
Kev 16:32:48
Dave Cridland: No, because the from gets rewritten.
Dave Cridland 16:32:56
Kev, When?
SaltyBones 16:33:09
Zash, what?
Kev 16:33:12
In a MUC or a MIX.
Zash 16:33:33
My MAM ids are basically yyyy-mm-dd-random
jonasw 16:33:41
Kev, those are different use-cases I think.
Kev 16:34:01
jonasw: If we cherry-pick the trivial use cases, it's going to be trivial to solve them :)
jonasw 16:34:06
Kev, this is where origin-id makes more sense. It is generated by the client. If the client doesn’t ensure that there’s enough entropy in there, references to their message won’t work.
SaltyBones 16:34:08
Zash, and this is important because?
Dave Cridland 16:34:09
Kev, Ah, so given a message from a MUC fanout, do you want to be able to identify the originating id? I'm not actually sure you always want to allow that.
SaltyBones 16:35:01
I still don't get it. Why does from get rewritten?
SaltyBones 16:35:09
And what does that have to do with anything? :D
jonasw 16:35:32
SaltyBones, when you send a message from saltybones@yourdomain.example/client-resource to this MUC, we see the message from xsf@muc.xmpp.org/SaltyBones
jonasw 16:35:37
so the from is being rewritten.
jonasw 16:35:52
and if references are based on (from, id) pairs, you can’t use the same reference we all can use for your message
jonasw 16:36:20
(simplified, of course we could still refer to the reflection of the message, but that would break down with MIX, too, I think)
Kev 16:36:42
jonasw: Well, no, because if you can fake someone else's id, suddenly you can maliciously change the target of a reference.
jonasw 16:37:09
Kev, good point. so we indeed need something like (from, id) :(
SaltyBones 16:37:17
Wait what?
zinid 16:37:24
lol
SaltyBones 16:37:25
How can you now fake stuff again?
jonasw 16:37:26
but given that all our fanouts and from-rewriting things do actually do reflections, I’m not sure if that’s so bad, Kev?
jonasw 16:37:57
your local archive would still be able to resolve everything
Kev 16:38:00
jonasw: Well, it enforces a round-trip before you can refer to or correct anything, which isn't ideal.
jonasw 16:38:19
Kev, you normally know the "from" you’ll be having.
jonasw 16:38:26
and since you set the origin-id yourself, you already know everything you need
jonasw 16:38:44
(meh, races with server-enforced nickname changes in MUC)
Kev 16:38:44
Point.
jonasw 16:39:26
(but I guess those will be rare enough for us not to care)
SaltyBones 16:39:27
halp! :(
Kev 16:39:39
SaltyBones: Because there are potentially malicious entities.
SaltyBones 16:40:14
Are these only clients or servers as well?
Kev 16:40:14
jonasw: Still doesn't help where a user generates the same id twice, I think, and you're left with ambiguous references/corrections/whatever.
Kev 16:40:17
SaltyBones: Both.
SaltyBones 16:40:34
So why do you connect to a malicious server and expect things to work?
SaltyBones 16:40:40
Maybe that's not a good approach.
jonasw 16:40:54
Kev, that’s not an issue. put 256 bits of entropy in there and you are officially allowed to not care
Kev 16:41:02
Ah, if you have a mechanism for identifying malicious intent on the Internet, you could be a very famous person.
jonasw 16:41:05
Kev, and if we want to have defined behaviour in that case, always assume the most-recent message
Kev 16:41:32
jonasw: That only works for accidents, rather than manipulation, no?
SaltyBones 16:41:50
Kev, but you cannot manipulate as long as the server is checking.
jonasw 16:41:50
Kev, indeed
jonasw 16:41:58
but if you manipulate your own origin-ids, it’s your own fault?
SaltyBones 16:42:01
Of course if we have malicious servers now a bit more work is required. :)
Kev 16:42:13
And I'm concerned that 'most recent' falls apart if you can manage different people receiving different subsets.
jonasw 16:42:18
can one inflict harm by producing duplicate origin-ids?
jonasw 16:42:42
I’m not convinced that this is actually an issue.
Kev 16:42:44
Most likely.
Kev 16:42:54
If I can cause the 'wrong' message to get corrected.
jonasw 16:42:59
maybe with something like Reactions based on (from, origin-id) references.
Kev 16:43:02
Or a like to apply to the wrong message.
Kev 16:43:42
If I can manipulate you into liking a Godwin instead of something sensible, that's not ideal.
jonasw 16:44:18
meh
SaltyBones 16:44:32
But messages aren't authenticated anyway, if you are a malicious server you can claim to have received whatever likes from me...
jonasw 16:44:39
Kev, then the only way is a MUC/MIX stanza ID generated by the MUC/MIX and waiting for a round-trip before references can be done.
Kev 16:44:55
SaltyBones: Other way around.
jonasw 16:44:59
SaltyBones, all it needs for now is a malicious client though
jonasw 16:45:14
SaltyBones, or a malicious client+server pair if we do the hmac-stanza-id thing for origin-id
jonasw 16:45:22
I can still run my own server and make it fake origin-ids
zinid 16:46:49
your own malicious server?
jonasw 16:46:49
Kev, I’d however argue that actively making participants in a MUC/MIX receive only parts of the conversation, from the client/own-server side, would be rather tricky
jonasw 16:47:00
and targeted parts even
jonasw 16:47:22
and that without them suspecting that things are broken in funny ways and thus not trusting the system anymore
Kev 16:47:25
jonasw: Ah, that's ok then, no exploits have ever involved doing anything tricky :)
jonasw 16:47:39
Kev, I see your point, but I’m not sure this is actually something which is reasonably exploitable.
jonasw 16:47:43
but yes
jonasw 16:47:55
unless we let the ID be generated by the fanout service, there’s no way to be sure I’m afraid
jonasw 16:48:04
I mean, I don’t have an issue with that, I’m fine with that even.
Kev 16:48:11
Yes. I don't see a way of solving this, which was why I brushed it under the carpet at the Summit.
jonasw 16:48:14
complicates things though
jonasw 16:52:02
Kev, I’m having a really hard time constructing a successful attack which wouldn’t be seen by the victim in the MUC case
jonasw 16:52:10
even when I omit arbitrary messages
jonasw 16:52:13
to only some participants
jonasw 16:52:21
(which would be really really tricky to achieve targetedly I think)
jonasw 16:53:10
ah, now I have it
jonasw 16:53:56
this is the scenario: 1. user A, "bad statement", origin-id=1 2. user A, "good statement", origin-id=1 3. user B, like (from=user A, origin-id=1) if (2) isn’t seen by all users, they see user B liking "bad statement" instead of "good statement"
jonasw 16:55:12
with an arbitrary amount of messages between (2) and (3), it is also not too difficult to make people not see (2) in a MAM-less MUC.
Kev 16:55:42
Indeed.
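The scenario above can be replayed in a few lines. This is a sketch only, assuming references resolve to the most recent message a participant has seen for a given (from, origin-id) pair; the `resolve` helper and the message dictionaries are hypothetical.

```python
def resolve(seen_messages, sender, origin_id):
    """Return the body of the most recent seen message matching (sender, origin_id)."""
    match = None
    for msg in seen_messages:
        if msg["from"] == sender and msg["origin_id"] == origin_id:
            match = msg["body"]  # later matches overwrite earlier ones
    return match


# Steps (1) and (2): user A reuses origin-id "1" for two different statements.
room_history = [
    {"from": "user A", "origin_id": "1", "body": "bad statement"},
    {"from": "user A", "origin_id": "1", "body": "good statement"},
]

# Step (3): user B likes what they believe is the good statement.
like_target = ("user A", "1")

full_view = room_history         # a participant who saw both messages
partial_view = room_history[:1]  # a participant who never saw (2)

liked_full = resolve(full_view, *like_target)
liked_partial = resolve(partial_view, *like_target)
```

The two participants resolve the same like to different messages, which is the ambiguity a service-generated or service-verified ID would remove.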
jonasw 16:56:32
I don’t think there’s a way unless the MUC does things there
SaltyBones 16:56:55
But, all it requires to prevent that is for the MUC to check that the ID is unique just like the assumed-non-malicious server did in our previous discussion...
jonasw 16:56:56
(i.e. if the MUC generates the ID)
jonasw 16:57:09
SaltyBones, but that is a rather expensive check to do
jonasw 16:57:22
you need to record IDs for all eternity, or have a defined way to generate the ID which can be executed by clients
jonasw 16:57:33
the latter would be tremendously tricky to get to work synchronously
jonasw 16:57:41
but possible...
SaltyBones 16:57:59
Which is a little more tricky because there is no obvious common "secret" like the stream-id but a simple Hash("counter") would do
jonasw 16:58:14
something like hash(message-counter || presence-id), where presence-id is an ID assigned to the client on join
SaltyBones 16:58:22
Kev, note that in this case I used the word simple again but I was totally lying...
jonasw 16:58:32
hash counter doesn’t really work with multi-session nicks I think, it would lead to collisions or races
SaltyBones 16:58:58
jonasw, yeah it was just to get the idea. Indeed you would at least need some salt.
SaltyBones 16:59:27
But if the server gives you the salt on join and you use that to generate the IDs it should be good again.
jonasw 16:59:30
that would be verifiable by the MUC and the MUC would drop stanzas which do not adhere to this schema
jonasw 16:59:38
(or rather, reject)
SaltyBones 16:59:49
and since the counter can be per-muc there is no issue with having it
jonasw 16:59:55
per client even
SaltyBones 16:59:57
I think :p
jonasw 17:00:02
Kev, ^
SaltyBones 17:00:03
well, yes per client and per muc
SaltyBones 17:00:11
just saying it doesn't seem to be a privacy issue
jonasw 17:00:14
SaltyBones, yeah
jonasw 17:00:32
so origin-id = one-way-function(presence-id, message-counter), where presence-id is assigned on MUC join
jonasw 17:00:52
only need to define what happens when message-counter wraps around or becomes large or something
jonasw 17:01:05
(same thing for SM by the way, SM has wraparound semantics)
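A rough sketch of origin-id = one-way-function(presence-id, message-counter) with the MUC verifying each ID. Everything here is an assumption for illustration: SHA-256 as the one-way function, a random 32-byte presence-id handed out on join, a 64-bit big-endian counter, and the `MucOccupant` class itself. Counter wraparound is deliberately left unhandled, since the chat leaves it open.

```python
import hashlib
import secrets


def origin_id(presence_id: bytes, counter: int) -> str:
    """Hypothetical one-way function over (presence-id, message-counter)."""
    # Fixed-width counter encoding keeps the concatenation unambiguous.
    # to_bytes raises OverflowError past 2**64 - 1; wraparound is not defined here.
    return hashlib.sha256(presence_id + counter.to_bytes(8, "big")).hexdigest()


class MucOccupant:
    """Per-client, per-room state the MUC would keep (hypothetical)."""

    def __init__(self):
        self.presence_id = secrets.token_bytes(32)  # assigned on join
        self.next_counter = 0                       # restarts on every join

    def accept(self, claimed_id: str) -> bool:
        """Accept the stanza only if its ID matches the next expected value."""
        expected = origin_id(self.presence_id, self.next_counter)
        if claimed_id != expected:
            return False  # the MUC would reject the stanza
        self.next_counter += 1
        return True


occupant = MucOccupant()
first_id = origin_id(occupant.presence_id, 0)
```

Since the MUC only has to remember one counter per occupant, it can reject reused or invented IDs without recording every ID it has ever seen.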
SaltyBones 17:01:39
O_o
SaltyBones 17:01:52
Y U SEND SO MANY MESSAGES!?!!
jonasw 17:02:01
SaltyBones, itym "y u have so long sessions"
SaltyBones 17:02:35
still, if this is defined somewhere we can probably renegotiate the salt at that point...
jonasw 17:02:36
while wrapping around a 64bit counter in a single session is sure challenging, we need to be prepared if this is to be solid :)
jonasw 17:02:40
yeah
zinid 17:03:04
I hear counter?
zinid 17:03:10
sorry, I don't track the discussion
jonasw 17:03:40
zinid, I’m not going to repeat everything you can read in the backlog :)
zinid 17:03:54
as you wish
zinid 17:04:00
not that I care
SaltyBones 17:04:08
counter works in this case because you can restart counting when rejoining the muc
zinid 17:04:14
whatever you will end up will be shit anyways, so
zinid 17:05:52
you already created 4 fucking ids: stanza-id, origin-id, attribute-id and SM id
zinid 17:06:01
maybe it's time to stop and think?
jonasw 17:06:18
what is an sm ID?
zinid 17:06:23
h
jonasw 17:06:25
right
jonasw 17:06:44
zinid, all of this is about reducing the number of IDs, so I think this is kinda what we’re doing?
zinid 17:07:06
jonasw, from what I see you want to add yet another counter
jonasw 17:07:16
zinid, to generate origin-ids, yes
jonasw 17:07:20
for MUCs and MIXes
jonasw 17:07:25
to make them verifiable by the service
jonasw 17:07:45
zinid, or rather, we’re replacing origin-id by some one-way-function(counter)
zinid 17:08:23
how this will cover sm id?
SaltyBones 17:09:11
We haven't discussed sm-id, yet. Only the other three.
SaltyBones 17:09:18
I actually have no clue what sm-id is. :)
zinid 17:09:31
SaltyBones, then I need to wait while you recognize the problem 😉
SaltyBones 17:09:57
zinid, so reducing 4 -> 2 is not enough, eh? :p
jonasw 17:10:18
zinid, sm-id stays, but stanza-id (and attribute-id) could potentially become one-way-function(stream-id, sm-id)
zinid 17:10:20
2 is enough, however I think first should be counter and second should be routing information, and not an ID
jonasw 17:10:52
stanza-id being used for the archive only, origin-id being used throughout the network together with the sender address for referencing a specific message
zinid 17:11:40
what ID will be used to sync?
zinid 17:12:53
with SM IDs you just create a pointless queue in c2s state
SaltyBones 17:15:24
Okay, I think we are trying to solve a problem that is very orthogonal to stream management. But we have only discussed this a bit and it might very well not work even for what we want it to do.
zinid 17:15:49
SaltyBones, I don't think SM should be separated from archive
zinid 17:15:52
it's pointless
SaltyBones 17:16:38
Why?
jonasw 17:16:53
zinid, stanza-id is used for sync
jonasw 17:16:56
(for MAM sync that is)
zinid 17:17:10
SaltyBones, because why you need this separation?
zinid 17:17:29
SaltyBones, you keep messages in MAM and then put them in c2s SM queue
zinid 17:17:48
why can't you just inc(counter), store in MAM and send via c2s?
SaltyBones 17:18:24
Why are you asking me that? I am 100 % sure that you know more about SM than I do!
zinid 17:18:56
what I know about SM is that I need to maintain stupid queues inside c2s processes, which sucks as hell
zinid 17:19:10
even though a client can request those messages via MAM
zinid 17:20:39
if you receive an ID out of order, you just reconnect and ask for everything starting from the ID you received
zinid 17:20:52
and server will send you this from archive
zinid 17:20:55
and vice versa
SaltyBones 17:22:01
But SM-IDs are per-stream not per-message, right?
SaltyBones 17:22:22
At least that's what it looks like in https://xmpp.org/extensions/xep-0198.html
zinid 17:22:22
when you connect to a server, you provide the last seen ID and it will resend you everything that's greater than this ID
zinid 17:22:39
SaltyBones, yes, they are totally separated instances
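The counter-based sync zinid describes (increment a counter, store in the archive, send; on a gap, reconnect and ask for everything after the last seen ID) might look roughly like this. It is a sketch under assumptions: the `Archive` and `Client` classes are hypothetical, and the counter is a plain per-account integer rather than anything defined by XEP-0198 or MAM.

```python
class Archive:
    """Hypothetical per-account archive with a monotonically increasing counter."""

    def __init__(self):
        self.counter = 0
        self.entries = []  # list of (counter, stanza) in storage order

    def store(self, stanza):
        self.counter += 1
        self.entries.append((self.counter, stanza))
        return self.counter

    def since(self, last_seen):
        return [(n, s) for n, s in self.entries if n > last_seen]


class Client:
    def __init__(self):
        self.last_seen = 0
        self.inbox = []

    def receive(self, n, stanza):
        """Return False when a gap is detected, so the caller can resync."""
        if n != self.last_seen + 1:
            return False
        self.last_seen = n
        self.inbox.append(stanza)
        return True

    def resync(self, archive):
        for n, stanza in archive.since(self.last_seen):
            self.receive(n, stanza)


archive = Archive()
ids = [archive.store(s) for s in ("a", "b", "c")]

client = Client()
client.receive(1, "a")
gap_detected = not client.receive(3, "c")  # stanza 2 was lost in transit
client.resync(archive)
```

In this model the client needs no separate acknowledgement queue on the server: the archive is the queue, which is the simplification being argued for, at the cost of deciding which stanza types get stored.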
SamWhited 17:22:57
What if the thing you missed was an IQ or a presence that isn't stored in MAM?
SamWhited 17:23:18
If you temporarily disconnect and miss something, SM acks allow you to find out. Doesn't help as much if it only covers messages.
zinid 17:23:26
SamWhited, I think we can drop them
zinid 17:23:46
SamWhited, we already don't care about IQs with Push, so why would we start caring?
zinid 17:23:57
try to make jingle call when I'm in "push" mode
SamWhited 17:24:15
Does that not work? That does seem like something we need to care about to me
zinid 17:24:33
SamWhited, but we don't and we can't with push stuff
jonasw 17:24:45
shouldn’t an IQ trigger a push? :-O
zinid 17:24:52
SamWhited, what if I want to receive your software version and you're in push mode?
SamWhited 17:24:56
That seems like a problem that needs fixing then, not an example of something being done right that we should copy.
zinid 17:25:29
anyway, you can keep IQs in MAM if you prefer
zinid 17:25:52
you need to keep subscription requests for sure there
zinid 17:26:11
we keep them already, but in a separate database due to historic reasons only
jonasw 17:28:43
IQs (which are inherent request-response, with exactly one response) in MAM sounds like a terrible idea.
SamWhited 17:29:47
yah, now you have to try and figure out which IQs are time sensitive, which are important to store, etc. it seems like a complicated way to work around having a stanza counter.
zinid 17:31:26
SamWhited, but you have to decide now too: for example, after 5 minutes of inactivity (by default) a server just bounces all IQs from the SM queue
SamWhited 17:31:47
I'm not saying it's perfect, just that this doesn't seem like a good fix.
zinid 17:32:02
a fix? the behaviour is the same
zinid 17:32:14
but we let the client decide which IQs to reply to
SamWhited 17:33:04
Except now you have to store all stanzas in MAM, or not store some stanzas and risk those being the ones that are dropped and you don't know you missed something. The point of SM is to make sure you know if you lose something, which is often important.
zinid 17:33:36
do you really want to lose an incoming call?
zinid 17:33:56
that's how it works now: if somebody calls you and you're not connected you lose any track
SamWhited 17:34:32
As I said, I'm not claiming that the current solution is good; just that this makes it worse.
zinid 17:34:42
no, it makes it better
zinid 17:35:06
you just need to define what to store and what not, well, it's too much to redesign, yes
SamWhited 17:35:31
If you don't store everything you now lose the ability to detect connection drops though
zinid 17:35:58
sigh
zinid 17:38:55
not sure how will you address missing call though
zinid 17:39:01
with your approach
SamWhited 17:40:12
I do not have an approach; I agree that is a problem that needs solving though.
SaltyBones 17:48:45
It seems like we need storage for management messages and storage for actual messages. I don't see why these should be mixed up.
zinid 17:55:19
we need to decide what to store and what not
SaltyBones 17:55:42
hm..that distinction was wrong, yeah
SaltyBones 17:55:56
there should be a storage until delivered and an optional long term storage
SaltyBones 17:56:04
users might not even want mam :p
zinid 17:56:19
if you don't want mam then just lose messages
SaltyBones 17:57:44
Okay, let me rephrase, some users might not want long-term storage of messages...
zinid 17:58:12
not sure why this should be specific to any approach
SaltyBones 17:59:01
Because I think that's what MAM is intended for and that's why using it for all messages as you suggest is strange and might have unexpected results if people built it under the assumption that it is for long term storage...
zinid 17:59:53
servers can drop MAM archives on reconnect for example, what's the problem?
SaltyBones 18:00:01
Okay...
zinid 18:00:04
or later, when it's delivered
SaltyBones 18:00:09
this is just me confusing MAM and the other thing again...
SaltyBones 18:00:16
for the one-millionth time...
Ge0rG 18:00:49
What if I want my messages both on my pc and my mobile? I can't just drop MAM when my pc is online
zinid 18:01:23
you can if everything is delivered
zinid 18:02:43
anyway, what you are trying to say is a partial replication, and this problem is very hard to resolve
SaltyBones 18:02:56
what is the other thing again, server side archiving?
SaltyBones 18:05:34
zinid, does MAM know if everything was delivered?
Ge0rG had a very concerning realization about guessable IDs and packet filters in smack earlier today. 18:07:00
zinid 18:07:41
SaltyBones, well, no, I think you cannot know that in general case
SaltyBones 18:07:54
zinid, but that is what SM does, right?
zinid 18:08:16
SaltyBones, well, if you introduce acks, then yes
SaltyBones 18:08:48
Hm.... :)
Holger 18:09:21
All this is orthogonal to the question whether having two separate stanza/message queues is sane. I agree with zinid that it isn't.
Zash 18:09:59
Two whatnow
Ge0rG 18:10:31
I'm saying for many years now that we need to replace 0184, 0198, 0280 and 0313 with one single proper message syncing thing.
Holger 18:10:41
SM is already mostly just an optimization, and I think we should fix the remaining issues to make stream management superfluous.
SaltyBones 18:10:49
I just want to point out that this is orthogonal to our earlier discussion about IDs :)
zinid 18:11:04
SaltyBones, I said "no" in the general case because we have the Byzantine Generals Problem, it's unresolvable; what we can guarantee is sequential consistency
Ge0rG 18:11:07
SaltyBones: please show your ID before being allowed into this room :P
SaltyBones 18:11:34
Holger, does anybody already know what those issues are?
Zash 18:12:59
Ge0rG: Kidnap some server devs and lock yourself in a room until that one single proper message syncing thing to rule them all is properly implemented and XEP'd
Holger 18:13:18
SaltyBones: Syncing of outgoing messages (in a sane way) and maybe avoiding some round trips during session startup.
zinid 18:13:36
Zash, and watch them die 😉
Ge0rG 18:14:14
Zash: I'll lock you and zinid up in that room, and watch the live stream
SaltyBones 18:16:10
Holger, so, can we just turn off stream management and leave MAM and see what happens to find out?
Holger 18:16:47
Sure you can :-)
SaltyBones 18:16:53
Or do you think fixing up MAM is a bad idea and something new is required?
Zash 18:18:04
SaltyBones: That's what I did, actually. Haven't died from SM-lessness yet.
Holger 18:18:51
SaltyBones: I think you can already implement proper sync with MAM as-is. But something new is required to let clients implement this in a sane way, without having to de-duplicate and whatnot.
zinid 18:19:09
that's my point: what we need is to store messages and some other restricted stuff and call it a day
SaltyBones 18:21:40
Holger, so would unique message IDs solve this?
SaltyBones 18:23:59
I mean, at least de-duplication would be reasonably easy then, I guess.
SaltyBones 18:24:27
Of course some sort of counter makes more sense in this case...
Zash 18:25:04
SM has counters. MAM has server-issued, guaranteed-to-be-unique message ids.
SaltyBones 18:25:25
Why are IDs not a counter?
Holger 18:25:31
Not sure what sort of uniqueness we need to solve what problem. Didn't read the backlog, sorry :-) The thing missing for proper sync of outgoing messages is an algorithm to compute the MAM IDs of outgoing messages.
SaltyBones 18:25:37
I mean the MAM-IDs....or is that the same as the stanza-ids?
Holger 18:25:47
(Sorry, typed this before reading the last few messages.)
zinid 18:25:55
Zash, now answer the question "get me last messages I didn't receive" with current SM and MAM approach 😉
SaltyBones 18:25:59
Holger, what do you mean by outgoing? Messages that we sent?
Holger 18:26:07
Yes.
zinid 18:26:31
if you say "time", then no
Zash 18:26:34
zinid: query after = id of last message I saw
SaltyBones 18:26:35
Okay, that should be solved by the hash idea....if it works :p
Zash 18:26:52
cry over outgoing messages sent after that
Holger 18:27:11
SamWhited: I need to know their IDs so I can tell the server "give me all messages since $id".
SaltyBones 18:27:12
But indeed if we want to query MAM by a "point in time" or "counter" unique IDs are not really the best thing :)
SaltyBones 18:27:54
Holger, but then the server still has to search the MAM archive for that ID and give you everything after it....
zinid 18:27:55
Zash, define after in distributed system 😉
SaltyBones 18:27:59
So a counter would be much better.
Holger 18:28:13
SaltyBones: Sure?
Zash 18:28:19
zinid: MAM archive ids on incoming messages
SaltyBones 18:28:21
Right?
Zash 18:28:26
zinid: after is a MAM term
zinid 18:28:34
Zash, but you need some ordering
Zash 18:29:38
zinid: huh
Zash 18:29:54
zinid: XMPP streams are ordered
zinid 18:30:53
yes, but you need to maintain a timestamp index in the database
Zash 18:31:05
You've lost me
zinid 18:32:26
well, you're probably right that if we assume timestamp ordering then we don't need counters and SM at all
Zash 18:33:09
Huh?
Zash 18:33:29
In a MAM query, 'after' is a field that carries a MAM archive ID
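[Editor's sketch, not part of the conversation: roughly what such a query could look like on the wire, built with Python's standard library. It assumes the `urn:xmpp:mam:2` namespace and Result Set Management (XEP-0059) paging; the function name `build_mam_query` and the example IDs are hypothetical.]

```python
import xml.etree.ElementTree as ET

MAM_NS = "urn:xmpp:mam:2"                   # assumed MAM namespace version
RSM_NS = "http://jabber.org/protocol/rsm"   # Result Set Management


def build_mam_query(last_seen_id: str, query_id: str = "q1") -> str:
    """Build an IQ asking the archive for everything after a given archive ID."""
    iq = ET.Element("iq", {"type": "set", "id": query_id})
    query = ET.SubElement(iq, f"{{{MAM_NS}}}query")
    rsm = ET.SubElement(query, f"{{{RSM_NS}}}set")
    # 'after' carries the archive ID of the last message the client saw.
    after = ET.SubElement(rsm, f"{{{RSM_NS}}}after")
    after.text = last_seen_id
    return ET.tostring(iq, encoding="unicode")
```

[Note that the ID is opaque to the client: the server decides what "after" means for its own archive, which is the ordering question zinid keeps raising.]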
zinid 18:33:50
so, what's the ordering? 🙂
zinid 18:34:02
ID+1?
zinid 18:34:23
from what I understand you use timestamp+id, which means time ordering
Zash 18:34:25
Ordering?
Zash 18:34:47
As I said, you lost me. I have no idea what any of us are talking about anymore.
zinid 18:34:57
ok
Ge0rG 19:10:06
https://summerofcode.withgoogle.com/organizations/?sp-page=5 no xsf?
zinid 19:10:46
BEAM community is there 😛
moparisthebest 19:21:44
SaltyBones: like 98% of what you were talking about is here https://github.com/moparisthebest/xmpp-ircd
moparisthebest 19:22:55
Basically it works if IRC users don't want nickserv or chanserv but they do and I never got back to it :)
SaltyBones 19:23:37
moparisthebest, you mean of a way to connect to a muc using the irc protocol?
moparisthebest 19:23:43
Yes
moparisthebest 19:24:16
Running that makes a muc look like an IRC server to an IRC client
SaltyBones 19:24:42
I'm pretty sure nobody wants that. At least I don't! :D
SaltyBones 19:24:55
But I think it might provide an excuse for people who have to convince irc users ;)
moparisthebest 19:28:11
SaltyBones: yesterday you said
moparisthebest 19:28:14
Hm...maybe the transport should be the other way round. Offer an IRC server that connects to MUCs.
moparisthebest 19:28:25
That's what I was referring to
vanitasvitae 19:28:25
Ignite didn't make it into GSoC
vanitasvitae 19:28:28
:(
SaltyBones 19:32:30
moparisthebest, I know, I said that regarding the discussion of the KDE folks looking for an IM solution
Flow 19:32:53
> * Ge0rG had a very concerning realization about guessable IDs and packet filters in smack earlier today
Care to share?
Flow 19:33:34
Or is it just something in the ancient smack library yaxim uses?
moparisthebest 19:35:08
SaltyBones, right so they could use a MUC, but also have an IRC server that IRC users could use
Dave Cridland 19:35:15
moparisthebest, I like this, BTW.
moparisthebest 19:35:18
and everyone would end up in the same place, but it'd be a muc
Dave Cridland 19:35:37
moparisthebest, Is the GPLv3 your addition or from telepaatti?
Dave Cridland 19:37:10
Anyone know what servers support XEP-0288 these days?
Dave Cridland 19:37:26
Does this Prosody instance, for example?
moparisthebest 19:37:34
Dave Cridland, looks like the original telepaatti is gone, but GPLv3 goes back to at least the next fork looks like
moparisthebest 19:38:20
honestly I kind of abandoned it because I like rust so much more than python nowadays but can't be asked to rewrite everything yet haha :)
Flow 19:38:22
-xep 288
Bunneh 19:38:23
Flow: Bidirectional Server-to-Server Connections (Standards Track, Draft, 2016-10-17) See: https://xmpp.org/extensions/xep-0288.html
moparisthebest 19:38:29
I still run an IRC server I would love to rm -rf
SamWhited 19:38:56
> I need to know their IDs so I can tell the server "give me all messages since $id".
I know, apparently I wasn't clear about something, sorry. I'm not suggesting we get rid of MAM or leave everything exactly as it is today, just that MAM doesn't cover some important parts of SM and probably can't be made to cover it without significant downsides.
Ge0rG 19:39:06
Flow: it's affecting the old smack for sure, I'll have a look into smack 4 and let you know
SamWhited 19:39:16
(sorry for the long delay, got pulled into a meeting and then was AFK for a bit)
Dave Cridland greps his logs 19:40:22
Dave Cridland 19:40:37
So there's actually only one server I talk to that does bidi. That's scary.
Flow 19:45:32
Ge0rG, let me know if you didn't just re-discover CVE-2014-0364
Ge0rG 19:53:45
Flow: it's related but different
Zash 19:54:27
Dave Cridland: Is it mine? :)
Dave Cridland 19:55:37
Zash, No, Lance's. Although actually grep might have gone into Annoying Binary Mode.
SaltyBones 19:55:56
-I ?
Dave Cridland 19:56:18
-a, but yeah.
Dave Cridland 19:56:55
OK, so that's actually lots of servers doing bidi. Happy bunny, now. I've been putting the support into Metre.
Zash 19:57:15
Oh? Hm, feeling up for doing a survey?
Dave Cridland 19:57:39
Zash, What sort of survey?
Zash 19:57:54
"How many servers do 288?"
Zash 19:58:06
Hooooold on now
Zash 19:58:13
Is that the same number as ...
Zash 19:58:44
https://www.youtube.com/watch?v=azEvfD4C6ow !!!
Dave Cridland 20:00:53
Zash, Looks like it's PSYC and Prosody.
Zash 20:01:28
Prosody doesn't do it out of the box, you need to go a bit out of your way to install a community module.
Dave Cridland 20:01:59
Really? Quite a few people have, then.
Zash 20:05:35
Dave Cridland: Question is, how much self-selection bias is there among people that have you in their roster? :)
Dave Cridland 20:05:43
Zash, Lots.
Dave Cridland 21:36:02
.
Zash 21:36:11
,
Kev 21:52:59
M-Link does bidi, but we disabled it because of bug reports from Prosody about us not accepting stanzas down the right streams. Some of which I'm starting to question :)
Dave Cridland 21:53:28
:-)
dwd 22:41:46
So, Metre now does Bidi.
dwd 22:54:25
OK, this is weird. Metre is successfully negotiating Bidi with various Prosody servers. OK, great. But absolutely nothing ever tries to negotiate bidi with it, despite it offering the feature.
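[Editor's sketch, not part of the conversation: a minimal illustration of the XEP-0288 negotiation being described. The receiving server advertises a `bidi` stream feature, and the initiating server opts in by sending a `bidi` element before authenticating. The helper names `offers_bidi` and `bidi_request` are hypothetical; this says nothing about how Metre or Prosody implement it.]

```python
import xml.etree.ElementTree as ET

BIDI_FEATURE_NS = "urn:xmpp:features:bidi"  # advertised in stream features
BIDI_NS = "urn:xmpp:bidi"                   # sent by the initiator to opt in


def offers_bidi(features_xml: str) -> bool:
    """Return True if a <stream:features/> element advertises bidi."""
    features = ET.fromstring(features_xml)
    return features.find(f"{{{BIDI_FEATURE_NS}}}bidi") is not None


def bidi_request() -> str:
    """Element the initiating server sends to request a bidirectional stream."""
    return f"<bidi xmlns='{BIDI_NS}'/>"
```

[Seeing the feature offered but never requested, as reported above, would mean the remote side never sends the second element on streams it initiates.]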