jdev - 2020-08-11


  1. Zash

    mod_smacks did have such limits for a short time, but they caused exactly this problem and were then removed until someone could come up with a better way to deal with it

  2. Ge0rG

    was that the thing that made my server explode due to an unbound smacks queue? ;)

  3. Zash

    I got the impression that yesterday's discussion was about the opposite problem, killing the session once the queue is too large

  4. Zash

    So, no, not that issue.

  5. lovetox

    it turned out the user who reported that problem had a queue size of 1000

  6. Ge0rG

    what lovetox describes sounds like a case of too much burst traffic, not of socket synchronization issues

  7. lovetox

    the current ejabberd default is much higher

  8. lovetox

    but a few versions ago it was 1000

  9. Ge0rG

    if you join a dozen MUCs at the same time, you might well run into a 1000 stanza limit

  10. lovetox

    1000 is like nothing

  11. lovetox

    you can't even join one IRC room like #ubuntu or #python

  12. Ge0rG

    lovetox: only join MUCs one at a time ;)

  13. lovetox

    you get instantly disconnected

  14. Ge0rG

    Matrix HQ

  15. Ge0rG

    unless the bridge is down ;)

  16. lovetox

    the current ejabberd default is 5000

  17. lovetox

    which until now works ok

  18. Ge0rG

    I'm sure the Matrix HQ has more users. But maybe it's slow enough in pushing them over the bridge that you can fetch them from your server before you are killed

  19. Kev

    Ge0rG: Oh, you're suggesting that it's a kill based on timing out an ack because the server is ignoring that it's reading stuff from the socket that's acking old stanzas, rather than timing out on data not being received?

  20. Kev

    That seems plausible.

  21. Ge0rG

    Kev: I'm not sure it has to do with the server's reading side of the c2s socket at all

  22. Holger <- still trying to parse that sentence :-)

  23. Zash

    Dunno how ejabberd works but that old mod_smacks version just killed the session once it hit n queued stanzas.

  24. Ge0rG

    Holger: I'm not sure I understood it either

  25. Ge0rG

    Kev: I think it's about N stanzas suddenly arriving for a client, with N being larger than the maximum queue size

  26. Holger

    Zash: That's how ejabberd works. Yes that's problematic, but doing nothing is even more so, and so far I see no better solution.

  27. Kev

    Ah. I understand now, yes.

  28. Ge0rG

    Holger: you could add a time-based component to that, i.e. allow short bursts to exceed the limit

  29. Ge0rG

    give the client a chance to consume the burst and to ack it
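
    A minimal sketch of the time-based allowance Ge0rG suggests here: a soft cap that short bursts may exceed for a grace period, plus a hard cap that always kills the session. Everything below (names, limits, the grace window) is illustrative, not how mod_smacks or ejabberd actually implement it.

```python
import time

class SmacksQueue:
    """Illustrative unacked-stanza queue with a burst grace window (hypothetical)."""

    def __init__(self, soft_limit=1000, hard_limit=5000, grace_seconds=30):
        self.soft_limit = soft_limit     # steady-state cap
        self.hard_limit = hard_limit     # absolute cap, always enforced
        self.grace_seconds = grace_seconds
        self.queue = []
        self.over_soft_since = None      # when we first exceeded the soft cap

    def push(self, stanza, now=None):
        now = now if now is not None else time.monotonic()
        self.queue.append(stanza)
        if len(self.queue) > self.hard_limit:
            return "kill-session"        # even a burst must not exceed this
        if len(self.queue) > self.soft_limit:
            if self.over_soft_since is None:
                self.over_soft_since = now   # burst started; give the client time to ack
            elif now - self.over_soft_since > self.grace_seconds:
                return "kill-session"    # burst lasted too long without enough acks
        return "ok"

    def ack(self, acked_count):
        del self.queue[:acked_count]
        if len(self.queue) <= self.soft_limit:
            self.over_soft_since = None  # back under the limit, reset the grace timer
```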

  30. Holger

    I mean if you do nothing it's absolutely trivial to OOM-kill the entire server.

  31. Ge0rG

    Holger: BTDT

  32. Zash

    Ge0rG: Wasn't that a feedback loop tho?

  33. Ge0rG

    Zash: yes, but a queue limit would have prevented it

  34. Kev

    It's not clear to me how you solve that problem reasonably.

  35. Ge0rG

    Keep an eye on the largest known MUCs, and make the limit slightly larger than the combined occupant count of the top 5 rooms

  36. Ge0rG

    And by MUCs, I also mean bridged rooms of any sort

  37. Holger

    Get rid of stream management :-)

  38. Zash

    Get rid of the queue

  39. Ge0rG

    Get rid of clients

  40. Kev

    I think this is only related to stream management, no? You end up with a queue somewhere?

  41. Zash

    Yes! Only servers!

  42. jonas’

    Zash, peer to peer?

  43. Zash

    NOOOOOOOO

  44. Ge0rG

    Holger, Zash: we could implement per-JID s2s backpressure

  45. Zash

    Well no, but yes

  46. Holger

    Kev: You end up with stored MAM messages.

  47. Ge0rG

    s2s 0198, but scoped to individual JIDs

  48. Ge0rG

    also that old revision that allowed requesting throttling from the remote end

  49. Zash

    You could make it so that resumption is not possible if there's more unacked stanzas than a (smaller) queue size

  50. Zash

    At some point it's going to be just as expensive to start over with a fresh session

  51. Ge0rG

    Zash: a client that auto-joins a big MUC on connect will surely cope with such invisible limits

  52. Holger

    Where you obviously might want to implement some sort of disk storage quota, but that's less likely to be too small for clients to cope. Also the burst is often just presence stanzas, which we might be able to reduce/avoid some way.

  53. Zash

    Soooo, presence based MUC is the problem yet again

  54. Holger

    Anyway, until you guys fix all these things for me, I'll want to have a queue size limit :-)

  55. Zash

    I remember discussing MUC optimizations, like skipping most initial presence for large channels

  56. Ge0rG

    we need incremental presence updates.

  57. Holger

    ejabberd's room config has an "omit that presence crap altogether" knob. I think p1 customers usually press that and then things suddenly work.

  58. eta

    isn't there an XEP for room presence list deltas

  59. eta

    I also don't enjoy getting megabytes of presence upon joining all the MUCs

  60. Zash

    eta: Yeah, XEP-0436 MUC presence versioning

  61. eta

    does anyone plan on implementing it?

  62. Zash

    I suspect someone is. Not me tho, not right now.

  63. Zash

    Having experimented with presence deduplication, I got the feeling that every single presence stanza is unique, making deltas pretty large

  65. eta

    oh gods

  66. Zash

    And given the rate of presence updates in the kind of MUC where you'd want optimizations... not sure how much deltas will help.

  67. Holger

    Yeah I was wondering about the effectiveness for large rooms as well.

  68. Zash

    Just recording every presence update and replaying it like MAM sure won't do. Actual diff will be better, but will it be enough?
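
    For illustration only (this is not the XEP-0436 wire format): a small sketch of diffing two occupant-presence snapshots, which also shows why the delta stays large when nearly every payload changes.

```python
def presence_delta(old, new):
    """Diff two {nickname: presence_payload} snapshots of a room (illustrative)."""
    joined  = {nick: new[nick] for nick in new.keys() - old.keys()}
    left    = sorted(old.keys() - new.keys())
    changed = {nick: new[nick]
               for nick in old.keys() & new.keys()
               if old[nick] != new[nick]}
    return {"joined": joined, "left": left, "changed": changed}

# If every occupant's payload differs between snapshots (caps hash, idle time,
# status text, ...), "changed" contains nearly everyone and the delta is barely
# smaller than sending the full occupant list again.
```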

  69. Zash

    Would be nice to have some kind of numbers

  70. Ge0rG

    So we need to split presence into "room membership updates" and "live user status updates"?

  71. Zash

    MIX?

  72. Zash

    Affiliation updates and quitjoins are easy enough to separate

  73. Ge0rG

    and then we end up with matrix-style rooms, and some clients joining and leaving the membership all the time

  74. Zash

    So we have affiliations, currently present nicknames (ie roles) and presence updates

  75. Zash

    I've been thinking along the lines of that early CSI presence optimizer, where you'd only send presence for "active users" (spoke recently or somesuch). Would be neat to have a summary-ish stanza saying "I just sent you n out of m presences"
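
    A rough sketch of that "only send presence for active users" idea plus an "n out of m" summary; the activity window, page size and data shapes are made up for illustration.

```python
from datetime import datetime, timedelta

def select_initial_presences(occupants, active_window=timedelta(minutes=30), page_size=50):
    """Pick which occupant presences to send on join: recently active ones first.

    `occupants` is a list of (nick, last_spoke, presence) tuples, last_spoke being
    a datetime or None. Returns (presences_to_send, summary), where summary is the
    "I just sent you n out of m presences" hint. Illustrative only.
    """
    cutoff = datetime.utcnow() - active_window
    active = [o for o in occupants if o[1] is not None and o[1] >= cutoff]
    page = sorted(active, key=lambda o: o[1], reverse=True)[:page_size]
    summary = {"sent": len(page), "total": len(occupants)}
    return [o[2] for o in page], summary
```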

  76. Zash

    You could also ignore pure presence updates from unaffiliated users and that kind of thing

  77. Ge0rG

    also you only want to know the total number of users and the first page full of them, the other ones aren't displayed anyway ;)

  78. Zash

    Yeah

  79. flow

    Zash> Soooo, presence based MUC is the problem yet again
    I think the fundamental design problem is pushing stanzas instead of recipients requesting them. Think for example a participant of a high-traffic MUC using a low-throughput connection (e.g. GSM). That MUC could easily kill the participant's connection

  80. Zash

    You do request them by joining.

  81. flow

    Zash, sure, let me clarify: requesting them in smaller batches (e.g. MAM pagination style)

  83. Zash

    You just described how Matrix works btw

  84. flow

    I did not know that, but it appears like one (probably sensible) solution to the flow control / traffic management problem we have

  85. jonas’

    or like MIX ;D

  86. Ge0rG

    let's just do everything in small batches.

  87. flow

    correct me if I am wrong, but MIX's default modus operandi is still to fan-out all messages

  88. jonas’

    I think only if you subscribe to messages

  89. jonas’

    also, I thought we were talking about *presence*, not messages.

  90. flow

    I think the stanza kind does not matter

  91. flow

    if someone sends you stanzas at a higher rate than you can consume, some intermediate queue will fill

  92. jonas’

    yeah, well, that’s true for everything

  93. flow

    hence I wrote "fundamental design problem"

  94. jonas’

    I can see the case for MUC/MIX presence because that’s a massive amplification (you send single presence, you get a gazillion and a continuous stream back)

  95. jonas’

    yeah, no, I don’t believe in polling for messages

  96. Kev

    The main issue is catchup.

  97. jonas’

    if you’re into that kind of stuff, use BOSH

  98. flow

    I did not say anything about polling

  99. Kev

    Whether when you join you receive a flood of everything, or whether you request stuff when you're ready for it, in batches.

  100. Kev

    Using MAM on MIX is meant to give you the latter.
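
    The "request stuff in batches when you're ready" approach maps onto a MAM (XEP-0313) query limited with RSM (XEP-0059). A hand-rolled sketch of one such page follows; a real client would use its XMPP library's stanza objects rather than string concatenation.

```python
from xml.sax.saxutils import escape

def mam_page_query(query_id, page_size=50, after_id=None):
    """Build one paged MAM query (XEP-0313) limited via RSM (XEP-0059)."""
    rsm = f"<max>{page_size}</max>"
    if after_id is not None:
        # resume after the last archive id we already processed
        rsm += f"<after>{escape(after_id)}</after>"
    return (
        f"<iq type='set' id='{escape(query_id)}'>"
        f"<query xmlns='urn:xmpp:mam:2' queryid='{escape(query_id)}'>"
        f"<set xmlns='http://jabber.org/protocol/rsm'>{rsm}</set>"
        f"</query></iq>"
    )

# The client only asks for the next page once it has processed the previous one,
# so a slow link never has more than `page_size` messages in flight.
```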

  101. flow

    and yes, the problem is more likely caused by presence stanzas, but could be caused by IQs or messages as well

  102. Kev

    If you have a room that is itself generating 'live' stanzas at such a rate that it fills queues, that is also a problem, but is distinct from the 'joining lots of MUCs disconnects me' problem.

  103. flow

    Kev, using the user's MAM service or the MIX channel's MAM service?

  104. Kev

    Both use the same paging mechanic.

  105. jonas’

    12:41:06 flow1> Zash, sure, let me clarify: requesting them in smaller batches (e.g. MAM pagination style)
    how is that not polling then?

  106. jonas’

    though I sense that this is a discussion about semantics I don’t want to get into right now.

  107. flow

    right, I wanted to head towards the question on how to be notified that there are new messages that you may want to request

  108. jonas’

    by receiving a <message/> with the message.

  109. flow

    that does not appear to be a solution, as you easily run into the same problem

  110. jonas’

    [citation needed]

  111. flow

    I was thinking more along the lines of infrequent/slightly delayed notifications with the current stanza/message head IDs

  112. Holger

    MAM/Sub!

  114. flow

    but then again, it does not appear to be an elegant solution (or potentially is no solution at all)

  115. Zash

    Oh, this is basically the same problem as IP congestion, is it not?

  116. Zash

    And the way to solve that is to throw data away. Enjoy telling your users that.

  117. Zash

    > The main issue is catchup.
    This. So now you'll have to figure out what data got thrown away and fetch it.

  118. Zash

    (Also how Matrix works.)

  119. eta

    the one thing that may be good to steal from matrix is push rules

  120. eta

    i.e. some server side filtering you can do to figure out what should generate a push notification

  121. Zash

    Can you rephrase that in a way that doesn't make me want to say "but they stole this from us"

  122. eta

    well so CSI filtering is an XMPP technology, right

  123. eta

    but there's no API to extend it

  124. eta

    like you can't say "please send me everything matching the regex /e+ta/"

  125. Zash

    "push rules" meaning what, exactly?

  126. pep.

    Zash: it's just reusing good ideas :p

  127. Zash

    You said "push notifications", so I assumed "*mobile* push notifications"

  128. Ge0rG

    Zash: a filter that the client can define to tell the server what's "important"

  129. Zash

    AMP?

  130. eta

    Zash, so yeah, push rules are used for mobile push notifications in Matrix

  131. Zash

    Push a mod_firewall script? 🙂

  132. Ge0rG

    for push notifications, the logic is in the push server, which is specific to the client implementation

  133. Zash

    eta: So you mean user-configurable rules?

  134. eta

    Zash, yeah

  135. Ge0rG

    not rather client-configurable?

  136. eta

    I mean this is ultimately flawed anyway because e2ee is a thing

  137. Zash

    Everything is moot because E2EE

  138. Ge0rG

    I'm pretty sure there is no place in matrix where you can enter push rule regexes

  139. pulkomandy

    Is the problem really to be solved on the client-server link? What about some kind of flow control on the s2s side instead? (no idea about the s2s things in xmpp, so maybe that's not doable)

  140. eta

    Ge0rG, tada https://matrix.org/docs/spec/client_server/r0.6.1#m-push-rules

  141. Zash

    Ge0rG: Keywords tho, which might be enough

  142. eta

    you can have a "glob-style pattern"

  143. Zash

    Ugh

  144. Ge0rG

    eta: that's not what I mean

  145. Ge0rG

    eta: show me a Riot screenshot where you can define those globs

  146. eta

    Ge0rG, hmm, can't you put them into the custom keywords field

  147. pulkomandy

    If you try to solve it on the client side you will invent something like TCP windows, which is indeed a way to solve IP congestion, and doesn't work here because congestion on the server-to-client socket doesn't propagate to other links

  148. eta doesn't really care about this argument though and is very willing to just concede to Ge0rG :p

  149. Zash

    What was that thing in XEP-0198 that got removed? Wasn't that rate limiting?

  150. Ge0rG

    Zash: yes

  151. eta

    I think the presence-spam-in-large-MUCs issue probably needs some form of lazy loading, right

  152. eta

    like, send user presence before they talk

  153. eta

    have an API (probably with RSM?) to fetch all user presences

  154. Zash

    eta: Yeah, that's what I was thinking

  155. eta

    the matrix people had pretty much this exact issue and solved it the same way

  156. Zash

    Oh no, then we need to do it differently!!11!!11!!1 eleven

  157. eta

    Zash, it's fine, they use {} brackets and we'll use <> ;P

  158. Zash

    Phew 😃

  159. eta

    the issue with lots of messages in active MUCs is more interesting though

  160. eta

    like for me, Conversations chews battery because I'm in like 6-7 really active IRC channels

  161. eta

    so my phone never sleeps

  162. eta

    I've been thinking I should do some CSI filtering, but then the issue is you fill up the CSI queue

  163. Zash

    A thing I've almost started stealing from Matrix is room priorities.

  164. Zash

    So I have a command where I can mark public channels as low-priority, and then nothing from those gets pushed through CSI

  165. Ge0rG

    eta: the challenge here indeed is that all messages will bypass CSI, which is not perfect

  166. eta

    Zash, yeah, there's that prosody module for that

  167. Ge0rG

    eta: practically speaking, you might want to have a wordlist that MUC messages must match to be pushed

  168. eta

    I almost feel like the ideal solution is something more like

  169. eta

    I want the server to join the MUC for me

  170. eta

    I don't want my clients to join the MUC (disable autojoin in bookmarks)

  171. eta

    and if I get mentioned or something, I want the server to somehow forward the mentioned message

  172. Ge0rG

    eta: your client still needs to get all the MUC data, eventually

  173. eta

    Ge0rG, sure

  174. eta

    but, like, I'll get the forwarded message with the highlight

  175. eta

    then I can click/tap on the MUC to join it

  176. Ge0rG

    eta: so CSI with what Zash described is actually good

  177. eta

    and then use MAM to lazy-paginate

  178. eta

    Ge0rG, yeah, but it fills up in-memory queues serverside

  179. Ge0rG

    eta: but I think that command is too magic for us mortals

  180. Ge0rG

    eta: yes, but a hundred messages isn't much in the grand scheme of things

  181. eta

    Ge0rG, a hundred is an underestimate ;P

  182. eta

    some of the IRC channels have like 100 messages in 5 minutes or something crazy

  183. Holger

    https://jabber.fu-berlin.de/share/holger/EuIflBOiuR0UyOtA/notifications.jpeg

  184. Holger

    C'mon guys this is trivial to solve.

  185. Ge0rG

    my prosody is currently consuming ~ 500kB per online user

  186. Holger

    https://jabber.fu-berlin.de/share/holger/aIlgwvzEMWv66zF9/notifications.jpeg

  187. Holger

    Oops.

  188. eta

    Zash, also ideally that prosody module would use bookmarks

  189. eta

    instead of an ad-hoc command

  190. Ge0rG

    eta: naah

  191. Zash

    Bookmarks2 with a priority extension would be cool

  192. Ge0rG

    we need a per-JID notification preference, like "never" / "always" / "on mention" / "on string match"

  193. Ge0rG

    which is enforced by the server
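
    A sketch of that per-JID preference applied as a server-side check on whether a groupchat message should pierce CSI; the room JIDs, modes and function names are hypothetical, not an existing module.

```python
import re

# Per-room preference, following Ge0rG's "never" / "always" / "on mention" / "on string match".
PREFS = {
    "bigroom@muc.example.org": {"mode": "on mention"},
    "noisy@muc.example.org":   {"mode": "never"},
    "alerts@muc.example.org":  {"mode": "on string match", "pattern": r"deploy|outage"},
}

def pierces_csi(room_jid, body, my_nick, prefs=PREFS):
    """Decide server-side whether a groupchat message should wake the client."""
    pref = prefs.get(room_jid, {"mode": "always"})
    mode = pref["mode"]
    if mode == "never":
        return False
    if mode == "on mention":
        return my_nick.lower() in (body or "").lower()
    if mode == "on string match":
        return re.search(pref["pattern"], body or "", re.IGNORECASE) is not None
    return True   # "always" and anything unrecognised pass through
```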

  194. eta

    Ge0rG: that's a different thing though

  195. Ge0rG

    eta: is it really?

  196. Ge0rG

    eta: for mobile devices, CSI-passthrough is only relevant for notification causing messages

  197. eta

    Ge0rG: ...actually, yeah, I agree

  198. Ge0rG

    you want to get pushed all the messages that will trigger a notification

  199. Ge0rG

    which ironically means that all self-messages get pushed through so that the mobile client can *clear* notifications

  200. Ge0rG

    which ironically also pushes outgoing Receipts

  201. Ge0rG

    eta: I'm sure I've written a novel or two on standards regarding that

  202. Ge0rG

    or maybe just in the prosody issue tracker

  203. Ge0rG

    eta: also CSI is currently in Last Call, so feel free to add your two cents

  204. Zash

    Ironically?

  205. Ge0rG isn't going to re-post his "What's Wrong with XMPP" slide deck again

  206. Ge0rG

    Also the topic of notification is just a TODO there.

  207. Zash

    Heh

  208. Zash

    > you want to get pushed all the messages that will trigger a notification
    and that's roughly the same set that you want archived and carbon'd, I think, but not exactly

  209. eta

    Ge0rG: wait that sounds like an interesting slide deck

  210. eta

    Zash: wild idea, just maintain a MAM archive for "notifications"

  211. eta

    I guess a pubsub node would also work

  212. eta

    and you shove all said "interesting" messages in there

  213. Ge0rG

    eta: https://op-co.de/tmp/whats-wrong-with-xmpp-2017.pdf

  214. Zash

    eta: MAM for the entire stream?

  215. Zash

    Wait, what's "notifications" here?

  216. Zash

    Stuff that causes the CSI queue to get flushed? Most of that'll be in MAM already.

  217. eta

    Zash: well mentions really

  218. Ge0rG

    eta: MAM doesn't give you push though

  219. eta

    Ge0rG: okay, after reading those slides I'd say that's a pretty good summary and proposal

  220. Ge0rG

    eta: all it needs is somebody to implement all the moving parts

  221. Zash

    Break it into smaller (no, even smaller!) pieces and file bug reports?

  222. Zash

    /correct feature requests*

  223. Ge0rG

    when I break it into pieces this small, the context gets lost

  224. Ge0rG

    like just now I realized there might be some smarter way to handle "sent" carbons in CSI, than just passing all through

  225. Zash

    One huge "do all these things" isn't great either

  226. Ge0rG

    but maybe a sent carbon of a Receipt isn't too bad after all because it most often comes shortly after the original message that also pierced CSI?

  227. Ge0rG

    did I mention that I'm collecting large amounts of data on the number of CSI wakeups and the reasons for them?

  228. Zash

    Possibly

  229. Ge0rG

    and that the #1 reason used to be disco#info requests to the client?

  230. Zash

    Possibly (re carbon-receipts)

  231. Zash

    Did I mention that I too collected stats on that, until I discovered that storing stats murdered my server?

  232. Ge0rG

    I'm only "storing" them in prosody.log, and that expires after 14d

  233. Ge0rG

    but maybe somebody wants to bring them to some use?

  234. Zash

    disco#info cache helped a *lot* IIRC

  235. Zash

    I also found that a silly amount of wakeups were due to my own messages on another device, after which I wrote a grace period thing for that.

  236. Zash

    IIRC before I got rid of stats collection it was mostly client-initiated wakeups that triggered CSI flushes

  237. Ge0rG

    Zash: "own messages on other device" needs some kind of logic maybe

  238. Ge0rG

    like: remember the last message direction per JID, only wake up on outgoing read-marker / body when direction changes?

  239. Zash

    Ge0rG: Consider me, writing here, right now, on my work station. Groupchat messages sent to my phone.

  240. Ge0rG

    just waking up on outgoing read-marker / body would be a huge improvement already

  241. Ge0rG

    Zash: yes, that groupchat message is supposed to clear an eventual notification for the groupchat

  242. Ge0rG

    that = your

  243. Zash

    After the grace period ends, if there were anything high-priority since the last activity from that other client, then it should push.
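
    A sketch of the grace-period idea Zash describes: while another of the user's devices is active, hold high-priority stanzas for the idle one, then push whatever accumulated once the window expires. The class, names and 60-second window are illustrative.

```python
import time

class GracePeriod:
    """Illustrative grace-period tracker for one idle device of a user."""

    def __init__(self, seconds=60):
        self.seconds = seconds
        self.other_device_active_at = float("-inf")
        self.pending = []

    def note_other_device_activity(self, now=None):
        self.other_device_active_at = now if now is not None else time.monotonic()

    def on_high_priority(self, stanza, now=None):
        now = now if now is not None else time.monotonic()
        if now - self.other_device_active_at < self.seconds:
            self.pending.append(stanza)   # the user is looking at another screen
            return "hold"
        return "push"

    def on_grace_expired(self):
        """Anything held during the window should now be pushed to the idle device."""
        held, self.pending = self.pending, []
        return held
```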

  244. Zash

    Not done that yet tho I think

  245. Zash

    But as long as I'm active at another device, pushing to the phone is of no use

  246. Zash

    Tricky to handle the case of an incoming message just after typing "brb" and grabbing the phone to leave

  247. Zash

    Especially with a per-stanza yes/no/maybe function, it'll need a "maybe later" response

  248. Ge0rG

    Zash: yeah. Everything is HARD

  249. eta

    also, for all Slack's complicated diagrams, their notifications don't even work properly either

  250. eta

    like it doesn't dismiss them on my phone, etc

  251. flow

    Zash> And the way to solve that is to throw data away. Enjoy telling your users that.
    I'd say that's why there is TCP on top of IP (where, I'd argue, the actual congestion and traffic flow control happens)

  252. Zash

    flow: With TCP, same as XMPP, you just end up filling up buffers and getting OOM'd

  253. flow

    Zash, I don't think those two are really comparable: with tcp you have exactly two endpoints, with xmpp one entity communicates potentially with multiple endpoints (potentially over multiple different s2s links)

  256. Zash

    (me says nothing about mptcp)

  257. Zash

    So what Ge0rG said about slowing down s2s links?

  258. flow

    I did not read the full backlog, could you summarize what Ge0rG said?

  259. flow

    (otherwise I have to read it first)

  260. Zash

    13:31:21 Ge0rG "Holger, Zash: we could implement per-JID s2s backpressure"

  261. flow

    besides, aren't in MPTCP still only two endpoints involved (but using potentially multiple paths)?

  263. flow

    I am not sure if that is technically possible; the "per-JID" part here alone could be tricky

  264. flow

    it appears that implementing backpressure would likely involve signalling back to the sender, but what if the path to the sender is also congested?

  265. Zash

    I'm not sure this is even doable without affecting other users of that s2s link

  266. flow

    as of now, the only potential solution I could come up with is keeping the state server-side and having servers notify clients when the state changes, so that clients can sync whenever they want, and especially as fast or slow as they want

  267. flow

    but that does not solve the problem for servers with poor connectivity

  268. jonas’

    let’s change xmpp-s2s to websockets / http/3 or whatever that supports multiple streams, which will of course solve the scheduling issue of streams competing for resources and not at all draw several CVE numbers in the process :)

  269. Zash

    Not impossible to open more parallel s2s links...

  270. jonas’

    one for each JID? :)

  271. jonas’

    one for each local JID? :)

  272. Zash

    Heh, you could open a secondary one for big bursts of stanzas like MUC joins and MAM ....

  273. Zash

    Like I think there were thoughts in the past about using a secondary client connection for vcards

  274. jonas’

    haha wat

  275. Zash

    Open 2 c2s connections. Use one as normal (presence, chat etc. there), except send some requests, like for vcards, over the other one, since those often contain big binary blobs that then wouldn't block the main connection :)

  276. pulkomandy

    Well… at this point you may start thinking about removing TCP (its flow control doesn't work in this case anyway) and doing some kind of XML over UDP instead?

  277. Zash

    At some point it stopped being XMPP

  278. moparisthebest

    QUIC solves this