XSF Discussion - 2021-05-08

L29Ah 05:19:15
is it me, or it's counter-productive to use XEP-0198: Stream Management on flaky connections? without XEP-0198: you are offline, so the messages for you are conveniently stored indefinitely and relayed to your client when it goes online, and your frens know that you're offline so they shouldn't expect an immediate reply with XEP-0198: you are offline, but for XMPP purposes you are "online" for 10 minutes or so after you disconnected, the messages sent to you in the meantime are moved into /dev/null in case you don't reconnect in time, and your frens are confused
Daniel 05:22:48
The messages don't end up in /dev/null
L29Ah 05:38:57
Daniel: where they do end up then? they certainly don't get relayed to the recipient when it goes online like regular "offline" messages
Daniel 05:39:52
Depending on configuration they get returned to the sender with an error message or they end up in the offline queue
jonas’ 05:45:56
L29Ah, your assumption for the non-198 case is not correct
jonas’ 05:46:09
the common case on a flakey connection is that the connection looks alive to the server
jonas’ 05:46:16
for several minutes, maybe longer
jonas’ 05:47:00
any message during that interval will *actually* end up in /dev/null (barring MAM, Carbons, but those would also apply in your with-198 case), because the server has no way of figuring out whether or not they were delivered to your resource (no acking mechanism)
jonas’ 05:47:57
i.e. thanks to the Two Generals problem, the server cannot know for sure when your connection got interrupted exactly. 198 does not solve two generals (obviously), but the approximation of the real state is better due to the explicit acking of received stanzas
jonas’ 05:49:54
so with-198 is better in that regard because: - *if* a flakey connection allows you to reconnect within the 10 minute timeframe, you can resume without any lost state - the server gets a better approximation of what got delivered and what not and can make better decisions based on that regarding rerouting/offline store etc. (not that either of the two is really relevant with MAM/Carbons) but also IQ error replies (which are a nicety for other entities)
jonas’ 05:50:42
the only downside is the potentially fake online state while the connection is resumable; given that most clients do not show the online state prominently anymore anyway, I wouldn’t say it’s much of a bother
jonas’ 05:51:19
(sidenote: some implementations will prolong the '198 "hibernation" lifetime if the client has registered for push notifications to the time of the next push notification + some interval)
menel 05:52:49
It's only a problem if the server hides the error and has no mam and offline storage. Hopefully nobody configures it like that.
L29Ah 05:54:43
> because the server has no way of figuring out whether or not they were delivered to your resource (no acking mechanism) no, the server can ask the OS how many bytes had the client ACKed
L29Ah 05:56:06
https://www.ejabberd.im/faq/tcp/ indeed i can ask ejabberd to save the messages instead of losing them; the lose-by-default behavior looks insane to me
L29Ah 05:56:13
thanks
menel 06:30:55
L29Ah: don't you use mam? That would solve the problem for you
L29Ah 06:33:42
no
L29Ah 06:34:15
and i'd prefer it to be solved for everyone, not just me, tbh
L29Ah 06:34:32
MAM is cumbersome to implement so we won't see it everywhere ever
Holger 07:24:41
Oh so you cross posted. I responded in the ejabberd room.
Holger 07:26:07
But yes the proper solution is MAM. You can't really implement reliable delivery without (as I told you the other day already).
Holger 07:27:40
> https://www.ejabberd.im/faq/tcp/ indeed i can ask ejabberd to save the messages instead of losing them; the lose-by-default behavior looks insane to me I explained the reasoning for the default behavior in that very article. (Which was written back in the days without MAM in mind. MAM solves that crap.)
L29Ah 07:31:31
Holger: the reasoning asserts that losing a message is better than sending it twice
Holger 07:32:07
Returning a proper error message is different from silent loss.
L29Ah 07:32:30
it is effectively a loss in case the sender is no longer around
Holger 07:33:02
But yes bouncing an error is better than potentially large bursts of duplicates.
Holger 07:33:09
The latter is terrible UX.
Holger 07:33:37
Whatever. MAM is enabled by default.
Holger 07:34:28
~~I'm not so motivated the relative terribleness of different non-working workarounds.~~ ✎
Holger 07:34:37
I'm not so motivated to discuss the relative terribleness of different non-working workarounds. ✏
Holger 07:35:14
If you, as an ejabberd admin, prefer a different behavior, I gave you the config settings to do that.
Holger 07:35:55
If there was a behavior that was strictly better (no downside) it wouldn't need those config knobs.
flow 07:36:36
> no, the server can ask the OS how many bytes had the client ACKed a TCP level ack does not automatically imply that the data was processed on the application level
flow 07:37:05
L29Ah, ↑
Holger 07:37:18
Right. And it's not available on all platforms. And terrible to implement. That's definitely not the proper solution to this issue.
L29Ah 07:37:22
flow: sure, but in real world scenarios it's virtually always the case
L29Ah 07:37:43
and using TCP ACKs is strictly better than silently discarding messages
Daniel 07:37:43
Just be aware of the implications and do what ever works for you
flow 07:37:44
L29Ah, maybe, but it's fragile and the assumption is just not correct
Daniel 07:38:07
I have a deployment where we rely on offline messages only and have fairly low sm timeouts
Daniel 07:38:13
Like 60s or so
flow 07:38:47
so I am not even sure if TCP acks are better, but i think we at least agree that application level acks is what you want
Holger 07:39:13
The relative trade offs of non-MAM solutions strongly depend on whether you support multi device.
flow 07:39:22
that ↑
Zash 17:42:21
Did anyone feel inclined to write "XMPP Service Discovery Best Practices" ?
MattJ 21:20:09
Did anyone feel inclined to write a list of XEPs that need writing?
Zash 21:52:09
Or a list of people who could write lists of XEPs that need writing?
edhelas 21:56:28
Zash, you're responsible of that list then, problem solved