jdev - 2020-04-17

lovetox 07:40:16
hm lib dev question
lovetox 07:40:25
if a lib has a JID object, and has a getBare() method
lovetox 07:40:45
what should it return if the jid is only a domain for example "asd.com"
lovetox 07:40:55
would be wrong to return asd.com
lovetox 07:41:09
should it raise an exception, NoBareJID or something?
Link Mauve 07:41:36
lovetox, why would it be wrong? asd.com is a valid bare JID.
lovetox 07:41:36
or is the localpart not mandatory on a bare jid
lovetox 07:41:43
ah ok nice
Link Mauve 07:41:48
asd.com/foo is a valid full JID.
lovetox 07:41:50
so barejid, is just without resource
Link Mauve 07:41:55
Yes.
lovetox 07:42:00
ah k thanks
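The rule agreed on above (a bare JID is simply the JID without its resource, and the localpart is optional) might look like this in a library. This is a minimal sketch: the `JID` class and its method names are hypothetical, and it skips the string preparation and validation a real implementation needs.

```python
from typing import NamedTuple, Optional


class JID(NamedTuple):
    localpart: Optional[str]
    domain: str
    resource: Optional[str]

    @classmethod
    def parse(cls, text: str) -> "JID":
        # Split off the resource first: it may itself contain "@" or "/".
        rest, slash, resource = text.partition("/")
        local, at, domain = rest.partition("@")
        if not at:
            # No "@" present, so the whole remainder is the domain.
            local, domain = None, rest
        return cls(local or None, domain, resource if slash else None)

    def bare(self) -> "JID":
        # Bare JID: same address with the resource dropped, nothing else.
        return self._replace(resource=None)

    def __str__(self) -> str:
        out = self.domain
        if self.localpart is not None:
            out = f"{self.localpart}@{out}"
        if self.resource is not None:
            out = f"{out}/{self.resource}"
        return out
```

With this, `JID.parse("asd.com").bare()` is just the domain JID again, so no exception is needed for the domain-only case.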
jonas’ 08:42:15
that ^
flow 09:08:34
lovetox, http://jxmpp.org/releases/0.7.0-alpha5/javadoc/org/jxmpp/jid/Jid.html https://github.com/igniterealtime/jxmpp#jxmpp-jid
lovetox 13:47:14
how likely is it that 2 different hash mechanisms produce the same hash
lovetox 13:47:25
like entity caps its mostly sha-1
lovetox 13:47:33
but theoretically you can also use something else
lovetox 13:47:59
right now i store the hash mechanism beside the hash
Link Mauve 13:48:21
lovetox, XEP-0390 fixed this issue AFAIK.
lovetox 13:48:33
but i wonder if i can just not store the hash mech
jonas’ 13:49:32
lovetox, it is unlikely. but it’s stupid to not store the mechanism, too.
jonas’ 13:49:39
it only costs a few bytes
Link Mauve 13:51:00
Can get down to a single byte if you convert the string into an enum.
Link Mauve 13:51:23
Assuming you know the mechanisms you can use.
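Storing the mechanism as a one-byte enum next to the digest could be sketched as follows. The numeric values and the list of supported algorithms are assumptions made for the example; nothing in the discussion fixes them.

```python
import enum
from typing import Tuple


class HashAlgo(enum.IntEnum):
    # Hypothetical numbering; only needs to be stable within one cache.
    SHA_1 = 1
    SHA_256 = 2
    SHA_512 = 3


_BY_NAME = {
    "sha-1": HashAlgo.SHA_1,
    "sha-256": HashAlgo.SHA_256,
    "sha-512": HashAlgo.SHA_512,
}


def pack_key(algo_name: str, digest: bytes) -> bytes:
    # One leading byte identifies the mechanism, the rest is the digest.
    # Unknown mechanisms raise KeyError instead of being stored blindly.
    return bytes([int(_BY_NAME[algo_name])]) + digest


def unpack_key(key: bytes) -> Tuple[HashAlgo, bytes]:
    return HashAlgo(key[0]), key[1:]
```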
jonas’ 13:51:33
there was a thing...
lovetox 13:51:34
its not about storage cost
lovetox 13:51:43
i have a 115 cache
lovetox 13:52:00
which consists of hashmethod/hash -> discoinfo
lovetox 13:52:25
but then there are entities we have to query and cache which provide no hash or have no presence at all
lovetox 13:52:30
like for example a muc
lovetox 13:52:41
i need to also store these disco info data
lovetox 13:52:47
but its hard to use the same cache
lovetox 13:53:00
because for muc i need a JID -> DiscoInfo kind of cache
lovetox 13:53:55
and that i need 2 different caches for discoinfo data .. somehow i dont like it
flow 13:54:03
but that's how it is
jonas’ 13:54:23
lovetox, in aioxmpp, we listen on presence info and prime the JID -> DiscoInfo cache from the hash -> discoinfo cache
flow 13:54:34
or, are you thinking about a generic String→ disco#info cache?
jonas’ 13:54:39
that means that discoinfo lookups themselves only ever use the JID -> DiscoInfo cache
flow 13:54:45
guess one could do that, but I personally wouldn't do so
jonas’ 13:54:57
which is pre-populated on the fly by the entitycaps component which listens for presence
flow 13:55:32
especially since you could persist the caps cache to disk, while the jid→disco#info cache is somewhat unreliable and should have a sensible timeout (smack uses 24h)
lovetox 13:55:35
ah nice so when you need caps, you always access only the JID -> Disco info cache
lovetox 13:55:43
thats exactly what i was searching for
jonas’ 13:55:44
lovetox, correct
lovetox 13:55:47
thanks
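The arrangement jonas’ describes, where lookups only ever touch the JID -> DiscoInfo cache and the caps cache merely primes it on presence, could be sketched like this. It is not aioxmpp's actual code: the class and method names are hypothetical, `DiscoInfo` is reduced to a frozenset of feature strings, and verifying that a disco result really matches the advertised hash is left out.

```python
from typing import Dict, FrozenSet, Optional, Tuple

DiscoInfo = FrozenSet[str]  # stand-in: just the feature strings


class DiscoCaches:
    def __init__(self) -> None:
        self.by_caps: Dict[Tuple[str, str], DiscoInfo] = {}
        self.by_jid: Dict[str, DiscoInfo] = {}

    def on_presence(self, jid: str, algo: str, ver: str) -> bool:
        """Prime the JID cache from the caps cache.

        Returns False when the hash is unknown and a query is needed.
        """
        info = self.by_caps.get((algo, ver))
        if info is None:
            return False
        self.by_jid[jid] = info
        return True

    def on_disco_result(self, jid: str, info: DiscoInfo,
                        caps: Optional[Tuple[str, str]] = None) -> None:
        # Entities without presence (a MUC, say) only land in by_jid.
        self.by_jid[jid] = info
        if caps is not None:
            self.by_caps[caps] = info

    def lookup(self, jid: str) -> Optional[DiscoInfo]:
        # Consumers never look at the caps cache directly.
        return self.by_jid.get(jid)
```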
flow 13:56:57
jonas’, does that jid→disco#info cache only include values obtained via caps?
jonas’ 13:57:02
flow, no
flow 13:57:10
or do you also put in results of actual disco#info requests?
jonas’ 13:57:23
in fact, the jid->disco#info cache is in reality a JID -> Future(disco#info) cache
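A JID -> Future(disco#info) cache has the nice property that concurrent callers share one in-flight query. The following is a rough sketch of that idea with asyncio, not a copy of any library's code; `query` is a hypothetical coroutine function supplied by the caller, and failed futures are not evicted here.

```python
import asyncio
from typing import Any, Awaitable, Callable, Dict


class FutureDiscoCache:
    def __init__(self, query: Callable[[str], Awaitable[Any]]) -> None:
        self._query = query  # hypothetical: sends the disco#info IQ
        self._futures: Dict[str, "asyncio.Future[Any]"] = {}

    def get(self, jid: str) -> "asyncio.Future[Any]":
        # Must be called from within a running event loop.
        fut = self._futures.get(jid)
        if fut is None:
            # First caller starts the query; later callers get the
            # same future, whether it is still pending or already done.
            fut = asyncio.ensure_future(self._query(jid))
            self._futures[jid] = fut
        return fut
```

A caps component can also place an already resolved future into the same mapping, which gives the priming behaviour discussed above.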
lovetox 13:57:39
yes thats the goal flow, often i have to disco info instances who dont have presence
lovetox 13:57:53
and i save that in a cache and store to disk
jonas’ 13:58:29
lovetox, I’m not sure storing the jid->disco#info cache on disk is a good idea
flow 13:58:35
store to disk ephemeral data?
lovetox 13:58:39
for example a transport which im not subscribed to, if i load my roster i want to know the identity type so i can display icons that match
jonas’ 13:58:39
that ^
jonas’ 13:58:58
right, but treat it as stale and look it up at some point if you’ve loaded it from disk
lovetox 13:59:11
of course, you have a last seen attr and periodically request again
jonas’ 13:59:21
:+1:
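The "last seen plus periodic refresh" idea can be sketched as a cache that still returns old data but flags it as stale. The 24h default mirrors the Smack timeout flow mentions; the class itself and the injectable `clock` are assumptions for the example.

```python
import time
from typing import Any, Callable, Dict, Optional, Tuple

TTL_SECONDS = 24 * 60 * 60  # same order as the 24h mentioned for Smack


class TimedDiscoCache:
    def __init__(self, ttl: float = TTL_SECONDS,
                 clock: Callable[[], float] = time.time) -> None:
        self._ttl = ttl
        self._clock = clock
        self._entries: Dict[str, Tuple[Any, float]] = {}

    def store(self, jid: str, info: Any) -> None:
        # Record when the answer was received (the "last seen" attr).
        self._entries[jid] = (info, self._clock())

    def lookup(self, jid: str) -> Tuple[Optional[Any], bool]:
        """Return (info, is_stale); unknown JIDs are (None, True)."""
        entry = self._entries.get(jid)
        if entry is None:
            return None, True
        info, seen = entry
        return info, self._clock() - seen >= self._ttl
```

A client can show the roster icon from the cached entry right away and schedule a fresh disco#info request whenever `is_stale` is true, which also covers entries loaded from disk.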
lovetox 13:59:47
all disco info is stale the second you received it, so this has nothing to do with application restarts
jonas’ 13:59:58
sure, but the longer you wait, the staler it gets :)
flow 14:00:17
plus with caps+presence you will get push notifications if disco#info changes
jonas’ 14:00:25
flow, not for MUC services for example
flow 14:00:29
and can then react on that and invalidate your cache etc
jonas’ 14:00:29
which is what he’s talking about
lovetox 14:00:56
flow we are talking specifically about entities that have no presence
flow 14:01:13
I know. I just wanted to state that there is a difference between disco#info data with caps and that without
flow 14:01:38
for those cases like MUC, smack has a cache which expires its entries after 24h
flow 14:02:11
but actually, I am always pondering with that
flow 14:03:23
assume we one day live in a world where the xmpp service address will host most services (which I think is desirable for the services where it is possible). and your service operator updates the service, then you will potentially only become aware of that new feature after 24h
flow 14:03:31
stream:features to the rescue
jonas’ 14:05:03
it’d be nice to have 115/390-like push for services, too
flow 14:05:53
wouldn't that be the case if you are subscribed to the presence?
lovetox 14:06:04
for muc this somewhat exists
flow 14:06:11
they just have to add xep115/390 metadata
lovetox 14:06:13
its called notification of configuration change
lovetox 14:06:23
i always disco after it
lovetox 14:06:37
because the notification does not tell you what changed
jonas’ 14:06:44
flow, you’re assuming that all services expose presence
flow 14:06:51
so it is probably not a issue of a spec filling a hole, but implementations just doing that
flow 14:07:19
jonas’, well services need to know the subscribed entities to push 115/390 data to
jonas’ 14:07:26
that is true
jonas’ 14:07:36
I question whether that needs to be presence though
flow 14:07:36
and I think there is nothing wrong with just re-using presence for that?
flow 14:07:51
I can see the argument of a polluted roster
jonas’ 14:07:56
I think there’s some reason in not using presence for this at all
flow 14:07:58
but I wouldn't buy it
jonas’ 14:08:00
that, too
jonas’ 14:08:26
not for service-to-client, but for client-to-client presence, it quickly gets expensive to have all that stuff in the presence stanza
jonas’ 14:08:40
so I wonder whether more fine-grained notification models wouldn’t make more sense
flow 14:10:10
hmm I'm sorry I can't follow, we were talking about services using caps to "push" disco#info to interested parties, and now you switch to c2s presences?
jonas’ 14:10:21
flow, yes
jonas’ 14:10:34
I’m questioning using presence for caps
jonas’ 14:10:38
because presence is overloaded
jonas’ 14:10:42
this is mostly relevant for c2s presences
jonas’ 14:10:48
(or, more specifically, for client-to-client presences)
flow 14:10:59
so you basically want to re-open the old discussion between peter and phippo about presence floods? ;)
jonas’ 14:11:06
maybe :)
jonas’ 14:11:29
exactly
jonas’ 14:11:35
that’s why *not* having this in presence would be good
flow 14:11:54
I mean clients do not change their disco#info often, do they?
flow 14:12:47
please elaborate, because I think having a client specific caps in presence is potentially the only thing sensible in presence these days
jonas’ 14:13:02
avatar hashes exist too
jonas’ 14:13:12
GPG also
flow 14:13:18
well those should probably go in PEP
jonas’ 14:13:22
exactly
flow 14:13:24
and openpgp is already in PEP
jonas’ 14:13:30
I still see it in presence stanzas
flow 14:13:36
(if you use a modern XEP ;;)
jonas’ 14:13:37
just like avatar hashes
jonas’ 14:13:40
yeah, well
jonas’ 14:14:02
the question is, if we’ve gone through the effort to move this rarely-changing data out of <presence/>, shouldn’t we also move that other rarely-changing data (ecaps) out of presence?
jonas’ 14:14:06
what is the rationale for keeping it there?
flow 14:14:10
ok, so from a protocol PoV we are fine (at least in these areas), seems to be more of an implementation-is-missing issue
flow 14:14:55
jonas’, ahh, I was not thinking that the frequency of change should be a criterion here
flow 14:15:10
I was more thinking of "is this client specific" as criterion
jonas’ 14:15:19
aha
flow 14:15:26
I don't think you want different avatars for different clients
jonas’ 14:15:28
I was coming from the "rarely-changing data in a stanza which is often sent is a waste of bandwidth" angle
flow 14:15:40
(course, if you ask enough people, some people will say they want this…)
jonas’ 14:15:50
yeah, the per-client-ness is an argument pro presence
jonas’ 14:16:17
though we’ve already had enough arguments for the case that per-client caps are rarely useful and most of the time you’ll need something like an aggregated caps over all clients of the user (both min and max aggregates)
jonas’ 14:16:40
and those caps could be distributed by the server in a non-presence vehicle and also contain full caps hashes for the individual clients which are currently online.
flow 14:17:18
yes, but, even if per-client caps are rarely useful, which I do not know if I would agree with, I do not see this as an argument to remove them
flow 14:17:44
of course, what we have discussed regarding per-account caps still appears desirable
flow 14:17:48
and we should move towards that
jonas’ 14:18:27
even if per-client caps are useful (which they sometimes are, I agree), the question is whether they belong in presence
flow 14:18:32
and maybe, just maybe, we will discover that per client caps are no longer needed, but then they will probably vanish automatically
flow 14:19:02
yes, but I am not sure if this is the question we should answer right now
flow 14:19:17
it appears as something deeply baked into the core of xmpp
jonas’ 14:19:26
not really, it’s in '115
jonas’ 14:19:30
that’s not that deep
flow 14:19:48
and since they rarely change, i feel like it is not worth any effort getting rid of them
flow 14:20:46
but if you want to work on a spec which puts those into something else (PEP?), then please do
jonas’ 14:20:46
no, that they rarely change is a reason to move them out of presence
jonas’ 14:21:03
if they changed "sometimes", presence would be a good place. if they changed "often", presence would be a terrible place.
jonas’ 14:21:16
if they change "rarely", they are dead weight in most of the presence broadcasts which happen
jonas’ 14:21:27
(of course, if they change "often", they cause unnecessary presence broadcasts, which is arguably much worse)
jonas’ 14:21:43
flow, yeah, I’ve been pondering that for '390, which is why I take the opportunity to discuss this
flow 14:22:28
ahh you are not worried about caps triggering additional presence broadcasts, but the mere existence of caps in every presence
jonas’ 14:22:45
exactly
flow 14:23:00
tbh i never thought of this as something of a heavy burden
jonas’ 14:23:10
it gets heavier when you introduce hash agility (like '390) and modern hash functions
flow 14:23:13
I personally wouldn't invest time to improve here
jonas’ 14:23:13
stuff gets more and longer
flow 14:23:34
true, but I do not think that we will change hashes often
jonas’ 14:23:43
I’m not so sure of that
flow 14:23:50
sha1 has served us well for what? a decade or so?
jonas’ 14:23:59
and even if we don’t change them *often*, the transition period may well be a decade of sending two hashes with each presence
jonas’ 14:24:02
because oldstable
flow 14:24:18
obviously
flow 14:24:41
but if I had to guess i'd say we see 4-5 caps variants per presence at most
jonas’ 14:24:52
which is quite a lot
flow 14:25:22
which surely is not optimal, but something I could sleep with
flow 14:26:04
jonas’, if you really want to reduce wire bytes invent an XML extension where the end element is always empty ;)
jonas’ 14:26:18
ITYM EXI
Syndace 14:26:57
alright this is getting WAY too close to what we discussed for OMEMO just minutes ago, I have to jump in with something slightly off-topic
Syndace 14:28:24
We have the problem that for OMEMO, you subscribe to a PEP node of each of the contacts you want to encrypt with. And then we're flooded with PEP updates on each connect, because PEP sends an update automatically on connect (right?). We were thinking about compressing stuff with EXI 😀
jonas’ 14:28:45
Syndace, EXI needs to be done on the stream level
flow 14:28:50
Syndace, a common pattern is to split PEP data into a hash and the actual data
jonas’ 14:29:02
it would be cool if PEP services could do that
jonas’ 14:29:06
that would solve the race condition issues around that
Syndace 14:29:07
Nah, EXI is just compression for XML, not talking about the EXI XEP but the EXI technology
flow 14:29:10
that way you only get the hashes on connect, which may already help a lot
jonas’ 14:29:26
Syndace, EXI generates binary output though, I’m not sure you don’t lose 99% of the advantages if you have to wrap it in base64 again.
Syndace 14:29:28
Yeah that would actually help a lot
flow 14:29:46
not sure if this is possible in your case though, would need to have a closer look
jonas’ 14:29:54
and if every client comes with an EXI implementation, we’re half way to being able to use EXI on c2s, which would be amazing
Syndace 14:31:01
Did a bit of research on available EXI implementations. It doesn't look super good but there are open implementations for C, Java, JavaScript and C#, though I can't say anything about the quality of those. Seem maintained at least.
jonas’ 14:31:18
I hadn’t checked that far yet
jonas’ 14:31:25
I only checked if libxml supports it (which it doesn’t)
flow 14:31:27
as much as I like EXI, I am sceptical whether this is the right solution to your problem
Syndace 14:31:29
flow I think that should be possible for OMEMO device lists, I'll mention it as a possible solution, thanks 🙂
Syndace 14:31:50
EXI could be used to compress bodies in general too, not only for the PEP node content
jonas’ 14:31:53
we should really discuss letting PEP services hash the contents of nodes
Syndace 14:32:10
And we encrypt the bodies as binary data anyway (SCE), so we don't have to base64 stuff there
jonas’ 14:32:11
but that’d require c14n, and nobody wants to go near that in XML :)
jonas’ 14:32:34
Syndace, yes
flow 14:32:37
Syndace, tell them the OpenPGP XEP says hello (that is where the idea came from)
Syndace 14:32:59
...should really read the openpgp xep again 😀
flow 14:33:06
jonas’> we should really discuss letting PEP services hash the contents of nodes +1
Syndace 14:33:16
yeah that would be awesome
flow 14:33:29
a generic optional mechanism where the push only contains "a hash" would be nice
flow 14:33:45
could potentially be as easy as s/+notify/+notify-hash/
flow 14:34:27
great, now I wanna write a new protoxep
flow 14:34:50
when I actually wanted to go into the hammock
Martin 14:35:19
You can't do both? Write in the hammock?
jonas’ 14:35:28
flow, go ahead
Syndace 14:41:42
> could potentially be as easy as s/+notify/+notify-hash/ damn that sounds soo good
jonas’ 14:42:36
+notify-hash-hash-hash-hash-hash-ha…
larma 15:23:39
flow, why not versioning instead of hash
larma 15:24:17
after all, a few hundred hash nodes that are the same as last connect also seems kind of wasted...
jonas’ 15:25:15
larma, what would the advantage be?
larma 15:28:30
jonas’, well if I have >100 contacts, each of them use(d) omemo,avatar,microblog,... and when connecting I +notify-hash, I still receive a few hundred hashes. And often enough those are just the same hash as last time I connected.
larma 15:28:56
With some versioning scheme it could be done such that I only get the changes since last connect
jonas’ 15:29:17
larma, right
jonas’ 15:29:27
I thought versioning per node
jonas’ 15:29:32
where you wouldn’t win anything over hashes
jonas’ 15:29:43
versioning per contact or globally (by your local server) would win of course
larma 15:29:49
true, yeah was considering global versioning
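Global versioning, as larma and jonas’ discuss it, would let a reconnecting client ask only for what changed since the version it last saw. No such protocol is specified in this conversation; the sketch below only illustrates the bookkeeping a server might do, and every name in it is hypothetical.

```python
from typing import Dict, List, Tuple

NodeKey = Tuple[str, str]  # (owner JID, node name)


class VersionedPepLog:
    def __init__(self) -> None:
        self.version = 0
        # Only the latest item per node matters for catching up.
        self._latest: Dict[NodeKey, Tuple[int, str]] = {}

    def publish(self, owner: str, node: str, item_id: str) -> int:
        # Every publish anywhere bumps the single global counter.
        self.version += 1
        self._latest[(owner, node)] = (self.version, item_id)
        return self.version

    def changes_since(self, version: int) -> List[Tuple[int, NodeKey, str]]:
        # Nodes untouched since `version` produce no traffic at all,
        # which is the gain over sending one hash per node on connect.
        return sorted(
            (v, key, item)
            for key, (v, item) in self._latest.items()
            if v > version
        )
```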
Syndace 15:29:50
+notify-hash-$LASTHASH
Syndace 15:29:55
😀
Zash 15:30:14
Uh, what have I missed here‽
jonas’ 15:30:30
Syndace, you do realize that that immediately causes a loop? :)
jonas’ 15:30:40
ah, no, not a loop
jonas’ 15:30:45
but still terrible things™
Syndace 15:30:54
no I don't actually?
larma 15:31:10
Syndace, that wouldn't work because what would you hash, after all you receive multiple different nodes based on that notify
jonas’ 15:31:13
Syndace, it’s part of your ecaps hash ;)
jonas’ 15:31:42
it doesn’t cause a loop -- that was a mistake on my side -- but it still does fun things. since every time you receive a pep update, your ecaps hash would change and you’d have to reemit presence
Syndace 15:32:01
oh right, you +notify for the node and not for each contact you want notifications from
Syndace 15:33:04
jonas' I wasn't thinking about updating the hash during runtime, just setting it to the last hash you saw before disconnecting last time. Only to avoid the one single automatic update that you receive on connect.
lovetox 16:15:25
hm are you aware that +notify-hash would change your disco info
lovetox 16:15:58
just saying that would flood me with disco info requests every time something changes in pep
lovetox 16:16:32
and this brings me to the topic that +notify is really bad in disco info
lovetox 16:16:44
i cant deactivate a feature without changing my disco info
lovetox 16:17:42
but on the other hand its really the most easy way to communicate to remote servers what you want hmm
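Why any change to the feature list (adding or dropping a `+notify` feature, or embedding a changing value in one) forces new presence can be seen from how the XEP-0115 verification string is built. The sketch below follows the XEP-0115 generation steps for identities and features only; extended data forms and the spec's validity checks are omitted, and the feature names in the usage note are made up.

```python
import base64
import hashlib
from typing import Iterable, Tuple

Identity = Tuple[str, str, str, str]  # (category, type, lang, name)


def caps_ver(identities: Iterable[Identity],
             features: Iterable[str]) -> str:
    parts = []
    # Identities and features are sorted, each terminated by "<".
    for category, type_, lang, name in sorted(identities):
        parts.append(f"{category}/{type_}/{lang}/{name}<")
    for feature in sorted(features):
        parts.append(f"{feature}<")
    digest = hashlib.sha1("".join(parts).encode("utf-8")).digest()
    return base64.b64encode(digest).decode("ascii")
```

For example, `caps_ver(ident, ["urn:example:devices+notify"])` and the same call with a second `+notify` feature added give different strings, so the client has to rebroadcast presence and peers have to look up the new hash.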
Syndace 16:19:21
"hash" means the word "hash" here, not any actual value. so you'd put "+notify-hash" in disco info exactly the same way you put "+notify" there now. so no disco info change everytime something changes in pep.
lovetox 16:20:00
then i missed something
lovetox 16:20:10
how does the server know on what version i am?
Syndace 16:21:08
I don't think there even was consensus on doing versioning
Syndace 16:21:21
just a simple hash of the node content
Syndace 16:21:24
nothing more, nothing less
lovetox 16:21:33
where is the hash of the node content
lovetox 16:21:43
sent with the pep notification?
Zash 16:21:47
hash of what?
Syndace 16:21:57
hash of the pep node content
Zash 16:21:58
what normalization?
Zash 16:22:04
what c14n?
Syndace 16:22:23
whatever flow thinks of 😀
Syndace 16:22:44
> sent with the pep notification? yeah. instead of getting the content in the notification, you get a hash of the content.
lovetox 16:22:56
so instead of the actual data i get hundreds of pep notifications that contain a hash
Syndace 16:23:00
reduced bandwidth and if you need to know the actual content you can manually query
lovetox 16:23:08
and i have to query more if its not the hash that i have?
Syndace 16:23:14
yup, instead of 100 device lists, 100 hashes
Zash 16:23:14
Isn't something like this in XEP-0060?
lovetox 16:23:26
yes thats already in there
Syndace 16:23:27
(for example)
Zash 16:23:31
notifications without payloads at least
lovetox 16:23:31
its called omit the payload
Syndace 16:24:10
how does that work with the first update you get when you connect to the server and +notify?
lovetox 16:24:27
you get a notification just with the item-id without payload
lovetox 16:24:35
the item-id could be your version or hash
Zash 16:25:01
cf xep84
lovetox 16:25:25
but really thats not worth it for something like a device list on omemo
Zash 16:25:29
I've actually wondered why '84 doesn't just use payloadless notifications instead of a separate node
lovetox 16:25:29
the payload is small anyway
Syndace 16:25:49
lovetox well, the node can contain user-defined labels now
Syndace 16:26:01
so it can be a few times bigger than in legacy omemo
Syndace 16:26:10
for a few 100 contacts that adds up
Syndace 16:27:16
at least larma said that the device list notifications already make up a considerable portion of the traffic on connect
larma 16:28:44
It's probably not the only thing, but it's definitely visible
larma 16:29:08
haven't actually calculated how much bytes it makes
lovetox 16:29:14
you can reduce the payload
lovetox 16:29:32
but this does not change the fact that pubsub in its current form is just not scalable
lovetox 16:29:42
it works nice until you reach X users in your roster
lovetox 16:29:46
then it becomes a burden
larma 16:29:56
you mean pep, not pubsub
lovetox 16:30:16
pep is a subset of pubsub
Syndace 16:30:21
payloadless notifications actually sound pretty cool, we'd have to set the item id to something with meaning though, like a hash of the content
Syndace 16:30:47
we use "current" at the moment
lovetox 16:31:20
then you need to configure the node to max-items=1
Syndace 16:31:29
and do payloadless notifications work with PEP notifications?
lovetox 16:31:33
which we sidestep with "current" right now
lovetox 16:32:03
Syndace, its a node configuration, and you can configure the default node just to enable it
lovetox 16:32:12
but this would probably break every client
Syndace 16:32:20
heh 😀
Syndace 16:33:06
sounds like something that might be a solution for OMEMO but I don't know enough about PEP/pubsub to push that idea forward
lovetox 16:33:30
what we really would need is a smart idea how we can avoid notifications altogether if we already received it
lovetox 16:33:49
Syndace, you say solution like there is a problem
Zash 16:33:49
server-side device tracking?
Zash 16:33:59
pubsub-since?
lovetox 16:34:06
omemo and all other pep based xeps work fine
lovetox 16:34:15
its just not scalable indefinitely
Zash 16:34:33
it doesn't have to tho, humans don't scale that well either
Syndace 16:34:43
"problem" is a strong word, but e.g. we don't put the ik into the device list because it's too big, so you have to query the bundle manually for every new device.
lovetox 16:35:02
openpgp puts the ik into the notification
Syndace 16:35:10
and if everybody sets a huge label for all of their devices, you'll notice the traffic probably
lovetox 16:35:11
so its not like it isnt already done
lovetox 16:35:48
if the payload gets too big, you do what the other xeps do
lovetox 16:35:53
add a metadata node
lovetox 16:35:58
that tells you only the current version
lovetox 16:36:11
see 0084
Syndace 16:36:30
isn't that exactly what the hash approach would do?
lovetox 16:36:56
yes, my example can be implemented and works tomorrow
lovetox 16:37:05
yours need support in X server implementations first
lovetox 16:37:23
and the result is the same, one is just more elegant
Syndace 16:37:41
I think we're drifting away
lovetox 16:39:30
why? its exactly what you want, you subscribe to a metadata node, it always contains only the last hash or version
lovetox 16:39:45
and you define in the xep, if the version or hash is outdated, you request the payload node
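On the client side, the metadata-node scheme lovetox outlines boils down to comparing the announced version (or hash) with the cached one and fetching the payload node only on a mismatch. A minimal sketch, assuming a caller-supplied `fetch_payload` function that performs the actual pubsub item request; none of these names come from a XEP.

```python
from typing import Any, Callable, Dict, Tuple

NodeKey = Tuple[str, str]  # (contact JID, payload node)


class NodeMirror:
    def __init__(self, fetch_payload: Callable[[str, str], Any]) -> None:
        self._fetch = fetch_payload  # hypothetical: requests the payload node
        self._known: Dict[NodeKey, Tuple[str, Any]] = {}

    def on_metadata(self, jid: str, node: str, version: str) -> Any:
        """Handle a metadata notification carrying only a version or hash."""
        cached = self._known.get((jid, node))
        if cached is not None and cached[0] == version:
            # Same version as last time: no extra round trip.
            return cached[1]
        payload = self._fetch(jid, node)
        self._known[(jid, node)] = (version, payload)
        return payload
```

If `_known` is persisted across restarts, the on-connect notifications for unchanged contacts cost only the small metadata item each.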
Syndace 16:40:09
yeah sure, we talked about that for OMEMO
Syndace 16:40:22
I don't know why we're reiterating it now
lovetox 16:40:29
ok if im saying it now, i dont really see where the server could even help us here
Syndace 16:41:02
The server could create the hash for us on-the-fly, without the need for an extra metadata node
lovetox 16:41:21
but the extra node is on the server
lovetox 16:41:26
for the client nothing changes
Syndace 16:41:38
the client has to update the metadata node though
lovetox 16:41:42
he gets a notification with the hash, and requests afterwards a node
Syndace 16:41:46
when it publishes something
lovetox 16:41:57
yeah true it has to publish 2 things
lovetox 16:42:00
instead of one
lovetox 16:42:14
hardly worth a new xep and serverimpl though if you ask me :D
lovetox 16:42:56
its not like you publish daily devicelists
Syndace 16:42:57
if you do it manually, every XEP has to do it manually. If you can just subscribe to +notify-hash, every client can decide to do it without the XEP even mentioning the possibility
Syndace 16:43:37
> its not like you publish daily devicelists the problem is still the PEP update spam you get on connect
lovetox 16:43:38
yeah true, as i said its a bit more elegant
Syndace 16:43:45
yeah
lovetox 16:44:08
Syndace, you also get a pep update spam with notify-hash
Syndace 16:44:34
yes, but (in many cases) less :)
lovetox 16:44:52
yes as it would be if you use a metadata node :)
Syndace 16:44:54
less as in less bytes, not fewer updates
lovetox 16:45:23
but ok, if the server does it for us it is indeed elegant for the client
Syndace 16:46:42
yeah. And it's easier than payloadless (is it?), because we can keep using one item with id "current" and don't have to rely on max-items (why not?).
lovetox 16:47:50
the problem with payloadless is its a configuration on the node
lovetox 16:48:15
so you have to get all servers to have this configured
lovetox 16:48:22
which does not make much sense in other cases
Zash 16:48:24
`pubsub#deliver_payloads`
Zash 16:49:25
Hm, I wondered why something wasn't (also) a subscription option. Maybe this was it.
Syndace 16:49:40
but the device list is its own node, isn't it? so you could just set that for the device list?
Zash 16:50:22
Did you not kill the 1+n node scheme?
lovetox 16:50:42
Syndace, node configuration is not nice with client
Syndace 16:50:50
we have two nodes, one with the device list and one with the bundles
lovetox 16:50:53
first you have to pull the node configuration, then you have to set a new one
lovetox 16:50:57
then you have to publish
Syndace 16:50:59
the bundles used to be split into n
lovetox 16:51:33
this is theoretically possible with publish_options, but servers support this only partly
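The node configuration being discussed is submitted as a data form. As a rough illustration, the form that turns off payload delivery and limits the node to one item could be built like this; the helper name is hypothetical, and wrapping the form in the `<configure/>` or `<publish-options/>` element of an IQ is left to the XMPP library in use.

```python
import xml.etree.ElementTree as ET
from typing import Dict

DATA_NS = "jabber:x:data"
NODE_CONFIG = "http://jabber.org/protocol/pubsub#node_config"
PUBLISH_OPTIONS = "http://jabber.org/protocol/pubsub#publish-options"


def pubsub_form(options: Dict[str, str],
                form_type: str = NODE_CONFIG) -> ET.Element:
    x = ET.Element(f"{{{DATA_NS}}}x", {"type": "submit"})

    def add(var: str, value: str, hidden: bool = False) -> None:
        attrs = {"var": var}
        if hidden:
            attrs["type"] = "hidden"
        field = ET.SubElement(x, f"{{{DATA_NS}}}field", attrs)
        ET.SubElement(field, f"{{{DATA_NS}}}value").text = value

    # FORM_TYPE tells the service which kind of form this is.
    add("FORM_TYPE", form_type, hidden=True)
    for var, value in options.items():
        add(var, value)
    return x


form = pubsub_form({"pubsub#deliver_payloads": "0", "pubsub#max_items": "1"})
```

Passing `form_type=PUBLISH_OPTIONS` yields the publish-options variant, which avoids the separate fetch-and-set round trips where the server supports it.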
Zash 16:51:35
single device id item or one per device?
Syndace 16:51:55
single
Syndace 16:52:00
two nodes in total
Zash 16:52:03
lovetox, easily solvable, just make the Conversations Compliance checker cry loudly about it
Syndace 16:52:11
ah items, yeah one item with the list
Zash 16:52:18
Hm
Syndace 16:52:31
lovetox we already require setting pubsub#max_items
Syndace 16:52:40
so might also require the other thing
Syndace 16:53:55
Zash, I think PEP only notifies about the newest item? That's why we want the whole list to be one item.
Zash 16:54:29
Another unshaved yak :/
lovetox 16:54:34
Syndace, max items is supported by publish options on most servers
Syndace 16:54:38
Why? 😀
lovetox 16:54:40
other node configurations are not
Zash 16:55:29
If you could somehow ensure that you get all of the items, it'd be cleaner
Syndace 16:55:29
(the Why was @Zash)
lovetox 16:55:33
but yeah if the option is not set, its not bad
lovetox 16:55:36
then you get the payload
Zash 16:55:48
And then you could use retractions to indicate device removal
Zash 16:56:04
Cleaner mapping
lovetox 16:56:47
retractions are not sent on coming online
lovetox 16:57:14
the one thing per item approach is good for stuff on your own server
Zash 16:57:21
https://xmpp.org/extensions/xep-0312.html uses relative time? 😱️
lovetox 16:57:24
like bookmarks, which you want to request anyway on every start
flow 16:57:24
Syndace, FYI https://wiki.xmpp.org/web/XEP-Remarks/XEP-0373:_OpenPGP_for_XMPP
flow 16:58:01
so yes pubsub#deliver_payloads would be the way to go, the wiki page has a note about that feature being not discoverable though
Syndace 16:59:09
cool! thanks for the link.
flow 16:59:25
I think I had the split metadata and data scheme in mind because that is what works with any minimal PEP implementation
Zash 16:59:51
Like 84?
flow 17:00:19
Zash, searching for deliver_payloads in 84 yields no results
Zash 17:00:37
it has split metadata and data tho
flow 17:00:44
and since I don't have any detail of every protocol in mind, I would appreciate what exactly
flow 17:00:47
aah ok
flow 17:01:24
I also looked into my notes and found a todo item regarding OK about deliver_payloads
flow 17:03:26
Syndace> payloadless notifications actually sound pretty cool, we'd have to set the item id to something with meaning though, Do you really have to set the ID explicitly? Often it is enough to go with the pubsub service generated one
flow 17:04:04
soo good news, I don't have to write a protoxep, xmpp already provides what we need, we just have to implement it in services and clients
flow 17:04:16
and I can go in my hammock
flow 17:04:41
Zash, actually I wonder if that split should be declared an anti pattern
lovetox 17:05:41
before you consider something an anti pattern you should at least provide a different approach to reach the same goal
Zash 17:05:42
flow: Mmmm, borderline. I personally think the (old) OMEMO thing with 1+n nodes was worse. But if it works, it gets deployed.
Syndace 17:05:48
flow actually there is a small but meaningful difference between +notify-hash and payloadless: payloadless has to be configured on the node while +notify-hash can be used on any node if the client wants to
Syndace 17:06:30
+notify-payloadless would be amazing too
Zash 17:06:37
You could invent that
Zash 17:06:50
Would be easier if it was implemented as a subscription option tho :/
Zash 17:07:01
and specced as one
Syndace 17:07:31
and payloadless should probably be made optional with a disco feature to reflect the current state of server implementations
Zash 17:07:58
Is there a feature for it?
Syndace 17:08:26
> the feature is not discoverable, most likely because it appears to be mandatory by XEP-0060
Syndace 17:08:33
from https://wiki.xmpp.org/web/XEP-Remarks/XEP-0373:_OpenPGP_for_XMPP
Zash 17:09:29
Is there a feature for `pubsub#deliver_payloads` I mean
Syndace 17:11:28
if https://xmpp.org/registrar/disco-features.html is the list of features then no, can't find anything for "deliver_payloads"
Syndace 17:12:56
Anyway, the situation is quite clear, we can't rely on any of that for OMEMO. If we want to reduce the on-connect PEP update traffic, we have to manually specify some sort of metadata node.
Zash 17:13:45
Account level subscriptions + MAM? :)
Syndace 17:13:45
And I think we agreed that it's not worth the effort given that the device list node is rather small generally
Syndace 17:14:47
I don't think a hard dependency on MAM is a good idea just for that
Syndace 17:18:51
how does account level subscription work? you subscribe using '60 instead of +notify and then you receive updates as messages that are stored in MAM while you're offline?
Zash 17:20:03
Syndace, you subscribe your account, notifications are sent there and could /in theory/ be forked (instead of the origin sending to each +notify) and MAM'd
Zash 17:20:20
In practice those notifications will just be discarded because type=headline
Syndace 17:20:44
ah, right
Zash 17:21:54
Could be solved by some future magic XEP probably
Zash 17:22:04
IM NG might help actually
Zash 17:22:38
Also possible to configure notifications to be sent as type=chat, but that's a node config, not subscription option :(
Syndace 17:22:59
meh
Zash 17:23:08
More of these fancy things as subscription options would be awesome
Zash 17:23:22
So each subscriber decides
flow 17:59:47
Syndace, yes, but don't most XEPs, including OMEMO, already specify how nodes should be configured? So I am not sure how meaningful the difference is in this case
flow 18:01:53
I think we should probably tackle this from two angles: configuring the node to not deliver payloads *and* invent +notify-payloadless
larma 18:03:50
flow, how would you introduce +notify-payloadless to a federated network?
Zash 18:04:09
larma, haha .. :(
larma 18:04:33
well the problem is that it's not relying on your server to be updated, but on every server to be updated
Zash 18:04:35
larma, you do both +notify and +notify-payloadless and the receiver needs to ignore the former?
flow 18:05:07
larma, ^
larma 18:05:10
So as a client I get mixed responses, sometimes payloadless and sometimes not?
larma 18:05:21
depending on what the other end's server supports
flow 18:05:29
well, ideally the node is also configured to not deliver payloads
flow 18:05:40
I actually think that this should be enough
Zash 18:05:43
Alternatively, mandate local server magic
Zash 18:06:05
Your server could easily hash the content (/me laughs in xml c14n) and forward you the hash
flow 18:06:24
Zash, I don't think c14n is relevant here
larma 18:06:29
If we do local server magic, I'd rather go full server magic and do global pep versioning
Zash 18:06:31
Oh
Zash 18:06:42
I'm confusing the hash stuff with the payloadless stuff
Zash 18:06:44
nm me
Zash 18:07:05
Should be trivial for a server to strip the payload
flow 18:07:19
even there, do not think of it as a hash, but an Abstract (PubSub) Node Data ID
flow 18:07:38
for which it is true that if the node data changes, that abstract id changes too
Zash 18:07:40
I thought I saw some talk about hashing the payload data somehow
flow 18:07:53
but it is not important that "similar" xml leads to the same id
flow 18:08:05
in fact, if the data does not change, but there is a new id, that would be fine too
flow 18:08:37
one could even implement that Abstract (PubSub) Node Data ID as a counter
flow 18:10:09
i.e. it is the same id that XEP-0395 would use
Zash 18:10:09
Anyways, if a server sees a pubsub notification that includes a payload, but the client has set +notify-payloadless, then it should be easy to strip the payload and forward the rest
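A minimal sketch of that stripping step, assuming the server has the notification as an XML string and that +notify-payloadless (which is only proposed in this discussion, not specified anywhere) means "drop the children of every item, keep the id". The function name `strip_payloads`, the node name, and the `urn:example:devices` namespace are made up for illustration.

```python
import xml.etree.ElementTree as ET

PUBSUB_EVENT_NS = "http://jabber.org/protocol/pubsub#event"


def strip_payloads(event_xml: str) -> str:
    """Return the event with every <item/> emptied but its attributes kept."""
    root = ET.fromstring(event_xml)
    for item in root.iter(f"{{{PUBSUB_EVENT_NS}}}item"):
        # Copy the child list first, since we mutate the element while looping.
        for child in list(item):
            item.remove(child)
    return ET.tostring(root, encoding="unicode")


# Hypothetical notification carrying a payload.
notification = (
    "<event xmlns='http://jabber.org/protocol/pubsub#event'>"
    "<items node='example-devices-node'>"
    "<item id='current'>"
    "<devices xmlns='urn:example:devices'><device id='1'/></devices>"
    "</item>"
    "</items>"
    "</event>"
)

stripped = strip_payloads(notification)
print(stripped)
```

The item id survives, so whatever the publisher or service put there is still visible to the client.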
Zash 18:11:07
IIRC payloadless notifications still include the id, so if you stick a magic hash there you should be golden
flow 18:11:31
why is sticking a magic hash in there important?
flow 18:11:40
couldn't you just use the, you know, item id?
larma 18:12:40
flow, if item id is just 'current' all the time, that's not very helpful
flow 18:13:06
larma, it doesn't have to be that way
flow 18:13:19
~~isn't current just because singleton node?~~ ✎
Zash 18:13:24
yes
flow 18:13:26
isn't 'current' just because singleton node? ✏
larma 18:14:15
yeah, but then taking a hash instead of a random number is a good idea, because changing back and forth will result in same id so no unnecessary requests for those that were not online in between
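A rough illustration of that point, assuming the id is simply a SHA-1 hex digest over the serialized payload bytes (the choice of SHA-1 and the helper name `item_id_for` are assumptions, nothing in the discussion fixes them). Publishing list A, then B, then A again yields the first id again, so a client that cached A while offline has nothing to fetch.

```python
import hashlib


def item_id_for(payload: bytes) -> str:
    """Derive an item id from the payload bytes exactly as serialized."""
    return hashlib.sha1(payload).hexdigest()


# Hypothetical device lists, serialized the same way on every publish.
list_a = b"<devices><device id='1'/><device id='2'/></devices>"
list_b = b"<devices><device id='1'/></devices>"

id_a = item_id_for(list_a)
id_b = item_id_for(list_b)
id_a_again = item_id_for(list_a)  # device 2 came back

print(id_a == id_a_again)  # same content, same id
print(id_a == id_b)        # different content, different id
```

No canonicalization is attempted here: two serializations of "similar" XML would get different ids, which only costs an extra fetch.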
flow 18:14:19
now the question is whether it is possible to keep the singleton semantics but have different IDs for different items, which seems desirable anyway
Zash 18:14:44
yes, but you need max_items
flow 18:14:44
larma, it is a good idea without doubt, but it is not strictly required
flow 18:15:29
Zash, and we do not have max_items? or is there no issue?
Zash 18:15:57
I think we do, but there's some extra care involved
Zash 18:16:11
Older prosody doesn't, but also doesn't support >1 items so it's fine.
flow 18:17:41
so, use max_items=1, deliver_payloads=false, service generated item id, → $$$
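For reference, a sketch of what that configuration might look like as a XEP-0060 node configuration data form, built with the Python standard library. The helper `node_config_form` is hypothetical; the field names `pubsub#max_items` and `pubsub#deliver_payloads` are the XEP-0060 node config options. The service-generated item id is not part of the form: it is assumed to come from publishing the item without an `id` attribute.

```python
import xml.etree.ElementTree as ET

DATA_NS = "jabber:x:data"
NODE_CONFIG = "http://jabber.org/protocol/pubsub#node_config"


def node_config_form(options: dict) -> ET.Element:
    """Build a 'submit' data form carrying the given node config options."""
    form = ET.Element(f"{{{DATA_NS}}}x", {"type": "submit"})
    form_type = ET.SubElement(
        form, f"{{{DATA_NS}}}field", {"var": "FORM_TYPE", "type": "hidden"}
    )
    ET.SubElement(form_type, f"{{{DATA_NS}}}value").text = NODE_CONFIG
    for var, value in options.items():
        field = ET.SubElement(form, f"{{{DATA_NS}}}field", {"var": var})
        ET.SubElement(field, f"{{{DATA_NS}}}value").text = value
    return form


form = node_config_form({
    "pubsub#max_items": "1",         # singleton semantics
    "pubsub#deliver_payloads": "0",  # notifications carry only the item id
})
print(ET.tostring(form, encoding="unicode"))
```

Whether a given server honours `pubsub#deliver_payloads` on PEP nodes is exactly the open question raised earlier, so this only shows the request side.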
larma 18:17:47
can't we just build mam for pubsub instead, maybe using some shorthand +notify-archive which will cause the server to automatically subscribe to nodes and deliver updates from the archive when connecting?
Zash 18:17:59
🥇️
Zash 18:18:32
larma, any question including the word "just" automatically gets "no" for an answer
Zash 18:18:39
It's never "just" anything :P
larma 18:18:40
😀
flow 18:18:54
life would be boring if things were that easy
larma 18:19:08
It's not about doing something that's easy
larma 18:19:18
It's rather about doing something that's meaningful
Zash 18:20:10
larma, pubsub+mam has been mentioned in the past as the glorious saviour of everything, but it's lacking in actual specification or somesuch
Zash 18:20:21
lacking in "how should it even work?"
Zash 18:20:46
I think MattJ had some stuff to say about it recently
larma 18:20:54
replacing `<item id='current'><devices><device id='1' /><device id='2' /></devices></item>` with `<item id='b5fec669aca110f1505934b1b08ce2351072d16b' />` isn't really a huge improvement IMO
Zash 18:21:47
How many phones and computers do normal people even have?
larma 18:22:09
Sure it's some improvement, but it still means O(n*m) traffic on connect (n = number of contacts, m = number of features that use pep)
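A back-of-the-envelope version of that O(n*m) estimate, using the two item strings quoted above and counting only the item element, not the surrounding message stanza. The values n = 100 contacts and m = 5 PEP features are purely hypothetical.

```python
# Item strings as quoted in the discussion.
full_item = "<item id='current'><devices><device id='1' /><device id='2' /></devices></item>"
hash_item = "<item id='b5fec669aca110f1505934b1b08ce2351072d16b' />"


def on_connect_bytes(contacts: int, pep_features: int, item: str) -> int:
    """One notification per contact per PEP feature: n * m * item size."""
    return contacts * pep_features * len(item.encode("utf-8"))


N_CONTACTS = 100  # hypothetical
M_FEATURES = 5    # hypothetical

full_total = on_connect_bytes(N_CONTACTS, M_FEATURES, full_item)
hash_total = on_connect_bytes(N_CONTACTS, M_FEATURES, hash_item)
print(full_total, hash_total)
```

Either way the number of notifications stays n*m; only the per-item size shrinks.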
larma 18:22:39
I always calculate with 3, but I feel it's probably rather 1.7 or something
larma 18:22:57
many don't even use their notebooks/desktops for messaging at all
Zash 18:23:30
I heard computer sales were picking up because everyone needed to work from home :)
larma 18:23:31
"I was sending SMS from my phone for the last 25 years, I'll continue to do so"
lovetox 18:24:31
that's what I said earlier, the whole idea makes it a bit more efficient, but in the grand scheme of things, where everybody stuffs everything into PEP, it does not really matter
lovetox 18:25:44
but nevertheless it would be more elegant, and xeps would not need to define metadata nodes anymore
Zash 18:25:57
lovetox, have you heard of https://en.wikipedia.org/wiki/Jevons_paradox ? :)
lovetox 18:30:30
no i did not, but now i know
Syndace 18:40:19
should not forget in the size comparison that there are labels
Syndace 18:40:39
and clients will probably set default labels of a few chars
Zash 18:40:58
Labels?
Syndace 18:41:19
optional labels (=strings) for devices
Syndace 18:41:33
to make it easier to identify keys
Syndace 18:41:55
e.g. Gajim could set "Gajim on Windows" for its default label