XSF Discussion - 2021-09-03

southerntofu 09:55:51
following a comment on HN, someone suggested that the entire protocol be versioned to allow easier guessing of features https://news.ycombinator.com/item?id=28393085
southerntofu 09:56:21
maybe that's a bit extreme, but advertising Compliance Suite version as part of disco could be an idea?
southerntofu 09:57:09
so that your server/client could inform you that a newer client/server may provide a better experience, maybe?
MattJ 09:59:16
I don't see any gains to be had from that, only worse interop and pointlessly nagging users
MattJ 09:59:46
We already have disco on a feature-by-feature basis, so there's no need for a "global" one (i.e. advertise compliance suites)
Maranda 09:59:50
~~Erm but compliance suite is just an informative documento even not an actual feature, that's misleading even~~ ✎
Maranda 10:00:02
Erm but compliance suite is just an informative documento even not an actual feature, that's misleading ✏
southerntofu 10:03:06
fair enough :)
wurstsalat 10:32:27
Displaying Compliance Suites on xmpp.org's Software page is already controversial :)
Zash 10:33:38
I thought it was just not completed and merged yet
Ge0rG 10:35:08
Yeah, it would be a good addition.
Ge0rG 10:35:27
I think the controversy was about client developers being inclined to lie about it, and nobody from the XSF being able to prove
Ge0rG 10:35:57
(or disprove)
southerntofu 11:55:39
isn't there a compliance checker like compliance.conversations.im for Complaince Suites(yet)?
jonas’ 11:57:36
what would it check?
southerntofu 11:58:18
whether all specs mentioned in the compliance suite are implemented correctly by a specific client/server
jonas’ 11:58:22
ahahahhaha
jonas’ 11:58:28
*ahem*
jonas’ 11:58:36
no
jonas’ 11:58:43
testing servers is hard, testing clients is harder.
Zash 11:58:54
Testing the tester isn't trivial either
moparisthebest 11:58:56
a tool to check any server/client for correct implementation would be great
jonas’ 11:58:58
note that compliance.c.im also does not what you say.
southerntofu 11:59:02
i'm a big fan of specification-driven development in principle: https://ttm.sh/2l3.md
moparisthebest 11:59:04
but, likely impossible
jonas’ 11:59:21
too long and badly typeset, didn't raed.
MattJ 11:59:29
> moparisthebest> a tool to check any server/client for correct implementation would be great I've never heard anyone say this before /s
jonas’ 11:59:36
moparisthebest, well, the aioxmpp test suite has found a surprising amount of bugs ;)
jonas’ 11:59:40
... in servers.
southerntofu 12:00:03
Zash, yes but the tester is a single implementation that compares others, so if it raises questionable results about clients which appear to work, it's easy to detect and fix
MattJ 12:00:23
"easy" - off you go ;)
southerntofu 12:00:36
i think we've had this discussion in the past but i'd be curious to integrate Scansion in markdown codeblocks to be part of the specs (although it's only for server testing so far, right?)
southerntofu 12:01:05
MattJ, having a single tester impl say "all clients are broken" is easier to detect/debug than random client/server combinations acting up :P (arguably)
jonas’ 12:01:06
kindof like doctests
Zash 12:01:08
The easy way is to look at the DOAP and hope it's not a pack of blatant lies
southerntofu 12:01:33
jonas’, yeah exactly i love doctests from python/rust and i'd be curious how it could be useful for a protocol spec
MattJ 12:01:37
southerntofu, if you think it's easy, I encourage you to try it. It's a lot of work, and complicated.
Zash 12:01:37
`echo echo BROKEN > compliance-checker.sh`
jonas’ 12:02:00
southerntofu, so, markdown isn't really a format used for XMPP specs to start with ...
MattJ 12:02:15
The Editor is speaking
southerntofu 12:02:17
jonas’, could be XML for all i care, i personally like markdown :)
jonas’ 12:02:24
I personally too like markdown.
MattJ 12:02:25
Meanwhile everyone is using Markdown
jonas’ 12:02:36
one day we should migrate XEPs to use markdown
jonas’ 12:02:42
it would make a lot of things easier.
jonas’ 12:02:45
and more accessible.
jonas’ 12:02:50
it would also make other things harder.
jonas’ 12:02:54
anyway
jonas’ 12:02:56
different sujbect
jonas’ 12:03:10
southerntofu, so, actually, I really like your idea
Zash 12:03:10
XMPP flavored markdown
jonas’ 12:03:41
the doctest-style tests in XMPP specs.
Zash 12:04:00
Whatnow?
jonas’ 12:04:01
it would be interesting to see something like that for a simple spec like '30 and a non-trivial spec like '45
moparisthebest 12:04:14
yea they should be written in XEP-0393 instead
jonas’ 12:04:21
/kickban moparisthebest
jonas’ 12:04:44
servers are easy to automate
jonas’ 12:04:47
clients... not so much
southerntofu 12:04:48
jonas’, i mention multiple (de)serialization formats in "# Other spectest-compliant formats" on that draft blogpost
jonas’ 12:05:01
southerntofu, as I said ... "too long and badly typeset, didn't read."
southerntofu 12:05:08
XEP-compliant XML could be one of those
Zash 12:05:20
what jonas’ said
Zash 12:05:36
It's Friday, not Readlongmarkdownonbacklitscreenday
jonas’ 12:06:18
but in the end, the format is irrelevant
jonas’ 12:06:23
the hard part is actually running the tests
jonas’ 12:06:29
against servers it's doable
jonas’ 12:06:37
but clients... no clue
Zash 12:06:38
e.g. see scansion
southerntofu 12:06:42
Zash, TLDR
jonas’ 12:06:45
exactly, scansion
southerntofu 12:06:46
> As we've seen in the previous articles, open standards and associated test suites are key to achieving expected results, and therefore benefit the whole ecosystem. But often, these two concerns are treated as separate problems to deal with in entirely different ways. As a result, some specifications are so long and complex that coming up with a test suite is a challenge (XMPP/ActivityPub), while other systems are backed by a collection of unspecified test suites that are hard to comprehend (Ansible).
jonas’ 12:06:49
or aioxmpp test suite
southerntofu 12:06:51
> Could we get the best of both worlds by treating specification and compliance (testing) as a single problem? This hypothetical approach i call specification-driven development, whereby a specification document is intended both for human and machine consumption. In that case, the specification contains a written presentation of concepts, in addition to a machine-readable test suite that follows a certain format to programmatically ensure that the concepts and behavior described in the specification are implemented properly. This format for specifications is called a spectest document. (TODO: maybe specdoc ?)
jonas’ 12:07:12
needs more paragraphs
moparisthebest 12:07:32
if you can write a specification such that a program can create tests out of it, then a program could also just implement it correctly for you instead ?
moparisthebest 12:07:37
no programmer needed
jonas’ 12:07:45
moparisthebest, see also IDL
southerntofu 12:07:53
moparisthebest, the tests would be handcrafted *as part of the spec*
Zash 12:08:02
Accept: application/epub+zip delivered onto my e-ink device (it has wifi disabled). Good luck!
jonas’ 12:08:21
southerntofu, any practical idea on how to test clients though?
moparisthebest 12:08:28
I don't actually think that helps in 99% of cases southerntofu
jonas’ 12:08:38
moparisthebest, I think it does.
jonas’ 12:08:42
in many ways
southerntofu 12:08:49
jonas’, i imagine a client could be intrsumented by running in a "test mode" where it exposes a well-known API (over a socket or something) instructing it to react in a certain manner?
jonas’ 12:08:50
imagine all examples in a XEP could actually be run against servers.
moparisthebest 12:08:58
the specs are usually clear, and it's easy to test the things in the XEP
jonas’ 12:09:04
southerntofu, ideally without changing the code of existing clients
moparisthebest 12:09:09
that doesn't help with the 99999 things that actually happen in practice
jonas’ 12:09:09
moparisthebest, I disagree.
Zash 12:09:30
`gajim-control` you say?
Wojtek 12:09:35
> a tool to check any server/client for correct implementation would be great uhm at Tigase we have https://github.com/tigase/tigase-tts-ng (and older https://github.com/tigase/tigase-testsuite) which do just that; probably could be nice to generalise it and make available as a service...
moparisthebest 12:10:13
don't get me wrong, it'd be very helpful, a great start, but you'd still miss a lot of things
Wojtek 12:10:40
I know :-)
southerntofu 12:10:42
moparisthebest, if each XEP contained machine tests in addition to human formats, it would help for two things: it would help to clear ambiguities before reaching implementation stage (because if the test contradits human text then somebody will notice earlier and raise issues) and it would help somewhat-correct implementation because you could just do ./xeptest XEP-0030.xml -- myclient --test-mode
jonas’ 12:10:48
moparisthebest, of course, it will never be perfect, but perfect is the enemy of good.
southerntofu 12:10:57
Zash, i'm not aware of gajim-control
southerntofu 12:11:27
jonas’, without changing client code i don't think is possible? would be much harder to emulate user interactions on every possible client :) :)
moparisthebest 12:11:33
we already have "examples are not the spec" surely "tests are not the spec" would be that way too, so I don't think it helps with ambiguities
jonas’ 12:11:49
moparisthebest, but tests would actually be normative
southerntofu 12:11:51
but maybe the test protocol can be simple enough for all clients to implement?
southerntofu 12:12:06
jonas’, yes exactly, no ambiguity in there :)
jonas’ 12:12:08
southerntofu, I have no clue how that protocol would look like
jonas’ 12:12:14
except if you accidentally XMPP the protocol.
jonas’ 12:12:19
but that's kind of not the goal
jonas’ 12:12:39
like, if you instruct a client to add a roster item .... and such ... you end up respecifying XMPP
jonas’ 12:12:43
to instruct a client to do XMPP
southerntofu 12:13:06
yeah but on a higher-level i guess
jonas’ 12:13:23
maybe, but only slightly
jonas’ 12:13:35
it needs to be pretty fine grained if you want to do things like testing individual MUC interactions
southerntofu 12:13:39
and in fact maybe it can serve as guidelines for library UX? so that if the specs are well-written you could just follow the test API and that would be your client library?
moparisthebest 12:13:45
I'm not even aware of tools to do non-browser UI testing
jonas’ 12:13:45
("grant voice to user", "kick user", "unban user"...)
jonas’ 12:13:56
moparisthebest, they exist
jonas’ 12:14:00
they typically use a11y APIs
moparisthebest 12:14:33
I'm sure they do, just never had the opportunity I guess :)
southerntofu 12:14:58
i mean i guess all clients/libraries have a "mute" or "ban" function, maybe standardizing that could be part of the answer?
southerntofu 12:15:13
then it's just a matter of serializing/deserializing instructions to that "API"
jonas’ 12:15:45
southerntofu, I don't see where that could be standardized really
jonas’ 12:15:48
especially across languages.
jonas’ 12:15:53
and UI paradigms
jonas’ 12:16:03
but that standardisation ... it would be like XMPP itself.
southerntofu 12:16:48
then we could have per-XEP testing interface where we expect you to expose some glue to your client code for specific functions?
moparisthebest 12:17:01
what are you actually after testing though
jonas’ 12:17:09
southerntofu, same thing really
moparisthebest 12:17:24
the client author doesn't want to implement a whole other xmpp protocol to see if his "join muc" works, he wants to click "join muc" and see if that works
southerntofu 12:17:48
jonas’, well per-spec is much easier to implement/maintain/test
jonas’ 12:17:58
southerntofu, doesn't solve the core issue though
southerntofu 12:18:07
and like i said if you're developing a client/library from scratch, it can serve as implementation guideline for making a good library API
jonas’ 12:18:13
it is another XMPP-ish thing besides XMPP
southerntofu 12:18:43
yup
jonas’ 12:18:45
for libraries, the story is slightly different and we are slowly venturing toward the lands of IDLs
southerntofu 12:18:51
IDLs?
jonas’ 12:18:57
interface description languages
jonas’ 12:19:05
("interface" as in API)
southerntofu 12:19:06
ah cool that's sort of what i was talking about :P
jonas’ 12:19:26
like OpenGL publishing an XML file where the entire API is contained so that you can auto-generate headers for any language
jonas’ 12:19:48
but that approach only really works (a) in one single lagnuage/paradigm or (b) for really low-level things
southerntofu 12:20:28
yeah it wouldn't work for UI testing most likely, but personally i'm fine with that
moparisthebest 12:22:12
then you just have library testing I guess
southerntofu 12:25:34
if we reach server & library testing that'd already be a huge milestone :)
southerntofu 12:27:48
anyway we'll see, when i find a complete week to hack on something and see where it goes
jonas’ 12:28:11
southerntofu, feel free to loop me (the XEP Editor) in.
southerntofu 12:28:16
can scansion support multiple clients on different servers to keep S2S?
southerntofu 12:28:22
to test s2S*
Zash 12:28:44
I know of no reason why it couldn't
southerntofu 12:30:23
but does it already?
southerntofu 12:31:24
like can i run two scansion instances with the same "scansion script" where one is client A and one is cliebt B and they're aware of each other's address? and the test can be run on two accounts on the same server (simple e2e test) or across servers (account for s2s bugs)?
southerntofu 12:34:11
fro my reading of the homepage it does but i'd like to be sure: https://matthewwild.co.uk/projects/scansion/
Sam 12:37:54
Tried to catch up on the context, but for my library I wrote https://pkg.go.dev/mellium.im/xmpp/internal/integration which is a framework for spinning up servers and clients and linking them up to test against. Works very well for my own integration tests (except for ejabberd which never shuts down cleanly for some reason, so right now it's only running tests that require a server against prosody) and could in theory be adapted to work for other languages and the like and be more stand alone.
Zash 12:39:31
southerntofu, one scansion script can describe multiple clients, which could likely connect to different servers, but that's not something I've tested
Zash 12:40:13
Many of the ones we use to test Prosody involve multiple actors interacting
southerntofu 12:40:14
ok i'll make sure to test that then :)
Zash 12:40:33
Lots of examples in https://hg.prosody.im/trunk/file/tip/spec/scansion/
Zash 12:41:29
I imagine changing `jid: juliet@localhost` to @example.com etc would work.
southerntofu 12:41:53
yeah that was the gist of my question :)
southerntofu 12:42:18
so it looks like there's at least three different testing frameworks for XMPP lcients/servers (tigase/mellium/scansion)
Zash 12:42:44
Time to make a testing framework compliance tester!
southerntofu 12:42:47
would be interesting to see how they compare and whether they could be reunited as part of a single spec
southerntofu 12:43:08
Zash, maybe not that far haha :P
southerntofu 12:44:53
i mean tigase and mellium appear (from a quick look) to be both object oriented testing APIs.. if they could have the same interface it *could* be easier to maintain a single testing library with bindings to a bunch of supported languages)
Zash 12:46:26
Write a scansion script parser in Go and Java?
southerntofu 12:46:43
yeah wy not, or in any language with C FFI, or..
southerntofu 12:46:54
i don't know what the best answer is, but i know there's possibilities to explore
MattJ 12:47:19
The problem is that most libraries try to abstract the protocol, which is different to testing the protocol. You'll only test that the library author's model and assumptions hold true.
MattJ 12:47:40
That's why Scansion is low-level (but easy), but also tests are typically server-dependent.
southerntofu 12:48:25
why would test be server-dependent? because the server would return more information if it supports further extensions?
southerntofu 12:48:43
or because the fields in a stanza would not be serialized in the same order?
MattJ 12:48:56
Scansion ignores additional elements (in unexpected namespaces) by default
MattJ 12:49:28
Also element order in most cases iirc (there are some different checking modes)
MattJ 12:49:50
But there are multiple ways to implement something and be compliant with the spec in many places
MattJ 12:50:30
Every time you read "MAY" in a spec, that's a gain for implementers and a pain for testers :)
southerntofu 12:50:36
MattJ, surely there's a *finite* number of ways to comply with the spec, right? if so "OR Juliet receives" (boolean logic) could do the trick?
Zash 12:51:25
You could copy the script and have one for each possibility. The challenge is not collapsing into a black hole.
MattJ 12:51:39
The finite possibilities quickly add up
MattJ 12:51:48
I'm not saying it's impossible, I'm just saying it's a lot of work
southerntofu 12:52:01
(although like you said scansion as it is would be ill-suited to test clients/libraries, maybe a unified test API would be more suited in that case)
MattJ 12:52:20
I'm also not saying it's not valuable work, but nobody is motivated to do it after they've already implemented (to their knowledge) a correct XMPP implementation
southerntofu 12:52:25
MattJ, i do understand the possibility of that in an extensible protoco but in regards to XMPP i don't see, do you have any practical example that comes to mind?
southerntofu 12:52:58
i've found so far that specs were rather complete or ambiguous but never offered infinite interpretations
MattJ 12:53:15
I'm not claiming any offer infinite interpretations
southerntofu 12:53:24
(contrary to ActivityPub/Microformats for instance)
MattJ 12:53:26
(or that none do)
MattJ 12:54:46
I encourage you to just try it if you're interested in solving this problem
southerntofu 12:55:58
sure, i'll certainly try with scansion to test multiple servers
southerntofu 12:58:01
in the meantime i'd be curious how Wojtek and Sam see each other's testing framework and whether they could imagine to cooperate in a hypothetical future on a single testing API/suite
moparisthebest 12:58:41
It might be better to have different testing implementations
MattJ 12:58:59
I think it's more valuable to have multiple, especially if real implementations are built on the same libraries
southerntofu 12:59:41
well then cooperate on a test format that could be implemented by both testing systems?
MattJ 13:00:23
A test output format would be nice, to say "I tested XEP-nnnn support, it passed"
Zash 13:00:53
Aren't there a pile of such formats?
goffi 13:02:52
Hey there, I've just been hit by https://docs.python.org/3/whatsnew/3.9.html#urllib-parse (Python 3.9 URL parsing does not recognise ";" anymore as a separator, following W3C changes). As we use ";" as a separator in XMPP URIs, that smells like a more general issue for us
Ge0rG 13:03:48
I've had to work around the Android Java URI parser not recognizing that some years ago already.
goffi 13:04:56
python has a new parameter, so the fix should be easy there
Zash 13:12:43
Mmmm, URI vs URL fun.
moparisthebest 13:25:44
More like URFd
emus 13:49:46
Ge0rG, southerntofu, Zash, wurstsalat: but we host the xmpp providers list, which tries to reference each entry