XSF Discussion - 2021-04-16

Sam 13:34:43
Who else is a treasurer or treasurer adjacent that should have access to the Open Collective? I assume board people? All of them or just some? Anyone else?
Kev 13:35:05
Just Peter, probably.
Sam 13:35:57
I vaguely feel like there should be more than one person with access to reduce bus factor, especially when it comes to things that handle money, but whatever the board wants I suppose.
Sam 13:44:09
We should also consider who is allowed to use the XSF as a fiscal host and how we decide. My instinct is "anything XMPP related" and "at the board's discretion", but it would be good to get that confirmed by the board and have the treasurer or someone else bring any new applicants before the board each week. (I am happy to do this if Peter doesn't, since it's just forwarding names along. I just want to make sure the board is okay with all this, since it involves money and I don't want to just make a bunch of stuff up and hope it's fine.)
jonas’ 13:44:23
something about a CoC
Sam 13:46:12
This is a little bit different, but also making people agree to follow the CoC once we have one if they want to use us as a fiscal host seems reasonable. I'll draft some text and send it to the board email for discussion. I think it will be relatively non-controversial and we can always change it at any time.
Zash 13:47:10
How do we determine which pieces of software go on the software listings? Probably some overlap with that selection method.
Sam 13:49:57
In case anyone wants to brainstorm: https://pad.disroot.org/p/XSF_Fiscal_Host_Rules
Sam 13:55:30
huh, TIL: "Jabber Open Source License" https://opensource.org/licenses/jabberpl
Sam 13:55:40
I'm assuming that was an early jabberd thing. Glad that got retired.
moparisthebest 14:21:58
dwd, flow, if you have a spare moment you could see if you are less horrified by https://github.com/moparisthebest/xmpp-proxy/blob/master/src/stanzafilter.rs#L224 (and thanks for the state machine hint flow !) again the point is to NOT have a full on XML parser, but simply to reliably split on stanza boundaries so complete stanzas can be passed to a real XML parser later
moparisthebest 14:22:37
on a related note, is anyone aware of some comprehensive XMPP XML stream tests anywhere?
Sam 14:23:51
moparisthebest: what sort of tests are you looking for? More stuff like the ones you linked for splitting XML, or something that matches XML streams to a big jabber:client schema or something?
moparisthebest 14:25:37
I only need to test splitting XML stanzas out of a stream, so strange formatting, CDATA, processing instructions, comments, really anything that might trip such a thing up
moparisthebest 14:26:15
in the end, probably need to investigate creating some type of XMPP XML stream fuzzer, but in the short term I was hoping to steal some test cases from existing projects
Zash 14:26:32
`<x><![CDATA[ lol</x> ]]></x>`
dwd 14:26:56
moparisthebest, I'm wondering if you maybe *do* need an XML parser, but a decent fast one. I used rapidxml (or at least a fork of it) in Metre, which worked really well, and stood up to AFL very well.
moparisthebest 14:30:04
Zash, handles that one fine thanks
moparisthebest 14:30:24
added it to the test
moparisthebest 14:32:38
I just want to split on stanza boundaries, I do not want to allocate memory to parse anything
dwd 14:33:12
moparisthebest, Sure, but rapidxml doesn't allocate anything either.
dwd 14:33:27
moparisthebest, And you're getting achingly close to an XML parser there anyway.
moparisthebest 14:33:30
http://rapidxml.sourceforge.net/manual.html#namespacerapidxml_1memory_allocation ?
dwd 14:35:01
moparisthebest, And yet, `<x a='/>'>This is going to be fun.</x>`
dwd 14:36:38
moparisthebest, Yeah, there's a pool for attributes, but since it's a pool it's a single allocation. If you ported it you could *probably* ditch that for the kind of "chopping out elements" work you're trying to do.
Sam 14:38:11
I really need something like this in Go too. I try to keep Mellium relatively fast, but the XML parser is *terrible* and there's not much point to me optimizing things when we're using a parser as slow as the one we're using
dwd 14:38:13
moparisthebest, `<a a='![CDATA['/>` might be fun too.
Sam 14:38:49
moparisthebest: you might consider fuzzing this. XML is flexible enough that I don't think you'll come close trying to think up edge cases yourself.
Zash 14:39:25
Probably easier to find a generic fuzzer and let it figure out XML syntax anyway
dwd 14:39:33
moparisthebest, What Sam says; besides, you'll just be writing the same assumptions into your tests that you've been coding for, like all of us do.
dwd 14:39:44
Zash, AFL can do this, indeed.
dwd 14:39:55
Zash, Dunno if it'll work with Rust, but ... maybe?
Zash 14:40:32
AFL-RIIR is probably a thing already
Sam 14:41:16
Is it worth tying fiscal sponsorship to membership (or saying that at least one person in your project must begin seeking membership)? I don't know if it matters, just seems like something organizations do. That way you've already accepted whatever CoC and other rules we come up with.
Sam 14:41:28
Maybe not, that seems super limiting.
Sam 14:41:38
</thinking-out-loud>
Zash 14:41:59
Seems sensible. (needing membership.)
Sam 14:42:10
But also, why?
Zash 14:42:20
Dunno. Why not? Dunno to that too.
dwd 14:42:21
I don't actually know if that's sensible or not.
Sam 14:42:50
It would be nice to have a representative from every project, but also if this is a service to the community then maybe we want to make it as easy and open as possible.
Sam 14:43:07
Not something we have to decide immediately, I'm just thinking about what a policy write up would look like.
moparisthebest 14:45:04
`<a a='![CDATA['/>` works fine, but indeed I hadn't planned for `<x a='/>'>This is going to be fun.</x>` which will require a "InAttribute" state, thanks dwd
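A minimal sketch of the state tracking discussed here, written in Python for brevity and not taken from xmpp-proxy. The name `split_stanzas` and its return shape are assumptions for illustration. It assumes the stream header has already been consumed, remembers the open quote character while scanning a tag so that a `>` inside an attribute value is ignored, skips CDATA sections whole, and does not handle comments or processing instructions. It also builds strings freely, so it does not meet the zero-allocation goal.

```python
def split_stanzas(data: str) -> tuple[list[str], str]:
    """Return (complete top-level elements, unconsumed remainder)."""
    stanzas: list[str] = []
    depth = 0      # nesting depth; 0 means "between stanzas"
    start = 0      # index where the current top-level element began
    consumed = 0   # index just past the last complete stanza
    i, n = 0, len(data)
    while i < n:
        if data.startswith("<![CDATA[", i):
            end = data.find("]]>", i + 9)
            if end == -1:
                break              # CDATA section not finished yet
            i = end + 3
            continue
        if data[i] != "<":
            i += 1                 # plain character data
            continue
        # Scan to the tag's real closing ">", ignoring quoted attribute values.
        j, quote = i + 1, None
        while j < n:
            c = data[j]
            if quote is not None:
                if c == quote:
                    quote = None   # leaving the "in attribute" state
            elif c in "'\"":
                quote = c          # entering the "in attribute" state
            elif c == ">":
                break
            j += 1
        if j == n:
            break                  # tag not finished yet
        if data[i + 1] == "/":     # end tag
            if depth == 0:
                # e.g. the stream's own closing tag; out of scope for this sketch
                raise ValueError("end tag without a matching start tag")
            depth -= 1
            if depth == 0:
                stanzas.append(data[start:j + 1])
                consumed = j + 1
        elif data[j - 1] == "/":   # self-closing tag
            if depth == 0:
                stanzas.append(data[i:j + 1])
                consumed = j + 1
        else:                      # start tag
            if depth == 0:
                start = i
            depth += 1
        i = j + 1
    return stanzas, data[consumed:]
```

A caller would prepend the returned remainder to the next read from the socket and call the function again.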
jonas’ 14:46:17
afl works with anything (but is less efficient and less effective) if you run it in qemu mode :)
dwd 14:46:23
moparisthebest, Right - it *feels* like you're basically writing an XML lexer, if not a parser. Though I'll be honest and say this is one of those cases where my lack of a CS degree means I don't really know the difference.
moparisthebest 14:46:28
Metre can't proxy c2s right?
dwd 14:46:37
moparisthebest, Nope.
dwd 14:46:59
moparisthebest, And it only "truly" proxies S2S with the server's consent, as it were.
jonas’ 14:47:05
moparisthebest, right, what dwd says -- check out parser generators and let one of them build a lexer for you based on the official XML grammar
jonas’ 14:47:18
that won’t allocate a lot if anything at all, depending on the implementation
Kev 14:47:35
I’d probably *not* be inclined to encourage membership for the sake of the sponsorship stuff. On the basis that the XSF doesn’t benefit from having lots of members, only from having members who are sufficiently motivated/able to do the few teams that need membership, and otherwise to be on top of things enough to make judgements on Council/Board positions based on people’s interactions with the community. Encouraging people to become members purely to get access to money doesn’t really help with that.
Zash 14:48:08
Kev, good point.
dwd 14:50:26
Indeed.
Sam 14:55:32
*nods* good point
moparisthebest 16:45:38
I guess fuzzing won't really do what I need, I need a stream of XMPP XML and to verify I split it at the correct boundaries
moparisthebest 16:47:45
no one knows of projects that have tests consisting of anything like that for their parsers ?
Kev 16:55:20
Fuzzing is what you need in terms of testing you don’t fall apart in the face of bad input, but not in terms of ensuring boundaries are correct, indeed.
moparisthebest 16:57:03
yea fuzzing is certainly valuable, just also need other things
Sam 17:01:13
Maybe more of a mix of fuzzing and integration testing then. Generate random XML input, pipe it through your splitter and a real parser, when you detect a difference generate a unit test from that.
moparisthebest 17:01:55
generating random-but-valid XMPP-subset-of-XML sounds hard
Sam 17:05:28
Not really. Elements, random cdata, random attributes.
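A rough sketch of such a generator, assuming Python and a hand-picked set of awkward fragments; every name here (`random_stanza`, `random_stream`, the fragment lists) is made up for illustration. Because the stream is built by joining known stanzas, the correct split points are known by construction. It only exercises constructs its author already thought of, so it complements a coverage-guided fuzzer and does not replace one.

```python
import random
import xml.etree.ElementTree as ET

NAMES = ["a", "b", "x", "message", "body"]
# Fragments chosen to confuse naive splitters while staying well-formed.
ATTR_PIECES = ["/>", ">", "![CDATA[", "=", " ", "a"]
CDATA_PIECES = ["<", ">", "</x>", "<x a='", "&", "'", '"', " ", "lol"]
TEXT_PIECES = ["hi", " ", "&lt;", "&amp;", "&gt;", "'", '"']


def random_stanza(rng: random.Random, depth: int = 0) -> str:
    name = rng.choice(NAMES)
    attrs = ""
    # sample() gives distinct attribute names, so no duplicate attributes
    for attr_name in rng.sample(["p", "q", "r"], rng.randint(0, 3)):
        quote = rng.choice("'\"")
        other = '"' if quote == "'" else "'"
        value = "".join(
            rng.choice(ATTR_PIECES + [other]) for _ in range(rng.randint(0, 4))
        )
        attrs += f" {attr_name}={quote}{value}{quote}"
    if rng.random() < 0.2:
        return f"<{name}{attrs}/>"
    children = []
    for _ in range(rng.randint(0, 3)):
        kind = rng.choice(["text", "cdata", "element"])
        if kind == "element" and depth < 3:
            children.append(random_stanza(rng, depth + 1))
        elif kind == "cdata":
            body = "".join(rng.choice(CDATA_PIECES) for _ in range(rng.randint(0, 4)))
            children.append(f"<![CDATA[{body}]]>")
        else:
            children.append(
                "".join(rng.choice(TEXT_PIECES) for _ in range(rng.randint(0, 4)))
            )
    inner = "".join(children)
    return f"<{name}{attrs}>{inner}</{name}>"


def random_stream(rng: random.Random, count: int) -> tuple[str, list[str]]:
    """Return (concatenated stream, the stanzas a splitter should recover)."""
    stanzas = [random_stanza(rng) for _ in range(count)]
    return "".join(stanzas), stanzas


def is_well_formed(xml_text: str) -> bool:
    """Sanity check of the generator itself against a real parser."""
    try:
        ET.fromstring(xml_text)
    except ET.ParseError:
        return False
    return True
```

Seeding `random.Random` makes any failure reproducible, so a failing stream can be pasted into a unit test.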
moparisthebest 17:06:29
yea, but then we are back to testing only the things I know about
Kev 17:14:28
Yeah. It’s easy as long as you don’t want anything that you didn’t already think of and could have generated manually :D
moparisthebest 17:15:11
essentially yea :)
Kev 17:15:39
I, once upon a time, wrote an XML-aware (and fairly naive) fuzzing layer for Swiften that would modify stanzas on the way out randomly so we could run ‘good’ Sluift scripts against M-Link and have them modified in malicious ways.
Kev 17:16:19
That was in the days before AFL. These days you’d run the same scripts to generate a corpus to feed into a branch-aware fuzzer instead, presumably.
Sam 17:17:28
No, because you have random attributes and the like
moparisthebest 17:18:11
"random" but also that follows the rules I know about like cannot contain " or ' , but those are the rules I know, and have implemented already
moparisthebest 17:20:44
basically for each chunk my splitter spits out, when fed into a real XML parser, it should either:
1. parse a complete stanza
2. error out because of invalid xml (mis-matched tags etc etc)
the thing it should never do is:
3. wait for the rest of a partial stanza
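One way to check those three outcomes mechanically, sketched with Python's standard `xml.etree.ElementTree.XMLPullParser` as the "real XML parser". The function name `classify_chunk` and the three result strings are assumptions for illustration. Note that this parser is namespace-aware, so a chunk using a prefix declared only on the stream root (such as `stream:features`) would be reported as invalid unless it is wrapped in a suitable root first.

```python
import xml.etree.ElementTree as ET


def classify_chunk(chunk: str) -> str:
    """Classify one splitter output as 'complete', 'invalid' or 'partial'."""
    parser = ET.XMLPullParser(events=("start", "end"))
    depth = 0
    closed = False  # becomes True once the outermost element has ended
    try:
        # The whole chunk is fed in a single call; parse errors surface as
        # ParseError either here or while reading the queued events.
        parser.feed(chunk)
        for event, _elem in parser.read_events():
            if event == "start":
                depth += 1
            else:
                depth -= 1
                if depth == 0:
                    closed = True
    except ET.ParseError:
        return "invalid"   # outcome 2: the real parser rejects it
    # Outcome 1 if the root element closed, otherwise outcome 3:
    # the parser is still waiting for more input.
    return "complete" if closed else "partial"
```

A test harness would then assert that no chunk produced by the splitter is ever classified as `"partial"`.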
flow 17:20:54
moparisthebest, no, jxmpp has a corpus of valid and invalid JIDs, but no corpus of valid and invalid XMPP streams. Wanna team up? :)
Kev 17:22:24
FWIW, I would be inclined to use an XML library for this, unless and until you can see that the performance through that is inadequate.
moparisthebest 17:22:40
even my super naive and known-wrong initial splitter worked perfectly fine with normal-case XMPP, I ran it for days filtering 100% of XML into my server without any errors, it's the other cases that need work
moparisthebest 17:23:21
I'm after zero-memory-allocations rather than performance
Kev 17:23:21
TBH, if you’re worried about ‘working’ rather than ‘correct’, a few days of data on an active and well-peered server usually catches most edge cases, in my experience.
Sam 17:23:24
I guess I don't get what you're trying to test for then. Running random inputs against a real XML parser and your thing seems like it would identify unknown areas where splitting is broken.
moparisthebest 17:25:09
yes I think it'd be valuable, just not as valuable as the horrors-people-have-seen-in-the-wild and added test cases for, but if those don't exist...
Kev 17:25:39
Run AFL against libxml2, generate a corpus, feed that in?
Sam 17:25:58
Yah, I don't know that you'll do well finding specific things from people to test. This is too general for that.
moparisthebest 17:26:07
now that's an interesting thought Kev ...
Sam 17:26:46
Isn't that what I said except recommending specific tools? I am not understanding something about what's being tested here I guess.
Kev 17:26:58
It may be what you said, but not what I read :)
Kev 17:27:15
(Which is probably on me)
moparisthebest 17:27:27
Sam, mainly, if I write a tool to generate all possible XML as I understand it, I might miss something valid that I don't know about
moparisthebest 17:27:40
vs, fuzzing, in theory, should eventually hit all cases ?
Sam 17:28:53
Fuzzing is literally what I said, but yah, I didn't mean "write your own thing". Anyways, what Kev said is what I was suggesting. Do that, it will be better than asking for samples which will never catch the one weird edge case.
Sam 17:29:03
My apologies if I wasn't clear.
moparisthebest 17:29:17
no my bad I appreciate it
moparisthebest 17:30:55
flow, sure, but maybe this is a good path forward already, convince a fuzzer to generate individually good stanzas, combine them in random orders for good streams ? :/
flow 17:37:45
moparisthebest, not saying that it's not, just that a curated corpus would be also nice
moparisthebest 17:38:08
flow, I agree, got any thoughts on gathering that together? :)
Sam 17:38:45
What is this corpus for specifically?
moparisthebest 17:38:59
testing XMPP XML stream parsing ?
Sam 17:39:00
Just where to split XML tokens?
Sam 17:39:27
I'm just wondering how many people actually do their own XML parsing.
flow 17:39:33
I was thinking of a corpus of valid and invalid XMPP streams
moparisthebest 17:39:39
my thing is only concerned with where to split stanzas out of an XML stream, but such a corpus would be more generally useful
Sam 17:39:49
I just don't understand what that tests unless you wrote your own parser
flow 17:40:01
entries in the valid corpus would contain the stream and the individual elements that the splitter should identify
moparisthebest 17:40:04
and how many wrongly use generic XML parsers and allow comments, processing instructions, etc., Sam?
flow 17:40:11
and entries in the invalid corpus should be just rejected
moparisthebest 17:40:16
XMPP only allows a subset of XML
Sam 17:40:44
If it's just that you use a parser then you don't really need a corpus except those few things that are forbidden by XMPP, I've got tests for all those things if you want them
moparisthebest 17:40:51
I'm sure many projects actually do this, just like many threw XHTML-IM into a DOM
moparisthebest 17:41:17
if you caught them all, but yes that would be a good starting point
moparisthebest 17:41:37
I think assuming the XML parser you chose actually works well is a mistake
moparisthebest 17:41:51
well, I know it's a mistake...
flow 17:41:58
to be fair, most XML parsers I've worked with could be easily modified to reject most things XMPP disallows
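As an illustration of that kind of modification, here is a small sketch using Python's standard `xml.parsers.expat` bindings; it is not taken from any of the projects mentioned. The names `ForbiddenXML`, `make_restricted_parser` and `check` are hypothetical. It installs handlers that raise on comments, processing instructions and DTDs, which RFC 6120 forbids in XMPP streams. With DTDs rejected, no custom entities can be declared, so references to anything beyond the predefined entities fail as ordinary parse errors.

```python
import xml.parsers.expat


class ForbiddenXML(Exception):
    """Well-formed XML that uses a feature XMPP does not allow."""


def make_restricted_parser():
    parser = xml.parsers.expat.ParserCreate()

    def forbid(what):
        def handler(*_args):
            raise ForbiddenXML(what)
        return handler

    # Exceptions raised inside expat handlers propagate out of Parse().
    parser.CommentHandler = forbid("comment")
    parser.ProcessingInstructionHandler = forbid("processing instruction")
    parser.StartDoctypeDeclHandler = forbid("DTD")
    return parser


def check(document: str) -> str:
    parser = make_restricted_parser()
    try:
        parser.Parse(document, True)
    except ForbiddenXML as exc:
        return f"forbidden: {exc}"
    except xml.parsers.expat.ExpatError:
        return "malformed"
    return "ok"
```

The XML declaration is reported through a separate handler, so it is not caught by the processing-instruction rule here.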
Sam 17:42:02
I disagree. I mean, you should certainly use a proper XML parser but if you're going to write tests for it you should be upstreaming those, not re-testing what's already been tested
Sam 17:42:47
(or what's likely to have already been tested; obviously if you pick an XML parser that's untested that's a problem, I'm just saying that I don't see why you'd retest it in the XMPP library instead of just writing tests for the parser itself)
moparisthebest 17:44:32
most XMPP things I see test individual stanzas, and not an XMPP-XML-Stream, and that's a mistake
Sam 17:46:53
Why? I mean, I get the need for a test ensuring the parser got limited correctly, but then you can test at the parser level that it correctly rejects comments and the like
flow 17:49:11
Sam, re your tests, link pls :)
Sam 17:50:14
I'll have to go dig them up, I think there's one or two in internal/stream, or I may not have ever published them. They do not test the stream in the way moparisthebest wants though, I have separate tests that make sure the parser actually gets wrapped in the "XMPP valid stuff only" wrapper
Sam 17:50:35
But I have a meeting starting in a few minutes, I'll see if I can't find them afterwards.
flow 18:17:26
no worries. I think that also nicely demonstrates the value of an XMPP stream corpus: being able to point people to a repo where they will find plain text files and telling them: your implementation should be able to parse the valid-stream files, and reject the invalid-stream files
flow 18:19:06
whereas what we have right now are probably mostly tests written in the programming language's native (unit-)test framework, where you have to carefully extract the test vectors if you want to re-use them
moparisthebest 18:19:56
yep!
moparisthebest 18:20:39
probably want to include the location where the invalid ones become invalid, maybe byte index of the last successfully-parsed stanza or something?
mathieui 18:22:09
FWIW slixmpp/sleekxmpp has raw stanzas in the unit test suites, but that’s in part because it lets us check that our generated objects are valid, and also it mostly allows copypasting from XEP examples :p
mathieui 18:22:25
(so, not too much value as a parser test)
moparisthebest 18:22:36
you certainly want both types of tests
flow 18:27:40
moparisthebest, for a start I'd probably go with a simple test comment stating where the test is expected to "fail"
flow 18:28:26
but yes, if your parser provides you with the exact coordinates where something went wrong, it can not hurt to compare those with the expected values
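To make the proposed corpus shape concrete, here is a purely hypothetical sketch in Python. No such corpus or format exists yet, and `CorpusEntry`, `SplitError`, `check_entry` and `toy_split` are all invented for illustration. A valid entry carries the stream plus the elements a splitter should produce; an invalid entry carries the stream and, optionally, the offset at which rejection is expected, which is compared only when the entry records one.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class CorpusEntry:
    stream: str
    stanzas: Optional[List[str]] = None  # expected split; None marks an invalid stream
    fails_at: Optional[int] = None       # expected error offset, if the entry records one


class SplitError(Exception):
    def __init__(self, offset: int):
        super().__init__(f"invalid stream at offset {offset}")
        self.offset = offset


def check_entry(entry: CorpusEntry, split: Callable[[str], List[str]]) -> bool:
    try:
        got = split(entry.stream)
    except SplitError as exc:
        # Invalid entries must be rejected; the offset is checked only if given.
        return entry.stanzas is None and entry.fails_at in (None, exc.offset)
    return entry.stanzas is not None and got == entry.stanzas


def toy_split(stream: str) -> List[str]:
    """Stand-in splitter for the demo: only understands self-closing tags."""
    if "<!--" in stream:
        raise SplitError(stream.index("<!--"))
    return [part + "/>" for part in stream.split("/>") if part]
```

Keeping the expectations as data keeps the test vectors reusable across implementations, since each project only needs to supply its own `split` adapter.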