-
mdosch
Some iteam members present? https://mail.jabber.org/pipermail/operators/ seems to be offline. At least I can't reach it.
-
mdosch
Meh, once you complain out of a sudden it works again. :-/
-
eevvoor
mdosch, yeah, the presentation phenomenon ;)
-
flow
jonas’> MattJ, note that XML forbids control characters, even in escaped form Do you have a reference for me?
-
flow
I just tried to find the corresponding part via a quick look at the XML 1.0 spec, but failed. I very well believe that something like that could be in there
-
MattJ
flow, https://www.w3.org/TR/REC-xml/#sec-references
-
MattJ
"Character references" (i.e. &#nnn; escapes) are only allowed to represent legal characters
-
flow
MattJ, "Legal characters are tab, carriage return, line feed, and the legal characters of Unicode and ISO/IEC 10646"
-
flow
not sure (yet) if this definition of "legal character" includes e.g. control characters
-
MattJ
"Characters referred to using character references MUST match the production for Char."
-
MattJ
Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
-
flow
ahh
-
flow
so I guess everyhing that does not match this production, must be escaped via JSON escaping
-
MattJ
Yes
-
flow
although I could image that it could be disputed if "\u0007" is valid in JSON
-
flow
but that's not our discussion :)
-
flow
now i wonder if there are code points that are legal in XML 1.0 but have to be escaped using hex entity encoding
-
Link Mauve
flow, depends on the context, for instance in attributes, U+0022 and U+0027 are both legal, but one of them has to be escaped.
-
Link Mauve
U+003C in most contexts too of course.
-
flow
yep, i mean besides the set of general etnities (amp, lt, gt, apos, quot)
-
jonas’
flow, I don’t think so
-
Link Mauve
Hmm, depending on the document encoding, characters outside of its representable set of Unicode?
-
Link Mauve
For instance if your document is latin1, any character above that would have to be escaped.
-
flow
encoding happens one layer below, shouldn't be relevant
-
Link Mauve
Obviously irrelevant in the context of XMPP.
-
jonas’
flow, does it?
-
jonas’
it does
-
jonas’
nevermind
-
flow
the encoding, on the wire or within the file, is not relevant for the discussion how we stuff JSON in XMPP's XML✎ -
flow
the encoding, on the wire or within the file, is not relevant for the discussion how we shove JSON in XMPP's XML ✏
-
Link Mauve
flow, because it is set to UTF-8 in XMPP, but e.g. if you use an XML editor and save as latin1, said editor would have to replace any codepoint above U+00FF with an escaped version of it, giving the exact same result as if it had been escaped previously.
-
Link Mauve
Am I wrong?
-
Link Mauve
What the character data represents is irrelevant in this case.
-
flow
Link Mauve: JSON -A> XMPP -B> XML -C> wire
-
flow
we are only talking about A and B here
-
Link Mauve
Ok.
-
flow
that the resulting XML in this chain is encoded using UTF-8 is irrelevant for the discussion what and how to escape JSON in XMPP's XML✎ -
flow
that the resulting XML in this chain is encoded using UTF-8 before being put on the wire is irrelevant for the discussion about what and how to escape JSON in XMPP's XML ✏
-
Link Mauve
Well not so much, it can be considered irrelevant only because UTF-8 allows representing all legal characters in XML 1.0, so only illegal characters (but legal in JSON) have to be taken care of.✎ -
Link Mauve
Well not so much, it can be considered irrelevant only because UTF-8 allows representing all legal characters in XML 1.0 in unescaped form, so only illegal characters (but legal in JSON) have to be taken care of. ✏
-
flow
Link Mauve, but the set of illegal XML 1.0 characters does not depend on the encoding used to store the XML, right?
-
Link Mauve
No, it doesn’t.
-
flow
ahh double negation
-
flow
not sure if in aggreement or not✎ -
flow
not sure if in agreement or not ✏
-
Link Mauve
It is an agreement. ^^
-
flow
phew
-
Link Mauve
:)
-
jonas’
language!
-
jonas’
(is hard)
-
Zash
something something langsec
-
emus
Mid-month reminder to send your project news, aritcles and updates to the XMPP Newsletter on September! 💌️ https://github.com/xsf/xmpp.org/pull/787✎ -
emus
Mid-month reminder to send your project news, articles and updates to the XMPP Newsletter on September! 💌️ https://github.com/xsf/xmpp.org/pull/787 ✏