flow, I don’t see how your solution suggested on xmpp@ietf is practical.
1. "let’s change all the existing unicode interface implementations!" does not seem practical to begin with
2. it still leaves the issue of software not being on the same version all the time across the internet, and updating the unicode database is a *breaking* update [1], so it won’t happen e.g. on debian
[1]: https://postgresql.verite.pro/blog/2018/08/27/glibc-upgrade.html
kokonoehas left
flow
jonas’, I did not suggest to change all existing unicode interface implementation, nor to perform it system wide. The point is, I could take CSH's precis library, add a feature which allows to (dynamically) load a new Unicode Character Database (UCD) and be done. Yes, probably not everyone will always be on the same Unicode Version, but there is not much you can do about it, besides reducing the amount of services which are not on the same unicode version. And increasing agility regarding the supported unicode standard helps here
remkohas joined
kokonoehas joined
Daniel
yes to my understanding this is the benefit of precis. you can just ship a new unicode database with a minor update of the app. you don’t have to write new code
kokonoehas left
Daniel
that should be as easy as increasing a build dependency somewhere (or something)
jonas’
let’s load the unicode database from the xmpp server
kokonoehas joined
larmahas left
Daniel
so one could hope that proliferation of new databases goes rather fix
jonas’
well, at least the only implementation of Precis I know for python it just uses the db shipped with python
larmahas joined
Daniel
well if we identify that early as a pontential source of problems we could hand out implementation notes that say please make it pluggable
jonas’
true
flow
jonas’, i actually pondered about putting the unicode database into dns as PoC. It may be not suited for clients, due privacy reasons, but maybe for servers. And it's fun ;)
dwd
flow, I agree that your proposal would be an improvement, but like others, I see it as a mitigation given the circumstances rather than an outright solution.
kokonoehas left
flow
dwd, point of view i guess, I don't think there is a solution which does not involve agility of the supported unicode standard, so I see agility as a solution
flow
but happy to be proven wrong
dwd
flow, I think we have to be aiming for what you're suggesting. I do not think it'll solve things, just lessen their occurence.
jonas’
https://github.com/byllyfish/precis_i18n/issues/8
jonas’
let’s see how they react :)
dwd
jonas’, I suspect it's a matter of having unicodedata able to dynamically load.
jonas’
dwd, that’s what I’m asking for, ain’t I?
dwd
jonas’, I mean, in the standard Python library, not so much in precis_i18n.
jonas’
dwd, python uses it for parsing, so I doubt that’s happening
jonas’
(I think)
stpeterhas joined
jonas’
having unicodedata load dynamically in a separate object (like you can instantiate separate `random` objects) would be a measure of course
ajhas joined
jonas’
Daniel, you have a working prototype for MIX? Is it using :core:0 or :core:1 for joining? MIX-PAM uses :core:0, but MIX-CORE uses :core:1 :(
zachhas left
zachhas joined
Daniel
haven’t notice that. i seem to be using core:0 and pam:0
jonas’
nice, it’s pam:1 in the spec though
Daniel
and it was working with the one and only server implementation
Daniel
it has been 3/4 year though since i tested things
Mikaelahas joined
stpeterhas left
j.rhas left
intosihas joined
ralphm
dwd: I'm curious if we could something with service discovery and/or stream features.
dwd
ralphm, Possibly. But - taking "schloß@example.org" for a moment - if we assume that the caonical form is decided by "example.org", then a local user might type "schloss@example.org", and at some point we ask example.org what the correct version is. At that point, do we need heavyweight canonicalisation on other servers? (We might do - I'm just exploring).
kokonoehas joined
Zash
Dear lazyxmpp, I wish for a PRECIS implementation suitable for use in C
U-061Chas joined
kokonoehas left
ralphm
Did the normalization of ß change between Unicode versions?
kokonoehas joined
LNJhas left
U-061C
that response on the ietf mailing list, i believe, misses the point. this is a security issue of denial-of-service, not just compatibility
U-061Chas left
winfriedhas left
winfriedhas joined
kokonoehas left
ralphm
Indeed Nameless RTL Person.
remkohas left
Zash
Should we adopt U+061C instead of the speech bubbles? 🙂
Zash
Or maybe U+00DF
Ge0rG
Zash: U+FFFD was also suggested.
U+061C
(btw, U+061C was the first character that I could find to have an empty nickname. some other ones were already blocked when i tried)
U+061C
so it's not that bad ^^
Zash
Space characters that are in Unicode 3,2 (or whatever stringprep uses) would be normalized to U+20 and then the nickname is forbidden if that's all there is
zachhas left
zachhas joined
Zash
Space-ish codepoints outside of that tho
ralphm
Well, sure. That's why PRECIS came about
U+061C
well, a nickname that looks empty
kokonoehas joined
winfriedhas left
winfriedhas joined
winfriedhas left
waqashas left
mukt2has joined
kokonoehas left
winfriedhas joined
U+061Chas left
kokonoehas joined
kokonoehas left
Yagizahas left
zachhas left
zachhas joined
kokonoehas joined
Yagizahas joined
debaclehas joined
j.rhas joined
Steve Killehas left
Steve Killehas joined
LNJhas joined
kokonoehas left
kokonoehas joined
zachhas left
zachhas joined
kokonoehas left
kokonoehas joined
remkohas joined
Ge0rG
At least it doesn't crash iPhones
zachhas left
zachhas joined
lskdjfhas joined
adiaholichas left
adiaholichas joined
stpeterhas joined
pdurbinhas joined
karoshihas left
karoshihas joined
pdurbinhas left
karoshihas left
karoshihas joined
remkohas left
zachhas left
zachhas joined
kokonoehas left
stpeterhas left
kokonoehas joined
eevvoorhas joined
remkohas joined
zachhas left
zachhas joined
lumihas joined
mukt2has left
matlaghas left
zachhas left
zachhas joined
pdurbinhas joined
matlaghas joined
matlaghas left
matlaghas joined
eevvoorhas left
eevvoorhas joined
zachhas left
zachhas joined
lovetox_has left
LNJhas left
pdurbinhas left
remkohas left
zachhas left
zachhas joined
ajhas left
Nekithas left
Nekithas joined
APachhas joined
stpeterhas joined
Yagizahas left
Yagizahas joined
j.rhas left
kokonoehas left
j.rhas joined
stpeterhas left
moparisthebest
Nice one jonas’ :
> it is required that users of PRECIS have precise control over the unicode database version used.