Remember: IDN is crazy. And just not just a little.
daniel.haxx.se/blog/2022/12/14…
IDN is crazy
IDN, International Domain Names, is the concept that lets us register and use international characters in domain names, and by international we of course mean characters outside of the ASCII range.daniel.haxx.se
Anopka
in reply to daniel:// stenberg:// • • •IDN-based phishing is the reason I turned of punycode translation in Firefox. So, whenever I see a URL beginning with "xn--" I know this is most likely a phishing attempt.
And if one knows how domain names work, one also knows there mostly is a "replacement" for special non-ASCII characters (for example in German we replace "ä" with "ae" if necessary).
Tutiluren
in reply to daniel:// stenberg:// • • •Nice read.
I see the use of curated library (white-list regexps, with some heuristics, perhaps context-aware) that can be shared among tools such as browsers or command-line tools, so that they refuse with a warning of a suspicious idn is used.
But I don't see the need to ban any 📯 emojis there.
If needed, the user shall of course be able to bypass such warning, but no short should be tolerated (--bypass-idn-check, not just -b).
Petr Menšík
in reply to daniel:// stenberg:// • • •daniel:// stenberg://
in reply to Petr Menšík • • •Petr Menšík
in reply to daniel:// stenberg:// • • •Jörg Knappen
in reply to daniel:// stenberg:// • • •Hey, and this does not include the shenanigans with right-to-left-override and its left-to-right counterpart.
Obligatory xkcd:
explainxkcd.com/wiki/index.php…
1137: RTL - explain xkcd
www.explainxkcd.commort
in reply to daniel:// stenberg:// • • •The really sad part? All this complexity, all this surface area for nasty bugs, all these opportunities for social engineering.. and they don't even work for their intended purpose!! Earlier this year, I needed a new domain, with my last name in it — which contains an ø. I got one version with the "ø" and one with "o" instead just in case IDNs caused issues.
I've learned that virtually nothing supports IDNs. I've stopped using the "ø" version because "xn--blah-54a" showed up everywhere.
Patrick Mevzek
in reply to daniel:// stenberg:// • • •Stéphane Bortzmeyer
in reply to daniel:// stenberg:// • • •For once, you write wrong things. Just one: the "crazy" example you show is disallowed since IDN does not allow many of these characters: afnic.fr/en/observatory-and-re…
#IDN #Unicode
Can emojis be used in a domain name?
Stéphane Bortzmeyer (AFNIC)shellsharks
in reply to daniel:// stenberg:// • • •