We have a CI job to spot unwanted utf8 letters in #curl PRs as we have noticed that GitHub will gladly show the for example (identical) Cyrillic version of a letter next to the Latin version in a diff and it is yes, entirely impossible for a human to spot the diff. I mean the diff is shown, but the significance of it is not.
Changing just a single letter like that in a URL hostname opens up for a world of grief.
This entry was edited (3 months ago)
FOSS Unleashed
in reply to daniel:// stenberg:// • • •daniel:// stenberg://
in reply to FOSS Unleashed • • •Wolf480pl
in reply to daniel:// stenberg:// • • •daniel:// stenberg://
in reply to Wolf480pl • • •Will Orr
in reply to daniel:// stenberg:// • • •daniel:// stenberg://
in reply to Will Orr • • •Marcel Menzel
in reply to daniel:// stenberg:// • • •That's very interesting, as Forgejo displays a big warning on top if something like this is being detected, a button to escape them and is marking the line with a warning:
"This file contains Unicode characters that can be confused with other characters."
Really confused that this is missing in GitHub.
daniel:// stenberg://
in reply to Marcel Menzel • • •VencDvorak
in reply to daniel:// stenberg:// • • •nemo
in reply to daniel:// stenberg:// • • •GitHub recently added warning for Hidden Unicode characters.
Maybe they will get to homograph attacks next.
github.blog/changelog/2025-05-…
GitHub now provides a warning about hidden Unicode text - The GitHub Blog
The GitHub BlogEfi (nap pet) 🦊💤
in reply to daniel:// stenberg:// • • •daniel:// stenberg://
in reply to Efi (nap pet) 🦊💤 • • •Efi (nap pet) 🦊💤
in reply to daniel:// stenberg:// • • •daniel:// stenberg://
in reply to Efi (nap pet) 🦊💤 • • •Efi (nap pet) 🦊💤
in reply to daniel:// stenberg:// • • •Steve Holden
in reply to daniel:// stenberg:// • • •PEP 672 – Unicode-related Security Considerations for Python | peps.python.org
Python Enhancement Proposals (PEPs)dusoft
in reply to daniel:// stenberg:// • • •Brokar
in reply to daniel:// stenberg:// • • •That means that somebody actually sat down and browsed all the fonts to find the 2 characters which look exactly alike?
Just imagine these people would spend their time doing something productive for a change...
daniel:// stenberg://
in reply to Brokar • • •Unicode Utilities: Confusables
util.unicode.orgMustaque Ahmed
in reply to daniel:// stenberg:// • • •daniel:// stenberg://
in reply to Mustaque Ahmed • • •curl/.github/scripts/spacecheck.pl at master · curl/curl
GitHubMr. Lance E Sloan (IRL) 👤
in reply to daniel:// stenberg:// • • •daniel:// stenberg://
in reply to Mr. Lance E Sloan (IRL) 👤 • • •Elric
in reply to daniel:// stenberg:// • • •Gerard Braad
in reply to daniel:// stenberg:// • • •Tried something like: gist.github.com/gbraad/551eabc… for golang projects.
Seems our vendor folder with lots of k8s libs uses non-ASCII characters.
README.md
Gistkaiserkiwi
in reply to daniel:// stenberg:// • • •daniel:// stenberg://
in reply to kaiserkiwi • • •curl/.github/scripts/spacecheck.pl at master · curl/curl
GitHubvsz
in reply to daniel:// stenberg:// • • •daniel:// stenberg://
in reply to vsz • • •Grimmauld
in reply to daniel:// stenberg:// • • •daniel:// stenberg://
in reply to Grimmauld • • •Mx Autumn
in reply to daniel:// stenberg:// • • •GitHub - photogabble/php-confusable-homoglyphs: A PHP port of vhf/confusable_homoglyphs
GitHubLinuxUserGD
in reply to daniel:// stenberg:// • • •github.com/jirutka/setup-alpin…
Update README.adoc by LinuxUserGD · Pull Request #13 · jirutka/setup-alpine
GitHubWildRikku @ Gamescom
in reply to daniel:// stenberg:// • • •