Skip to main content


There's a lot of software still in use that fails to handle correctly text that isn't encoded using the 7-bit ASCII character set.

Latin1 was an 8-bit character set, based on a VT100 terminal character set from 1983.

Work on Unicode started in about 1989.

UTF8 was formally introduce in 1993.

Only the very newest of software can be expected to understand these newfangled text representations.

#rant

#rant