UTF-8
8:34pm, 29th August 2003
I seem to be having tremendous difficulty with my Unicode lifestyle. What does this say?: Portugues.
Update: This problem arose as I tried to write a comments system for this blog. The mechanics of accepting form input, processing it, and appending it to a page are simple enough, but when doing stuff like this, there’s one nasty, ugly problem that always rears its head: Unicode.
A post on this blog goes through six layers: keyboard, terminal, Vim, Python, MySQL, browser. Each one of them seems to have a different idea of how to handle the encoding of the text. The proper solution is to use UTF-8 throughout, but that’s where my problem is. Only the latest releases of said software support UTF-8. Upgrading all of them to mutually compatible versions is currently beyond me.
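To make the layer mismatch concrete, here is a rough sketch of what can happen when two layers disagree about the encoding. It is written for a modern Python 3 interpreter, which is an assumption and not what this blog runs on. The sample word and all variable names are made up for illustration, and the "layers" are stand-ins, not the real terminal or database.

```python
# Illustrative sketch only: the sample text and the "layers" are assumptions,
# not this blog's actual code.
text = "Português"  # sample word with one non-ASCII character (ê)

# One layer (say, the editor) writes the text out as UTF-8 bytes.
utf8_bytes = text.encode("utf-8")

# A later layer wrongly assumes those bytes are Latin-1 and decodes them.
garbled = utf8_bytes.decode("latin-1")

# A layer that only understands ASCII cannot represent the character at all.
ascii_only = text.encode("ascii", errors="replace").decode("ascii")

# When every layer agrees on UTF-8, the round trip is lossless.
round_trip = utf8_bytes.decode("utf-8")

print(garbled)     # PortuguÃªs
print(ascii_only)  # Portugu?s
print(round_trip)  # Português
```

The two bytes that UTF-8 uses for "ê" turn into two separate characters under Latin-1, which is the classic symptom of one layer in the chain guessing wrong.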
Maybe in a couple of years I’ll have everything sorted, but for now, it’s plain old ASCII, and no comments.
