UTF-8

8:34pm, 29th August 2003

I seem to be having tremendous difficulty with my Unicode lifestyle. What does this say?: Portugues.

Update: This problem arose as I tried to write a comments system for this blog. The mechanics of accepting form input, processing it, and appending it to a page are simple enough, but when doing stuff like this, there’s one nasty, ugly problem that always rears its head: Unicode.

A post on this blog goes through six layers: keyboard, terminal, Vim, Python, MySQL, browser. Each of them seems to have a different idea of how the text should be encoded. The proper solution is to use UTF-8 throughout, but that’s where my problem is: only the latest releases of these programs support UTF-8, and upgrading all of them to mutually compatible versions is currently beyond me.
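
To make the failure concrete, here is a minimal sketch of what happens when two neighbouring layers disagree about the encoding. It is written for present-day Python 3, which is an assumption (it is not the setup this blog runs on), and the variable names are purely illustrative. One layer writes UTF-8 bytes, and the next one reads them as Latin-1.

```python
# Illustrative only: one layer encodes as UTF-8, the next decodes as Latin-1.
text = "Portugu\u00eas"  # "Português", with e-circumflex (U+00EA)

# Layer A (say, the editor) stores the text as UTF-8 bytes.
utf8_bytes = text.encode("utf-8")  # the e-circumflex becomes two bytes: C3 AA

# Layer B (say, the database or the browser) wrongly assumes Latin-1,
# so each of those two bytes turns into its own character.
garbled = utf8_bytes.decode("latin-1")  # "PortuguÃªs"

# If every layer agrees on UTF-8, the text survives the round trip.
round_trip = utf8_bytes.decode("utf-8")

# The plain-ASCII fallback: anything outside ASCII is replaced with "?".
ascii_only = text.encode("ascii", errors="replace").decode("ascii")  # "Portugu?s"
```

With six layers there are five hand-offs like this one, and a single mismatch at any of them is enough to mangle the text.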

Maybe in a couple of years I’ll have everything sorted, but for now, it’s plain old ASCII, and no comments.

