Alden Bates (abates) wrote,

Perl help needed

Here's my problem. I've written a plugin for Movable Type which takes the page MT generates and splits it up into a bunch of subpages. The trouble is that in doing this, all the UTF-8 characters get boned and converted into US-ASCII or something.

For instance, ß, which in UTF-8 is the two bytes 0xC3 0x9F, turns into the single byte 0xDF.
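
To make the byte values concrete, here is a minimal sketch using the core Encode module. It assumes (and this is only a guess about what the plugin is doing) that the character is ending up written out as Latin-1 rather than UTF-8; `hex_bytes` is just a helper made up for this example.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Hypothetical helper: show each byte of a string as 0xNN.
sub hex_bytes {
    my ($bytes) = @_;
    return join ' ', map { sprintf('0x%02X', ord($_)) } split(//, $bytes);
}

# The two UTF-8 bytes for ß, as they would arrive from a file.
my $utf8_bytes = "\xC3\x9F";

# Decoding turns the two bytes into one character, U+00DF.
my $char = decode('UTF-8', $utf8_bytes);

# Encoding that character as Latin-1 gives the single byte 0xDF.
my $latin1 = encode('ISO-8859-1', $char);

# Encoding it as UTF-8 gives the original two bytes back.
my $utf8_again = encode('UTF-8', $char);

print "UTF-8 in:    ", hex_bytes($utf8_bytes), "\n";
print "as Latin-1:  ", hex_bytes($latin1),     "\n";
print "as UTF-8:    ", hex_bytes($utf8_again), "\n";
```

If that guess is right, the 0xDF is not really US-ASCII; it is the same character written in a one-byte encoding.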

How do I get it to stop doing this? For the short term, I have inserted calls to utf8::encode (one of Perl's built-in utf8:: functions) just before I write the string out, to convert it back to UTF-8. However, I'd really like to fix the thing so it doesn't mangle the strings in the first place.

From what I can tell, Perl marks strings as UTF-8 internally somehow, so maybe I should do something along those lines? I don't know.
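
A rough sketch of what "along those lines" might look like, assuming the usual Perl approach of decoding bytes into characters once on the way in and encoding once on the way out. `$page_bytes` and `@subpages` are stand-ins invented for the example, not anything from Movable Type or the actual plugin.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Hypothetical stand-in for the page the plugin receives: "Straße" as UTF-8 bytes.
my $page_bytes = "Stra\xC3\x9Fe";

# Decode once on the way in, so the string holds characters rather than bytes.
my $page = decode('UTF-8', $page_bytes);

# utf8::is_utf8 reports the internal flag Perl keeps on the string.
my $flagged = utf8::is_utf8($page) ? 1 : 0;

# The real plugin would split $page into subpages here; this is a placeholder.
my @subpages = ($page);

# Encode once on the way out, just before writing each subpage.
my @out = map { encode('UTF-8', $_) } @subpages;

printf "characters: %d, bytes in: %d, bytes out: %d, flag: %d\n",
    length($page), length($page_bytes), length($out[0]), $flagged;
```

The idea is that everything between the decode and the encode works on characters, so nothing in the middle needs to know about UTF-8 at all.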
Tags: programming is a bastard
