Alden Bates (abates) wrote,
Alden Bates
abates

Perl help needed

Here's my problem. I have a plugin I've written for Movable Type, which takes the page MT generates and splits it up into a bunch of subpages. The problem is that in doing this all the UTF8 characters get boned and converted into US-ASCII or something.

For instance, ß, which in UTF8 is 0xC39F, turns into 0xDF.

How do I get it to stop doing this? For the short term, I have inserted calls to utf8::encode from the UTF8 module before I write the string out, to convert them back to UTF8, however I'd really like to fix the thing so it doesn't convert the strings to US-ASCII in the first place.

From what I can tell, Perl does something to mark strings as UTF8 somehow, so I should maybe do something along those lines? I don't know.
Tags: programming is a bastard
Subscribe

  • TNG: The Best of Both Worlds, Part I

    The Best of Both Worlds, Part I: The Borg are coming! The Borg are coming! The Enterprise is answering a distress call from a Federation colony.…

  • TNG: Transfigurations

    Transfigurations: A strange alien with the power to heal is rescued by the Enterprise crew. The Enterprise is hanging out in an uncharted star…

  • TNG: Ménage à Troi

    Ménage à Troi: Ferengi kidnap Riker, Troi and Troi's mum. Ferengi and Troi's mother. Oh goody. Riker beats a Ferengi at 3D chess. Data…

  • Post a new comment

    Error

    Comments allowed for friends only

    Anonymous comments are disabled in this journal

    default userpic

    Your reply will be screened

    Your IP address will be recorded 

  • 6 comments