- Subject: Re: Jed and utf-8... a pre-pre-pre-plea :-)
- From: Romano Giannetti <romano@xxxxxxxxxxxxxxxx>
- Date: Wed, 18 Jun 2003 10:34:46 +0200
On Tue, Jun 17, 2003 at 12:56:51PM -0400, John E. Davis wrote:
>
> Suppose that the file on disk uses an ISO-LATIN character set. In
> particular, consider the upside-down exclamation point character (¡)
> given by character code 161. When a file containing this character is
> read under a UTF-8 locale, it will be displayed as the 4 character
> sequence <A1>, since by itself it represents an illegal UTF-8 encoded
> sequence. A command such as
>
> set_charset ("iso-latin-1");
>
> will convert an illegal sequence such 161 to its two character UTF-8
> equivalent 0xC2 0xA1. That is, specifying a character set could
> actually cause the bytes in a buffer to change.
I see. This is my problem... suppose I have an environment based on an utf-8
locale, and I want to edit a LaTeX file with jed. Unfortunately, LaTeX does
not understand utf-8 (yet... and I suspect it's quite a complex thing to
implement(*)). So the .tex file should stay on disk as a latin-1 (or latin-9,
whatever) file. But if I am editing it, and type "¡", I imagine that ---
being the locale utf-8 --- jed would see the string 0xC2 0xA1. And put that
into the buffer. Now the problem is --- when writing the file to disk, this
should be written as 0xA1. If I have understood well, this is just the
designed behavior, so that it's fine --- I think.
Romano
(*) yes, I know there is Omega and Lambda. But I am not so sure about them...
--
Romano Giannetti - Univ. Pontificia Comillas (Madrid, Spain)
Electronic Engineer - phone +34 915 422 800 ext 2416 fax +34 915 411 132
--------------------------
To unsubscribe send email to <jed-users-request@xxxxxxxxxxx> with
the word "unsubscribe" in the message body.
Need help? Email <jed-users-owner@xxxxxxxxxxx>.
[2003 date index]
[2003 thread index]
[Thread Prev] [Thread Next]
[Date Prev] [Date Next]