jed-users mailing list


Re: Jed and utf-8... a pre-pre-pre-plea :-)

Subject: Re: Jed and utf-8... a pre-pre-pre-plea :-)
From: "John E. Davis" <davis@xxxxxxxxxxxxx>
Date: Tue, 17 Jun 2003 19:06:13 -0400

Malcolm Dew-Jones <yf110@xxxxxxxxxxxxxx> wrote:
>So, if the original files included a sequence of bytes that did not
>correctly represent characters, then those bytes should be converted into
>a value in the Private Use Area.  A simple algorithm for this would be
>(lowest-possible-PUA-value + byte value).  That value is then handled as a
>regular Unicode character.
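
The mapping Malcolm describes could be sketched roughly as follows. This
is an illustration in Python, not code from Jed or S-Lang. It assumes
that "lowest-possible-PUA-value" means U+E000 (the start of the BMP
Private Use Area); the handler name "pua-escape" and both function names
are made up for the example.

```python
import codecs

PUA_BASE = 0xE000  # assumption: lowest code point of the BMP Private Use Area


def _to_pua(exc):
    """Error handler: replace each undecodable byte with PUA_BASE + byte."""
    if not isinstance(exc, UnicodeDecodeError):
        raise exc
    bad = exc.object[exc.start:exc.end]
    return "".join(chr(PUA_BASE + b) for b in bad), exc.end


# "pua-escape" is a hypothetical name chosen for this sketch.
codecs.register_error("pua-escape", _to_pua)


def decode_with_pua(data: bytes) -> str:
    """Decode UTF-8, turning invalid bytes into PUA characters."""
    return data.decode("utf-8", errors="pua-escape")


def encode_with_pua(text: str) -> bytes:
    """Inverse of decode_with_pua: PUA-escaped characters become raw bytes."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        # Invalid UTF-8 bytes are always >= 0x80, so only this range is used.
        if PUA_BASE + 0x80 <= cp <= PUA_BASE + 0xFF:
            out.append(cp - PUA_BASE)
        else:
            out += ch.encode("utf-8")
    return bytes(out)


raw = b"ok \xff\xfe caf\xc3\xa9"
text = decode_with_pua(raw)
restored = encode_with_pua(text)
```

One caveat with this scheme: a file that legitimately contains characters
in U+E080..U+E0FF would not round-trip, because they would be written
back as single raw bytes. Python's built-in "surrogateescape" handler
avoids that collision by mapping to lone surrogates instead of the PUA.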

The Unicode support in S-Lang allows for invalid sequences.  Each byte
in the sequence displays as <XX> where XX is the hex value of the
byte.  Those bytes will be written back to disk as-is.  In light of
this, is there anything to gain by encoding such byte sequences as
a (much longer) sequence of encoded PUA+bytevalue wide characters?
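
To make the comparison concrete, here is a rough Python sketch of the
behaviour described above: the buffer keeps the original bytes, and only
the display form shows each invalid byte as <XX>. It is not S-Lang's
actual implementation. The handler name "angle-hex", the function name,
the uppercase hex digits, and the U+E000 base used for the size
comparison are all assumptions made for the example.

```python
import codecs


def _angle_hex(exc):
    """Error handler: render each undecodable byte as <XX> for display."""
    if not isinstance(exc, UnicodeDecodeError):
        raise exc
    bad = exc.object[exc.start:exc.end]
    return "".join(f"<{b:02X}>" for b in bad), exc.end


# "angle-hex" is a hypothetical name chosen for this sketch.
codecs.register_error("angle-hex", _angle_hex)


def display_form(buffer: bytes) -> str:
    """What the user would see; the buffer itself is left untouched."""
    return buffer.decode("utf-8", errors="angle-hex")


buffer = b"na\xefve \xc3\xa9t\xe9"   # two stray Latin-1 bytes, one valid UTF-8 char
shown = display_form(buffer)         # invalid bytes appear as <EF> and <E9>
written = bytes(buffer)              # saving writes the stored bytes unchanged

# Storage cost of one invalid byte under each scheme (UTF-8 buffer assumed):
raw_cost = 1                                         # byte kept as-is
pua_cost = len(chr(0xE000 + 0xEF).encode("utf-8"))   # PUA+bytevalue character
```

Under these assumptions every invalid byte costs three bytes once it is
re-encoded as a PUA character, versus one byte when it is stored raw.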

As far as PUA values go, I was thinking about using such private
characters as a sort of combining character to allow an individual
character to have a color attribute, or perhaps to indicate the
start/end of a color sequence.

Thanks,
--John


References:
- Re: Jed and utf-8... a pre-pre-pre-plea :-)
  - From: Malcolm Dew-Jones
