John E. Davis spake unto us the following wisdom: > Ethan Blanton <eblanton@xxxxxxxxxxxx> wrote: > >equations. ;-) However, I cannot successfully _input_ UTF-8. If I run > >jed in a utf-8 capable terminal, with a utf-8 locale (I have tried set- > >ting any and all of LC_CTYPE, LANG, and LC_ALL to both en_US.utf8 and > >ja_JP.utf8), when I enter non-latin text it is displayed in jed as the > >constituent Unicode characters expressed in ISO-Latin-1 -- I realize > >this is somewhat hard to picture, so I've attached a screenshot demon- > >strating the problem. (Note that some of the contituent unicode charac- > > How are you entering the non-latin text? If you run jed as: I enter Russian via standard XKB input (that is, I change my keymap to a Russian keymap), and Japanese via XIM (specifically kinput2). The ter- minal I'm using to test is gnome-terminal 2.6.0. For the record, if anyone knows of a capable and light UTF-8 terminal (ala rxvt), I'd love to hear about it. (I'm an rxvt guy, normally.) > jed -l keycode -f keycode > > what does it reveal when entering such characters? Finally, does > putting Well, this is interesting ... Before changing anything, the Russian character 'r' (looks like a latin 'p') yields "\d209\e@"; after setting DEC_8BIT_HACK = 0, it yields "\d209\d128". This fixes Russian input, but Japanese remains broken. The keycodes as reported by keycode.sl for the Japanese hiragana 'ni' change from "\d227\eA\df171" to "\d227129171" (the latter being the correct UTF-8 sequence, but the ISO-Latin-1 equiv- alents of these (I presume) are inserted. The actual byte sequence inserted is "\d195\d163\d194\d129\d171". I hope that's helpful... Ethan -- The laws that forbid the carrying of arms are laws [that have no remedy for evils]. They disarm only those who are neither inclined nor determined to commit crimes. -- Cesare Beccaria, "On Crimes and Punishments", 1764
Attachment:
pgprp0LC2RY4C.pgp
Description: PGP signature