- Subject: Re: UTF-8 and Regular Expressions
- From: Jörg Sommer <joerg@xxxxxxxxxxxx>
- Date: Mon, 16 Apr 2007 08:54:37 +0000 (UTC)
Hi John,
"John E. Davis" <davis@xxxxxxxxxxxxx> wrote:
> For those of you that use jed with UTF-8 encoded text, has the lack of
> true UTF-8 support by the regular expression functions been much of an
> impediment?
No, I had no problems or better said, I stumbled over none. But I found
another problem that might be related to this. Try to match the
following line with Test.*x (Menu->Search->Regexp Search). You see get
also the string “ ab” marked as match, which is confusing.
Test äöü x ab
> The reason I ask is that slang 3.0 will use PCRE as its regular
> expression library. In anticipation of this, I plan to integrate PCRE
> with jed in the near future. In fact, I already have a version that
> uses PCRE.
Cool. I really need it for my latex mode.
> My inclination is that if the lack of UTF-8 support by the current
> regular expression engine is not much of a problem, then I think that
> by default, regular expressions will be compiled using byte-semantics,
> independent of whether or not jed is running in UTF-8 mode.
If it really has no implact than you might do so. But can you add a
switch to configure to easily change it on compile time?
Bye, Jörg.
--
Was der Bauer nicht kennt, das frisst er nicht. Würde der Städter kennen,
was er frisst, er würde umgehend Bauer werden.
Oliver Hassencamp
--------------------------
To unsubscribe send email to <jed-users-request@xxxxxxxxxxx> with
the word "unsubscribe" in the message body.
Need help? Email <jed-users-owner@xxxxxxxxxxx>.
[2007 date index]
[2007 thread index]
[Thread Prev] [Thread Next]
[Date Prev] [Date Next]