[ragel-users] How to convert [#x2070-#x218F] to Ragel grammar?
Iñaki Baz Castillo
ibc at aliax.net
Sat Nov 21 02:13:20 UTC 2009
El Viernes, 20 de Noviembre de 2009, Григорий Фучеджи escribió:
> I think this would work
> (0x20 0x70..0xFF) | (0x21 0x00..0x8F)
> But I think there should be better solution
Finally I've understood that this conversion is not valid:
#x2070-#x218F != (0x20 0x70..0xFF) | (0x21 0x00..0x8F)
The former grammar are Unicode character representation in hexadecimal code
point. It represents the symbol ⁰ and in hex UTF-8 bytes it would be "0xE2
0x81 0xB0".
A conversor utility:
http://www.ltg.ed.ac.uk/~richard/utf-8.cgi?input=C0&mode=hex
Of course it would be great if Ragel wouud provide Unicode hex code point
grammar in the .rl files so I could do:
0x2070..0x218F
instead of having to convert it to lots of hexadecimal bytes ranges.
Regards.
--
Iñaki Baz Castillo <ibc at aliax.net>
More information about the ragel-users
mailing list