17 Jun 2002, 1:08 p.m.
--On Monday, June 17, 2002 17:03 +0400 Oleg Broytmann <phd@phd.pp.ru> wrote:
What about non-ISO-8859 languages? How can I create normalization rules if my language does not have any mapping to the Latin alphabet?
In the current implementation, normalizers can be specified through a text file. Inside the file you can declare the language and the encoding used, e.g.

    # german normalizer
    # $Id: de.txt,v 1.2.2.1 2002/06/13 12:50:08 ajung Exp $
    # language = german
    # encoding = iso-8859-1
    Ä Ae
    Ö Oe
    Ü Ue
    ä ae
    ö oe
    ü ue
    ß ss

When the file is parsed, every rule is translated to Unicode using the specified encoding.

-aj
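To illustrate the parsing step described above, here is a minimal sketch in Python. It is not the actual implementation: the names `parse_normalizer` and `normalize` are made up for this example, and it assumes that the `language` and `encoding` declarations may appear with or without a leading `#`, that the file uses an ASCII-compatible encoding (such as iso-8859-1 or koi8-r), and that each rule is a source string and a replacement separated by whitespace. Falling back to ASCII when no encoding is declared is also an assumption.

```python
import re

# Hypothetical pattern for "language = ..." / "encoding = ..." declarations,
# accepted with or without a leading "#" (an assumption about the format).
_DECLARATION = re.compile(r"^#?\s*(language|encoding)\s*=\s*(\S+)\s*$")


def parse_normalizer(raw):
    """Parse the raw bytes of a normalizer file.

    Returns (language, encoding, rules), where rules maps a Unicode
    source string to its Unicode replacement.
    """
    language = None
    encoding = "ascii"  # assumed fallback when nothing is declared
    lines = raw.splitlines()

    # Pass 1: locate the declarations. They are plain ASCII, so decoding
    # as latin-1 (which accepts every byte) is enough to find them.
    for line in lines:
        match = _DECLARATION.match(line.decode("latin-1").strip())
        if match and match.group(1) == "language":
            language = match.group(2)
        elif match and match.group(1) == "encoding":
            encoding = match.group(2)

    # Pass 2: decode every line with the declared encoding and collect
    # the rules, skipping blank lines and "#" lines.
    rules = {}
    for number, line in enumerate(lines, start=1):
        text = line.decode(encoding).strip()
        if not text or text.startswith("#") or _DECLARATION.match(text):
            continue
        parts = text.split()
        if len(parts) != 2:
            raise ValueError("line %d: expected 'source replacement'" % number)
        rules[parts[0]] = parts[1]
    return language, encoding, rules


def normalize(word, rules):
    """Apply single-character rules to a Unicode word."""
    return "".join(rules.get(char, char) for char in word)


SAMPLE = (
    "# german normalizer\n"
    "# language = german\n"
    "# encoding = iso-8859-1\n"
    "Ä Ae\nÖ Oe\nÜ Ue\nä ae\nö oe\nü ue\nß ss\n"
).encode("iso-8859-1")

language, encoding, rules = parse_normalizer(SAMPLE)
print(language, encoding, normalize("Straße", rules))
```

Because the rules are decoded to Unicode before use, the same approach should work for a non-Latin language: declare, say, `encoding = koi8-r` and write the rules in that encoding. The replacement side does not have to be Latin either; it can be any string the declared encoding can represent.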