17 Jun
2002
17 Jun
'02
1:03 p.m.
On Mon, Jun 17, 2002 at 08:51:26AM -0400, Andreas Jung wrote:
- normalization
Normalization means the translation of special characters or a sequence of characters to a more simpler form, e.g. 'д' -> 'Ae', 'Д' -> 'ae', ╢ъ' -> 'ss' or a more radical reduction like 'д' -> 'A', 'Д' -> 'a', ╢ъ' -> 's'. Such a reduction allows more fault tolerant searching.
At the moment TextIndexNG supports only German and English.
What about non-iso8859 languages? How can I create normalization rules if my language does not have any mapping to latin alphabet? Oleg. -- Oleg Broytmann http://www.zope.org/Members/phd/ phd@phd.pp.ru Programmers don't die, they just GOSUB without RETURN.