[Zope] zope, latin-1 and accented words
Andreas Jung
lists at andreas-jung.com
Tue Jun 14 11:13:12 EDT 2005
Use TextIndexNG...it is better suited for such purposes.
-aj
--On 14. Juni 2005 16:54:19 +0200 Yuri <yurj at alfa.it> wrote:
> How could I can tell the Splitter of ZCText intedex to not split words as
> "aaaèbbb" in "aaa" and "bbb"?
>
> I would like to tell zope that è,à and so on are alphanumeric
> letters... In Splitter.c I have:
>
> class Splitter:
>
> import re
> rx = re.compile(r"(?L)\w+")
>
> ?L match "as the locale", but I have multilingual latin-1 contents... \w
> would match only [a..z,A..Z]!
>
> TIA
>
> P.S. I've written a small Class for the ZCTextindex pipeline that
> convert all the accented characters in non accented ones, so I can index
> "perchè" as "perche". It would work only if I can solve this splitter
> problem...
> _______________________________________________
> Zope maillist - Zope at zope.org
> http://mail.zope.org/mailman/listinfo/zope
> ** No cross posts or HTML encoding! **
> (Related lists - http://mail.zope.org/mailman/listinfo/zope-announce
> http://mail.zope.org/mailman/listinfo/zope-dev )
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 186 bytes
Desc: not available
Url : http://mail.zope.org/pipermail/zope/attachments/20050614/30b8f3df/attachment.bin
More information about the Zope
mailing list