[Zope] Indexing: ZopeSplitter and numbers

Andreas Jung andreas@andreas-jung.com
Tue, 13 Nov 2001 07:05:27 -0500


The answer is - as always - in the sources ;-) The splitting algorithm is
pretty dumb. Roughly spoken it splits the text in words but not into
numbers.
To test the splitter try this:

from ZopeSplitter import ZopeSplitter
print list(ZopeSplitter('abc 123 t353 nmj'))

gives ['abc', 't353', 'nmj']


Andreas
----- Original Message -----
From: "Richard Jones" <richard@bizarsoftware.com.au>
To: <zope@zope.org>
Sent: Monday, November 12, 2001 23:45
Subject: [Zope] Indexing: ZopeSplitter and numbers


> We'd like to be able to have numeric-only fields that are searchable using
> TextIndex (eg. ISBN, telephone numbers, post codes, ...). We're just
> wondering what the logic is behind ZopeSplitter (and ISO_8859_1) rejecting
> words that only consist of numbers.
>
>
>     Richard
>
> _______________________________________________
> Zope maillist  -  Zope@zope.org
> http://lists.zope.org/mailman/listinfo/zope
> **   No cross posts or HTML encoding!  **
> (Related lists -
>  http://lists.zope.org/mailman/listinfo/zope-announce
>  http://lists.zope.org/mailman/listinfo/zope-dev )
>