[Zope] ZCatalog searching questions
Stuart 'Zen' Bishop
zen@cs.rmit.edu.au
Fri, 1 Oct 1999 15:55:42 +1000 (EST)
On Thu, 30 Sep 1999, Michel Pelletier wrote:
> Some people think, 'why not use re (the Python regex module)?' Because
> searching for something like '*ing' would require iterating over all the
> keys, and a linear search like this could take several orders of magnitude
> more time than a non-regex search.
But this is not a problem for many of us who aren't trying to index libraries.
Perhaps ZCatalog could offer a way of specifying 'extended' searches (while
still allowing normal, quick searches), implemented as a linear regex search
through the index. And if this were too slow for some people, it
would provide the hook they need to replace Catalog with something that
met their requirements:
GlimpseCatalog - indexes are dumped to text files that are indexed using
                 Glimpse. Extended syntax would be the fuzzy regexp based
                 matches used by Glimpse (as implemented by agrep) for
                 those sites whose visitors can't spell :-)
RDBCatalog     - indexes are stored in a backend RDBMS, and substring matches
                 are done using SQL.
ConTextCatalog - interfaces to Oracle and the ConText option for people
                 who require funky soundex matches and the various weird
                 options it provides.
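To make the hook idea a bit more concrete, here is a rough sketch of what I
have in mind. It is not ZCatalog's actual API: SimpleCatalog, SQLCatalog,
search() and extended_search() are all names made up for illustration, the
index is a toy dictionary, and sqlite3 just stands in for whatever RDBMS
you would really use. The point is only that the quick search stays a
dictionary lookup, the default extended search is a linear regex scan over
the keys, and a subclass can swap in its own backend and its own pattern
syntax.

```python
import re
import sqlite3


class SimpleCatalog:
    """Toy inverted index mapping word -> set of document ids.

    Hypothetical class for illustration only; not ZCatalog's API.
    """

    def __init__(self):
        self.index = {}

    def add(self, doc_id, text):
        for word in text.lower().split():
            self.index.setdefault(word, set()).add(doc_id)

    def search(self, word):
        # Normal, quick search: a single dictionary lookup.
        return set(self.index.get(word.lower(), set()))

    def extended_search(self, pattern):
        # Extended search: linear regex scan over every key in the index.
        # This is the hook a subclass would override with a faster backend.
        rx = re.compile(pattern)
        hits = set()
        for word, doc_ids in self.index.items():
            if rx.fullmatch(word):
                hits |= doc_ids
        return hits


class SQLCatalog(SimpleCatalog):
    """Same interface, but extended searches are SQL LIKE matches.

    Sketch of the RDBCatalog idea, with an in-memory SQLite database
    standing in for a real backend RDBMS (an assumption of this example).
    """

    def __init__(self):
        super().__init__()
        self.db = sqlite3.connect(":memory:")
        self.db.execute("CREATE TABLE words (word TEXT, doc_id INTEGER)")

    def add(self, doc_id, text):
        super().add(doc_id, text)
        rows = [(word, doc_id) for word in text.lower().split()]
        self.db.executemany("INSERT INTO words VALUES (?, ?)", rows)

    def extended_search(self, pattern):
        # For this backend the pattern is a SQL LIKE pattern such as '%ing'.
        cur = self.db.execute(
            "SELECT DISTINCT doc_id FROM words WHERE word LIKE ?", (pattern,)
        )
        return {row[0] for row in cur}


def load_sample(catalog):
    # Three tiny sample documents, made up for this example.
    catalog.add(1, "searching the catalog")
    catalog.add(2, "indexing libraries")
    catalog.add(3, "quick lookup")
    return catalog


plain = load_sample(SimpleCatalog())
sql = load_sample(SQLCatalog())

quick_hits = plain.search("catalog")           # dictionary lookup
regex_hits = plain.extended_search(r".*ing")   # linear scan over all keys
like_hits = sql.extended_search("%ing")        # delegated to the database
```

Callers that only ever use search() pay nothing for the extra method, and
the people who need something faster or fuzzier only have to replace
extended_search().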
And Ultraseek uses Python, doesn't it? Might be a trivial mating.
___
// Zen (alias Stuart Bishop) Work: zen@cs.rmit.edu.au
// E N Senior Systems Alchemist Play: zen@shangri-la.dropbear.id.au
//__ Computer Science, RMIT WWW: http://www.cs.rmit.edu.au/~zen