[ZODB-Dev] BTree pickle size

Roché Compaan roche at upfrontsystems.co.za
Sun Aug 24 02:55:40 EDT 2008


On Sun, 2008-08-24 at 08:08 +0200, dieter at handshake.de wrote:
> Tres Seaver wrote at 2008-8-22 16:45 -0400:
> > ...
> >I recall a pre-Zope (for me, 10 years ago) rule of thumb that text
> >indexing imposed an order of magnitude of overhead on the actual corpus,
> >with improvements possible only via batching or post-processing /
> >compresstion (incremental indexing is worst-case).
> 
> And this is especially true for indexes supporting a term frequency
> based ranking and which uses "IISet" at places where "IITreeSet" were
> more appropriate.
> 
> With "TextIndexNG3", one can get rid of the overhead of
> term frequency based ranking (in case one does not need it)
> 
> Using "AdvancedQuery" (and parsing the text subqueries oneself),
> one can use a "Managable SimpleTextIndex" which
> tries very hard to be as efficient as possible for large data sets
> (and does not support term frequency based ranking).

Thanks for the feedback. I'll re-run the tests without any text indexes,
as well as run it with other implementations such as TextIndexNG3 and
SimpleTextIndex and compare the results.

-- 
Roché Compaan
Upfront Systems                   http://www.upfrontsystems.co.za



More information about the ZODB-Dev mailing list