28 Nov
2001
28 Nov
'01
2:27 p.m.
Matt Hamilton wrote:
I would like in on that too :) About a year or so ago I was working on a full-text indexing system for indexing several gigabytes of text (mailing list archives). Most of it was written in C and uses quite a lot of cool algorithms from various information retrieval papers and books. I have been hoping to have the time to take parts of it and work it into the new PluginIndex architecture. The existing code uses BerkeleyDB files to hold the index structures, but I would like to use ZODB instead to give it a bit more modularity.
Hi Matt, Are any of these algorithms publicly available? I'd be _very_ interested in them :-) Chris