I am pleased to announce the release of TextIndexNG 1.09 TextIndexNG is a new pluggable index for the ZCatalog and is the most feature complete solution for fulltext inexing under Zope. TextIndexNG enhances the fulltext indexing capabilities of Zope by providing the following features: * support for document converters (HTML, PDF, WinWord, PowerPoint, Postscript). Custom converters can be easily added * stemmer support for 12 languages * optional support for right truncation * similarity search (soundex, metaphone support) (for English) * NEAR search * phrase search * pluggable query parsers (two parsers included) * stop words support * new test tab for interactive testing * faster than Zopes old TextIndex * full unicode support (new) * normalization support (new) * new similarity algorithm: double metaphone (new) * new TXNGSplitter * new vocabulary browser Changes since 1.08: * apply_index() now uses the default encoding choosen for a TextIndexNG instead of iso-8859-15 (thanks to Myrosav Opyr) * lexicons and storages now used IITreeSets? instead of IISet?. This should reduce memory consumption of TextIndexNG instances. Existing instances running TextIndexNG should be reindexed. * getEntryForObject() now returns a sequence of all index worde instead of "XXX distinct words" (standard behaviour of all Zope indexes). * words removed from a reindexed documents were not properly removed for the word-doc storage (means you could search for removed words and got hits although the words was no longer inside the document). Thanks to Yuppie, Tres Seavers and Dieter Maurer for pointing out this problem. It is highly recommended to reindex directly after the upgrading Download: http://sourceforge.net/projects/textindexng -- --------------------------------------------------------------------- - Andreas Jung http://www.andreas-jung.com - - EMail: andreas at andreas-jung.com - - "Life is too short to (re)write parsers" - ---------------------------------------------------------------------