[Zope-Annce] [ANN] TextIndexNG 1.09 released
Andreas Jung
andreas@andreas-jung.com
Sat, 08 Mar 2003 14:49:06 +0100
I am pleased to announce the release of
TextIndexNG 1.09
TextIndexNG is a new pluggable index for the ZCatalog and is the most
feature-complete solution for fulltext indexing under Zope. TextIndexNG
enhances the fulltext indexing capabilities of Zope by providing the
following features:
* support for document converters (HTML, PDF, WinWord, PowerPoint,
PostScript). Custom converters can be added easily.
* stemmer support for 12 languages
* optional support for right truncation
* similarity search (soundex and metaphone, English only)
* NEAR search
* phrase search
* pluggable query parsers (two parsers included)
* stop words support
* new test tab for interactive testing
* faster than Zope's old TextIndex
* full unicode support (new)
* normalization support (new)
* new similarity algorithm: double metaphone (new)
* new TXNGSplitter
* new vocabulary browser
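
To illustrate the idea behind the similarity search mentioned above, here is a
minimal sketch of the classic American Soundex code in plain Python. It is NOT
the code used inside TextIndexNG; the function name soundex() and the CODES
table are purely illustrative assumptions. Words that sound alike in English
map to the same four-character key, so an index can match on the key instead
of the exact spelling.

```python
# Illustrative only: a plain-Python American Soundex, not TextIndexNG's code.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word):
    """Return the 4-character Soundex key of an English word."""
    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        return ""
    first = letters[0]
    result = first.upper()
    prev = CODES.get(first, "")
    for ch in letters[1:]:
        if ch in "hw":
            # h and w do not separate letters that share a code
            continue
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
        # vowels have no code, so they reset prev and allow a repeat
        prev = code
    # pad with zeros and cut to the fixed key length
    return (result + "000")[:4]


print(soundex("Robert"), soundex("Rupert"))
```

Both names yield the same key, which is why a search for one can also find
the other.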
Changes since 1.08:
* apply_index() now uses the default encoding chosen for a TextIndexNG
instance instead of iso-8859-15 (thanks to Myrosav Opyr)
* lexicons and storages now use IITreeSets instead of IISets. This
should reduce memory consumption of TextIndexNG instances. Existing
instances running TextIndexNG should be reindexed.
* getEntryForObject() now returns a sequence of all indexed words instead
of "XXX distinct words" (standard behaviour of all Zope indexes).
* words removed from a reindexed document were not properly removed
from the word-doc storage (meaning you could search for removed words
and get hits although the words were no longer in the document). Thanks
to Yuppie, Tres Seavers and Dieter Maurer for pointing out this problem.
It is highly recommended to reindex directly after upgrading.
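
The stale-word problem described in the last item can be sketched with a toy
inverted index. This is an assumption-laden illustration in plain Python
(the class TinyIndex and its methods are hypothetical and are not part of
TextIndexNG): when a document is reindexed, words that are no longer in it
must also be removed from the word-to-document mapping, otherwise searches
for them still return the document.

```python
# Illustrative only: a toy inverted index, not TextIndexNG's storage code.
class TinyIndex:
    def __init__(self):
        self.word_to_docs = {}   # word -> set of document ids
        self.doc_to_words = {}   # document id -> set of words

    def index(self, doc_id, text):
        """Index or reindex a document, dropping words it no longer has."""
        new_words = set(text.lower().split())
        old_words = self.doc_to_words.get(doc_id, set())
        # remove the document from words that disappeared from its text
        for word in old_words - new_words:
            docs = self.word_to_docs[word]
            docs.discard(doc_id)
            if not docs:
                del self.word_to_docs[word]
        # add the document to words that are new in its text
        for word in new_words - old_words:
            self.word_to_docs.setdefault(word, set()).add(doc_id)
        self.doc_to_words[doc_id] = new_words

    def search(self, word):
        """Return the sorted ids of documents containing the word."""
        return sorted(self.word_to_docs.get(word.lower(), ()))


idx = TinyIndex()
idx.index(1, "zope catalog index")
idx.index(1, "zope index")        # "catalog" was removed from document 1
print(idx.search("catalog"), idx.search("zope"))
```

After the second index() call a search for the removed word no longer hits
document 1, while the remaining words still do.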
Download: http://sourceforge.net/projects/textindexng
--
---------------------------------------------------------------------
- Andreas Jung http://www.andreas-jung.com -
- EMail: andreas at andreas-jung.com -
- "Life is too short to (re)write parsers" -
---------------------------------------------------------------------