RE: [Zope] How do I index documents containing accents with ZCata log?
-----Original Message----- From: Farzad Farid [mailto:farzy@via.ecp.fr] Sent: Thursday, January 06, 2000 9:03 AM To: Johan Carlsson Cc: zope@zope.org Subject: Re: [Zope] How do I index documents containing accents with ZCatalog?
Hi,
I got the answer by reading another mail on the same subject. I was using Zope 2.0.1 which has no locale support.
I installed Zope 2.1.2 for testing and now I can index accents (ISO-8859-1) correctly. You just have to start z2.py with the option "-L <locale>", "-L fr_FR" in my case, and the indexing of accents now works in ZCatalog.
Now I have to figure out how to search on partial words :)
Partial searching is not yet implimented, I am actually working on that as we speak. It will be a later Zope feature. -Michel
On Thu, Jan 06, 2000, Michel Pelletier wrote:
Now I have to figure out how to search on partial words :)
Partial searching is not yet implimented, I am actually working on that as we speak. It will be a later Zope feature.
Does this mean that the ZCatalog search engine is still missing some of the functionnalities you can find on a search engine like htdig? Another locale-related important feature is the ability to do searches ignoring the difference between accented and non accented letters. Suppose a document contains the word "édition", by typing "edition" I should be able to find the document, and vice versa. I tried and this feature does not seem to work right now. And is the ZCatalog implementation scalable? What happens if I try to index and search on a site containing hundreds of thousands of documents? Are the programs optimized not to use hundreds of Megs of memory? I've had a bad experience with swish++ which didn't scale when indexing 500000 text documents, it tried to use as much memory as it could allocate... On the other side htdig is well optimized from this point of view. Regards -- Farzad FARID <farzy@via.ecp.fr> Ingénieur Informatique Libre Alcôve - http://www.alcove.fr/
participants (2)
-
Farzad Farid -
Michel Pelletier