How do I index documents containing accents with ZCatalog?
I was very disappointed on my first try with ZCatalog when I noticed that it cannot properly index documents containing European accented characters. The search engine I installed is of course unusable, because it cannot find a large number of words in the indexed documents. I am running Zope 2.0.1 and Python 1.5.2, and I installed ZCatalog by following the instructions in the ZCatalog howto.

How can I correct this problem?

Regards
-- Farzad FARID <farzy@via.ecp.fr> Ingénieur Informatique Libre Alcôve - http://www.alcove.fr/
Hi Farzad,

This looks serious. I am going to try this with Swedish characters, as soon as I have the time, to see if there is a problem for those too.

I suppose all your accented characters belong to ISO-8859-1?

As I understand it, Python itself is capable of handling ISO-8859-1, but I have experienced problems with Zope handling ISO-8859-1 correctly.

This needs to be fixed before Zope can be used seriously on any European web site. You might want to post your question to the mailing list of the ZIP project (ZIP - Zope Internationalization Project).

Best regards,
Johan Carlsson
_______________________________________________ Zope maillist - Zope@zope.org http://lists.zope.org/mailman/listinfo/zope ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope-dev )
In reply to Johan Carlsson's message of Thu, Jan 06, 2000.

Hi,

I found the answer by reading another mail on the same subject. I was using Zope 2.0.1, which has no locale support. I installed Zope 2.1.2 for testing, and now I can index accents (ISO-8859-1) correctly. You just have to start z2.py with the option "-L <locale>" ("-L fr_FR" in my case), and ZCatalog then indexes accented characters.

Now I have to figure out how to search on partial words :)

Regards
-- Farzad FARID <farzy@via.ecp.fr> Ingénieur Informatique Libre Alcôve - http://www.alcove.fr/
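Editor's note: the thread does not say why the locale option matters, so the following is only an illustrative sketch, not ZCatalog's actual code. It assumes that the indexer splits text into words using a notion of "letter" that depends on the locale, so that without a locale only ASCII letters count and accented words are cut into fragments. The names `ASCII_WORD`, `LATIN1_WORD` and `split_words` are hypothetical, and the example is written for a modern Python rather than Python 1.5.2.

```python
import re

# Assumption: with no locale set, only ASCII letters and digits
# are treated as word characters.
ASCII_WORD = re.compile(r"[A-Za-z0-9]+")

# Assumption: with an ISO-8859-1 locale such as fr_FR, the accented
# Latin-1 letters (0xC0-0xFF, minus the multiplication and division
# signs at 0xD7 and 0xF7) also count as word characters.
LATIN1_WORD = re.compile("[A-Za-z0-9\u00c0-\u00d6\u00d8-\u00f6\u00f8-\u00ff]+")


def split_words(text, pattern):
    """Return the lower-cased index terms found in text by pattern."""
    return [word.lower() for word in pattern.findall(text)]


text = "Ingénieur Informatique Libre Alcôve"

# Accented words are broken apart at each accented character.
ascii_terms = split_words(text, ASCII_WORD)

# Accented words survive as whole terms.
latin1_terms = split_words(text, LATIN1_WORD)

print(ascii_terms)
print(latin1_terms)
```

Under this assumption, a search for "ingénieur" can only match when the index was built with the second splitter, which is consistent with the behaviour reported above before and after starting z2.py with "-L fr_FR".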
Participants (2): Farzad Farid, Johan Carlsson