RE: [Zope] Re: Zope version 2.3.2
The problem there is that when someone tries to actually search for a word sans an accent in this case, they wouldn't find a match. Somehow both cases would need to be indexed...
Sean
-----Original Message----- From: E. Seifert [mailto:e.seifert@gmx.net] Sent: Friday, May 18, 2001 10:48 AM To: zope@zope.org Subject: [Zope] Re: Zope version 2.3.2
Hi Jean-Louis,
JL BERLIET wrote:
The search has to return results related to the keyword, not only results that contain it. A basic feature for that is to ignore accentuation.
Here is a patch for "Splitter.c" that I'm using with French words to ignore accentuation (see the function "sans_accent" and add your Portuguese characters).
Jean-Louis BERLIET
Your patch to Splitter.c contains a mapping for letters that are also used as umlaut characters in German, which isn't correct. You would have to map the German "a to ae, "o to oe, "u to ue, and finally the sz ligature to ss.
Greetings Eric
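The German mapping Eric describes could be sketched as follows. This is only an illustration in modern Python, not the Splitter.c patch itself (which is C and is not reproduced in this thread); the names `GERMAN_FOLD` and `fold_german` are hypothetical.

```python
import unicodedata

# Hypothetical table (an assumption, not taken from the patch):
# German umlauts and the sz ligature mapped to ASCII digraphs.
GERMAN_FOLD = {
    "\u00e4": "ae",  # a-umlaut
    "\u00f6": "oe",  # o-umlaut
    "\u00fc": "ue",  # u-umlaut
    "\u00c4": "Ae",  # A-umlaut
    "\u00d6": "Oe",  # O-umlaut
    "\u00dc": "Ue",  # U-umlaut
    "\u00df": "ss",  # sz ligature
}

def fold_german(word):
    """Replace umlauts and the sz ligature with their digraphs."""
    # Normalize to NFC first so that "u" + combining diaeresis
    # is treated the same as the precomposed character.
    composed = unicodedata.normalize("NFC", word)
    return "".join(GERMAN_FOLD.get(ch, ch) for ch in composed)

print(fold_german("Größe"))   # Groesse
print(fold_german("Müller"))  # Mueller
```

A plain accent-stripping table would turn "Müller" into "Muller" instead, which is the behaviour Eric objects to for German text.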
Hi Sean,
Sean wrote:
The problem there is that when someone tries to actually search for a word sans an accent in this case, they wouldn't find a match. Somehow both cases would need to be indexed...
Sean
You're right. Both versions would have to be indexed, as German users probably tend to replace an "a with ae, etc. IMHO the stop words will also have to be changed depending on the language. (I have already compiled a German stop word list, if someone's interested.) But this is another topic.
Eric
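One way to read "both versions would have to be indexed" is to store each word under several keys: the original form, an accent-stripped form, and the German digraph form. The sketch below assumes a simple in-memory dictionary index, not Zope's actual catalog or Splitter; all function names are hypothetical.

```python
import unicodedata

# Hypothetical lower-case table for German folding (an assumption).
GERMAN_FOLD = {"\u00e4": "ae", "\u00f6": "oe", "\u00fc": "ue", "\u00df": "ss"}

def strip_accents(word):
    """Drop combining marks, e.g. e-acute -> e, u-umlaut -> u.
    The sz ligature has no decomposition and is left unchanged."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

def fold_german(word):
    """Replace umlauts and the sz ligature with ae, oe, ue, ss."""
    return "".join(GERMAN_FOLD.get(ch, ch) for ch in word)

def index_keys(word):
    """Return every form under which a word is indexed."""
    lowered = unicodedata.normalize("NFC", word).lower()
    return {lowered, strip_accents(lowered), fold_german(lowered)}

def build_index(docs):
    """Map each key to the set of document ids containing the word."""
    index = {}
    for doc_id, text in docs.items():
        for word in text.split():
            for key in index_keys(word):
                index.setdefault(key, set()).add(doc_id)
    return index

docs = {1: "Müller kauft Käse", 2: "un café crème"}
index = build_index(docs)

print(sorted(index_keys("Müller")))  # ['mueller', 'muller', 'müller']
print(index["mueller"], index["muller"], index["cafe"])  # {1} {1} {2}
```

With this layout a query only needs to be lower-cased before lookup: "Muller", "Mueller" and "Müller" all reach document 1, at the cost of a larger index.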