Hi, I'm looking for a pice of python code which keeps PrincipiaSearchSource clean from HTML TAGS before indexing an object with "self.reindex_object()" to the Catalog. Thanks Marcus
DocumentLibrary and TextIndexNG contain converters to convert HTML to ASCII before the object is indexed. -aj ----- Original Message ----- From: "Marcus Schopen" <marcus.schopen@uni-bielefeld.de> To: <zope@zope.org> Sent: Wednesday, March 27, 2002 19:12 Subject: [Zope] PrincipiaSearchSource & HTML TAGS
Hi,
I'm looking for a pice of python code which keeps PrincipiaSearchSource clean from HTML TAGS before indexing an object with "self.reindex_object()" to the Catalog.
Thanks Marcus
_______________________________________________ Zope maillist - Zope@zope.org http://lists.zope.org/mailman/listinfo/zope ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope-dev )
Hi Andreas, Andreas Jung wrote:
DocumentLibrary and TextIndexNG contain converters to convert HTML to ASCII before the object is indexed.
Thanks, but I think that's to much. It's just like finding a good way to remove all HTML tags from body var self.summary = body[0:min(200, len(body))] before write it in the next step with "self.reindex_object()" to the Catalog. Bye Marcus
Marcus Schopen writes:
I'm looking for a pice of python code which keeps PrincipiaSearchSource clean from HTML TAGS before indexing an object with "self.reindex_object()" to the Catalog. You may find "CatalogSupport" from
<http://www.dieter.handshake.de/pyprojects/zope> useful. Dieter
participants (3)
-
Andreas Jung -
Dieter Maurer -
Marcus Schopen