RE: [Zope] Advice on searching/indexing Word documents?
Cool. I'll have to take a look at this. Does anyone know if there is any effort aimed at writing document filters for use with Zope? A lot of commercial products used for knowledge management (like NextPage LivePublish, some Intranet search engines, etc) already have features like this, and I would think that a project for document filters would be a good idea, if something like this doesn't already exist. Possible things that could be filtered for input: - The IPTC header data from a JPG/TIF image - comtains a few things like the caption (the same one that you can edit in photoshop) - This would be a good addition to various Image classes. - Office documents (word, excel, powerpoint, wordperfect, staroffice, etc) - PDF and Postscript documents - Illustration files (Illustrator, CorelDraw) The value of such filters to Zope for use in knowledge-management and digital asset management would be great; I'm wondering if anyone is working on anything like this? Sean -----Original Message----- From: Jonothan Farr [mailto:jfarr@real.com] Sent: Tuesday, January 02, 2001 4:22 PM To: sean.upton@uniontrib.com; zope@zope.org Subject: Re: [Zope] Advice on searching/indexing Word documents?
I used to write text filters in C and Lex for my previous employer - one of these days I will figure out how to extend python with C and do this.
Here's one that's written entirely in Python: http://www.cosc.canterbury.ac.nz/~greg/python/Plex/ I've seen a couple of other implementations out there. --jfarr
participants (1)
-
sean.upton@uniontrib.com