RE: [Zope] Cataloging external documents (word, pdf, excel...)
Maybe your solution fits my needs. Is it a product you wrote yourself? Is it downloadable from zope.org?
Another question: if I have a large set of documents on the local hard disk, would I have to create an instance of your product in Zope for each document on the disk? If so, isn't there a way to catalog a whole set of external files at once (something like applying your modifications to an ExternalFolder product instead of an ExternalFile product)?
Thanks in advance,
Igor Leturia
Thierry Florac wrote:
Actually, I use a modified version of the ExternalFile product; this product stores the file's metadata in the ZODB, while the real document is stored on disk. I've just added a PrincipiaSearchSource method that retrieves the document text, which should be fully indexed, by running external commands (pdftotext, catdoc, ...). It works quite well with several document types: DOC, PDF, PS, XLS, PPT, HTML and RTF (and ASCII files, of course!).
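For readers who want to picture the approach described above, here is a minimal sketch in modern Python 3 (not the actual patch, which is not published). Only the PrincipiaSearchSource method name and the pdftotext and catdoc commands come from the message; the `CONVERTERS` table, `build_command`, `extract_text` and the `ExternalDocument` class are hypothetical names used for illustration, and the sketch assumes the converters are installed and write their output to stdout.

```python
import os
import subprocess

# Hypothetical mapping from file extension to a command template.
# pdftotext writes to stdout when the output file is "-";
# catdoc writes to stdout by default.
CONVERTERS = {
    ".pdf": ["pdftotext", "{path}", "-"],
    ".doc": ["catdoc", "{path}"],
}

# Formats that can be read directly (assumption: no conversion needed).
PLAIN_TEXT = {".txt", ".html", ".htm"}


def build_command(path, converters=CONVERTERS):
    """Return the argv list for converting `path`, or None if unsupported."""
    ext = os.path.splitext(path)[1].lower()
    template = converters.get(ext)
    if template is None:
        return None
    return [part.format(path=path) for part in template]


def extract_text(path, converters=CONVERTERS):
    """Return the searchable text of the file, or "" if it cannot be read."""
    ext = os.path.splitext(path)[1].lower()
    if ext in PLAIN_TEXT:
        with open(path, encoding="utf-8", errors="replace") as fh:
            return fh.read()
    command = build_command(path, converters)
    if command is None:
        return ""
    try:
        result = subprocess.run(command, capture_output=True, check=True)
    except (OSError, subprocess.CalledProcessError):
        # Converter missing or failed: index nothing rather than raise.
        return ""
    return result.stdout.decode("utf-8", errors="replace")


class ExternalDocument:
    """Hypothetical stand-in for an object whose file lives on disk."""

    def __init__(self, path):
        self.path = path

    def PrincipiaSearchSource(self):
        # A text index configured on this attribute would receive this text.
        return extract_text(self.path)
```

Returning an empty string on failure is a design choice of this sketch: a broken or missing converter leaves the document unindexed instead of aborting the cataloging run.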
Concerning the patch, it isn't available anywhere yet and is only used on my internal site; it also includes other modifications, which for example allow ExternalFiles to be treated as HTMLDocuments. I can send it to you if you're interested.
For your "big bunch of documents", the ExternalFile product now provides an "External File Batch" method, which lets you create Zope instances for a whole directory in a single operation.
Thierry
On Tue, 2002-07-16 at 12:45, Igor Leturia wrote:
Maybe your solution fits my needs. Is it a product you wrote yourself? Is it downloadable from zope.org?
Another question: if I have a large set of documents on the local hard disk, would I have to create an instance of your product in Zope for each document on the disk? If so, isn't there a way to catalog a whole set of external files at once (something like applying your modifications to an ExternalFolder product instead of an ExternalFile product)?
Thanks in advance,
Igor Leturia
Thierry Florac wrote:
Actually, I use a modified version of the ExternalFile product; this product stores the file's metadata in the ZODB, while the real document is stored on disk. I've just added a PrincipiaSearchSource method that retrieves the document text, which should be fully indexed, by running external commands (pdftotext, catdoc, ...). It works quite well with several document types: DOC, PDF, PS, XLS, PPT, HTML and RTF (and ASCII files, of course!).
_______________________________________________
Zope maillist - Zope@zope.org
http://lists.zope.org/mailman/listinfo/zope
** No cross posts or HTML encoding! **
(Related lists -
 http://lists.zope.org/mailman/listinfo/zope-announce
 http://lists.zope.org/mailman/listinfo/zope-dev )
-- Linux every day, keeps Dr Watson away... http://gpc.tuxfamily.org -- http://www.ulthar.net
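To illustrate what a directory-wide batch operation like the "External File Batch" method mentioned above might do, here is a rough Python 3 sketch. It is not the ExternalFile product's code: `batch_records`, the record fields and the default extension list are all hypothetical, and a real batch method would create Zope objects rather than plain dictionaries.

```python
import os


def batch_records(root, extensions=(".pdf", ".doc")):
    """Walk `root` and return one metadata record per matching file.

    Each record holds only what would be stored in the object database
    (an id, the path and the size); the file itself stays on disk.
    """
    records = []
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()  # sorting in place makes the traversal order stable
        for name in sorted(filenames):
            if os.path.splitext(name)[1].lower() not in extensions:
                continue
            full = os.path.join(dirpath, name)
            # Build the id from the relative path so that files with the
            # same name in different subdirectories do not collide.
            rel = os.path.relpath(full, root)
            records.append({
                "id": rel.replace(os.sep, "_"),
                "path": full,
                "size": os.path.getsize(full),
            })
    return records
```

Calling `batch_records("/some/document/root")` (a placeholder path) would return the list of records to register in one pass, which is the point of a batch method: no per-file manual step.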