Cataloging external documents (word, pdf, excel...)
Hi! I would like to know if it is possible to catalog in Zope some documents (word, pdf, excel...) that are in the local hard disk, not in the ZODB. I've been looking round zope.org and I found some products (ZCatalog, Kaivo Document Library, MSWordDocument, etc.), but I am not sure if they can catalog external documents and if they can catalog non-html documents such as word, excel, pdf, powerpoint, etc. I thought I could just ask the list for help, instead of wasting some hours installing and trying them. Can anyone help me? Thanks in advance, Igor Leturia
I could just ask the list for help, instead of wasting some hours installing and trying them. Can anyone help me? Have a look at textindexng (http://textindexng.sf.net)
Pieter
I could just ask the list for help, instead of wasting some hours installing and trying them. Can anyone help me? Have a look at textindexng (http://textindexng.sf.net)
Pieter
Just for my information, will this also work for *external* files, as Igor wanted, ie documents that are not part of the ZODB? I was working on something like this using LocalFS and just inserting the contents of the file into the catalog. Doesn't work yet though. Douwe
On 16/7/02 10:56 am, "douwe@oberon.nl" <douwe@oberon.nl> wrote:
I could just ask the list for help, instead of wasting some hours installing and trying them. Can anyone help me? Have a look at textindexng (http://textindexng.sf.net)
Pieter
Just for my information, will this also work for *external* files, as Igor wanted, ie documents that are not part of the ZODB?
I was working on something like this using LocalFS and just inserting the contents of the file into the catalog. Doesn't work yet though.
I had a look at that too - gave up, LocalFS hasn't seen work for a while, but there are moves afoot to bring it back to life (see archives and LocalFS page at Zope.org). DocumentLibrary (kaivo - http://www.zope.org/Members/Kaivo/DocumentLibrary) can index external files using a patch available from the mailing list (http://lists.sourceforge.net/mailman/listinfo/nlada-library-users). It does work, of a fashion - I can't index RTF files nor web pages, but the patch is built on 1.0b3 and the latest version of DocumentLibrary is 1.0rc1. DocumentLibrary as it stands (without the patch) is very well written and is very customizable - I just don't want a ZODB bloating up with multi-meg files! :) Hth Tone. -- Dr Tony McDonald, Assistant Director, FMCC, http://www.fmcc.org.uk/ The Medical School, Newcastle University Tel: +44 191 243 6140 A Zope list for UK HE/FE http://www.fmcc.org.uk/mailman/listinfo/zope
Hi Actually, I use a modified version of the ExternalFile product ; this product store file's metadata into ZODB, while the real document is stored on disk. I've just created a PrincipiaSearchSource method which retrieve document text which should be fully indexed, via external commands (pdftotext, catdoc,...). It actually works quite well with several documents types : DOC, PDF, PS, XLS, PPT, HTML and RTF (and ASCII files, of course !). Thierry On Tue, 2002-07-16 at 11:27, Igor Leturia wrote:
Hi!
I would like to know if it is possible to catalog in Zope some documents (word, pdf, excel...) that are in the local hard disk, not in the ZODB. I've been looking round zope.org and I found some products (ZCatalog, Kaivo Document Library, MSWordDocument, etc.), but I am not sure if they can catalog external documents and if they can catalog non-html documents such as word, excel, pdf, powerpoint, etc. I thought I could just ask the list for help, instead of wasting some hours installing and trying them. Can anyone help me?
Thanks in advance,
Igor Leturia
_______________________________________________ Zope maillist - Zope@zope.org http://lists.zope.org/mailman/listinfo/zope ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope-dev )
-- Linux every day, keeps Dr Watson away... http://gpc.tuxfamily.org -- http://www.ulthar.net
participants (5)
-
douwe@oberon.nl -
Igor Leturia -
pieterb@gewis.nl -
Thierry Florac -
Tony McDonald