Hi Andreas, Neither PrincipiaSearchSource nor SearchableText does anything for these File-type objects. I guess nothing for SearchableText is expected since these are not CMF or Plone-derived objects. The only way I've managed to get *anything* indexed for these File-type objects is by specifying the "data" attribute. A couple of related postings that I've found through a bit of Googling have also noted having to use "data" when indexing these kinds of files, for example: http://mail.zope.org/pipermail/zope/2003-August/139702.html So, I should be able to use PrincipiaSearchSource? I've only used that for text-oriented objects like Page Templates. I'll keep digging around, but I welcome any suggestions for what the problem could be or how I can debug this further. Garth On 12/12/05, Andreas Jung <lists@andreas-jung.com> wrote:
--On 12. Dezember 2005 11:33:13 -0500 "Garth B." <garthb@gmail.com> wrote:
TextIndexNG 3.1.1 Zope 2.8.0 Python 2.3.5
What attribute should be specified when indexing PDFs? I've been using "data". Word docs are indexed properly, but the PDFs aren't. The PDFs are still found with the rest of the files, but the indexed content is not what I expected.
Depends on the content-type. PrincipiaSearchSource for core Zope types as File, DTML and SearchableText for any CMF or Plone content-type.
-aj