At 2000-07-04 20:32 +0100, Steve Alexander <steve@cat-box.net> wrote:
On Fri, 30 Jun 2000, David Trudgett wrote:
I wonder if anyone has a sample Python script that traverses the ZODB to query or perform arbitrary operations on the objects found in it? My present requirement is to write a script that goes through and does some automated updating of several dozen DTML documents.
Hi David,
I just posted something similar to Zope-Dev:
http://lists.zope.org/pipermail/zope-dev/2000-July/005744.html
Thanks for that. I actually worked out a fairly similar hack myself, which only did one folder (which was OK for the particular task that I wanted it to do). Your recursive method has no doubt saved me quite a bit of time in generalising it. There are still a couple of little question marks in my mind, though. But before I get to them, I'll just post a sample of what I came up with (minus most of the gory details of the actual processing I wanted to do):

************
def processDTML(self):
    """Process DTML documents within a folder."""
    import re
    myregex = re.compile(r"(<H1>.*?</H1>)")
    thisdoc = self.getFirstChild()
    while thisdoc is not None:
        if hasattr(thisdoc, 'meta_type'):
            if thisdoc.meta_type == 'DTML Document':
                # Do processing stuff here
                doctext = thisdoc.raw
                doctext = myregex.sub(r"\1\n\n<H2>Profile</H2>\n", doctext)
                thisdoc.raw = doctext
        thisdoc = thisdoc.getNextSibling()
    return 'Complete!'
************

Just for easy reference, this was what you came up with:

************
import re

def convert_dtml(self):
    """Convert DTML Methods and DTML Documents from old syntax to
    new syntax.
    Warning: recursive!
    This assumes that DTML Method and DTML Document haven't been
    subclassed.
    """
    print 'convert_dtml: id=%s' % self.title_and_id()
    if hasattr(self, 'meta_type') and \
       (self.meta_type == 'DTML Method' or \
        self.meta_type == 'DTML Document'):
        convert(self)

    # should this be "isPrincipiaFolderish"?
    if hasattr(self, 'isAnObjectManager') and self.isAnObjectManager:
        for v in self.objectValues():
            v.convert_dtml()

_convert_regex = re.compile('''<!--#(/?)(([^"-]+?|"[^"]*?"|'[^']*?'|-[^-])+?)-->''')

def convert(dtml_item):
    print 'converting...'
    title = dtml_item.title
    # like document_src, but doesn't require RESPONSE
    data = dtml_item.PrincipiaSearchSource()
    print '----data----'
    print data
    newdata = _convert_regex.sub('<\g<1>dtml-\g<2>>', data)
    print '----newdata----'
    print newdata
    print '----end----'
    dtml_item.manage_edit(newdata, title)
************

Some questions I have in my mind:

1. I used "raw" and you used "PrincipiaSearchSource()". The reason I used raw was because I wanted to be sure that what I got wasn't munged in any way, and that what I put back reflects what was actually there before.

2. You used the "manage_edit()" method, whereas I just assigned to the "raw" property. My way seems to work OK, but I'm not sure how yours works: I assume it brings up an edit page in the browser for each document?

3. I don't like resorting to testing the "meta_type" for a particular string value. As you noted in your code, it doesn't allow for subclassing, so it's not fully general.

4. I was surprised that the import statement (not to mention "re.compile()") could be put outside of the method definition, considering that Zope's external methods grab on to individual methods within a Python module.

David Trudgett
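
As a footnote to question 4 and to the recursive traversal quoted above, here is a minimal sketch that can be tried outside Zope. It is only an illustration under stated assumptions: FakeDoc, FakeFolder, walk, convert_source and convert_doc are made-up names, not Zope APIs, and the code is written for a current Python 3 interpreter rather than the Python the scripts above were written for. Only the regular expression and the meta_type / isAnObjectManager / objectValues() checks are taken from the quoted code. It shows two things: names bound at module level (the import and the compiled pattern) are visible to the functions defined in the same module, and the recursion reaches nested folders while skipping objects whose meta_type is not one of the two DTML types.

```python
import re

# Module-level statements run once, when the module is first loaded, so the
# functions below can see both `re` and the compiled pattern as globals.
_convert_regex = re.compile(
    r'''<!--#(/?)(([^"-]+?|"[^"]*?"|'[^']*?'|-[^-])+?)-->''')

DTML_TYPES = ('DTML Method', 'DTML Document')


class FakeDoc:
    """Hypothetical stand-in for a DTML object (not a Zope class)."""

    def __init__(self, doc_id, source, meta_type='DTML Document'):
        self.id = doc_id
        self.source = source
        self.meta_type = meta_type


class FakeFolder:
    """Hypothetical stand-in for a folder (not a Zope class)."""
    meta_type = 'Folder'
    isAnObjectManager = 1

    def __init__(self, folder_id, children):
        self.id = folder_id
        self._children = list(children)

    def objectValues(self):
        return list(self._children)


def convert_source(text):
    """Rewrite old-style <!--#tag ...--> markup as <dtml-tag ...>."""
    return _convert_regex.sub(r'<\g<1>dtml-\g<2>>', text)


def walk(obj, visit):
    """Call visit() on obj if it has a DTML meta_type, then recurse."""
    if getattr(obj, 'meta_type', None) in DTML_TYPES:
        visit(obj)
    if getattr(obj, 'isAnObjectManager', 0):
        for child in obj.objectValues():
            walk(child, visit)


def convert_doc(doc):
    """Visitor used below: convert one stand-in document in place."""
    doc.source = convert_source(doc.source)


# Build a small tree: one document, one nested folder, one non-DTML object.
inner = FakeDoc('inner', '<!--#if x-->yes<!--#/if-->')
image = FakeDoc('logo', '<!--#var untouched-->', meta_type='Image')
root = FakeFolder('root', [
    FakeDoc('index_html', '<!--#var title-->'),
    FakeFolder('sub', [inner]),
    image,
])
walk(root, convert_doc)
print(inner.source)
```

With these stand-ins, the two objects whose meta_type is 'DTML Document' are rewritten, including the one inside the nested folder, and the object whose meta_type is 'Image' is left alone. Passing the visitor in as an argument is just one way to keep the traversal separate from the per-document processing; it is not how either script above is organised.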