Hi. I have a bunch of MS Word 97 files that I want to put onto the web, into Zope. They can be converted to html, pdf or remain as doc, I dont really care. But I want to keep the layout, fonts etc. Saving as html in word doesnt do that very well. Is this possible somehow? I guess I want a catalogaware MSWord Product or something... but is there any way to provide a search interface? /Magnus Heino
Magnus, I have a set of external methods that do just what you want, The documents are converted into a XMLDocument put into Zope and are then searchable via ZCatalog. The script runs on NT since it uses a com interface to Word to do the work. Works beautifully though. http://media-1.ml.uwcm.ac.uk/test/frameset to see it. If you would like a look at the scripts, mail me privately. HTH Phil phil@wigwamweb.net |>-----Original Message----- |>From: magnus@vuab.net [mailto:magnus@vuab.net] |>Sent: Wednesday, November 03, 1999 1:27 PM |>To: zope@zope.org |>Subject: [Zope] MSWord Product... |> |> |> |>Hi. |> |>I have a bunch of MS Word 97 files that I want to put onto the web, into |>Zope. They can be converted to html, pdf or remain as doc, I dont really |>care. But I want to keep the layout, fonts etc. Saving as html in word |>doesnt do that very well. |> |>Is this possible somehow? I guess I want a catalogaware MSWord Product |>or something... but is there any way to provide a search interface? |> |>/Magnus Heino |> |>_______________________________________________ |>Zope maillist - Zope@zope.org |>http://lists.zope.org/mailman/listinfo/zope |> |>(Related lists - please, no cross posts or HTML encoding! |> |>To receive general Zope announcements, see: |>http://lists.zope.org/mailman/listinfo/zope-announce |> |>For developer-specific issues, zope-dev@zope.org - |>http://lists.zope.org/mailman/listinfo/zope-dev ) |>
Whao people, Your slowing up my development server. Sorry to be so draconian but I've had to put a block on visits to the below mentioned site. If you want access, mail me privately. See ya Phil phil@wigwamweb.net |>-----Original Message----- |>From: zope-admin@zope.org [mailto:zope-admin@zope.org]On Behalf Of Phil |>Harris |>Sent: Wednesday, November 03, 1999 4:12 PM |>To: magnus@vuab.net; zope@zope.org |>Subject: RE: [Zope] MSWord Product... |> |> |>Magnus, |> |>I have a set of external methods that do just what you want, The |>documents |>are converted into a XMLDocument put into Zope and are then searchable via |>ZCatalog. |> |>The script runs on NT since it uses a com interface to Word to do |>the work. |> |>Works beautifully though. |> |>http://media-1.ml.uwcm.ac.uk/test/frameset to see it. |> |>If you would like a look at the scripts, mail me privately. |> |>HTH |> |>Phil |>phil@wigwamweb.net |> |>|>-----Original Message----- |>|>From: magnus@vuab.net [mailto:magnus@vuab.net] |>|>Sent: Wednesday, November 03, 1999 1:27 PM |>|>To: zope@zope.org |>|>Subject: [Zope] MSWord Product... |>|> |>|> |>|> |>|>Hi. |>|> |>|>I have a bunch of MS Word 97 files that I want to put onto the web, into |>|>Zope. They can be converted to html, pdf or remain as doc, I dont really |>|>care. But I want to keep the layout, fonts etc. Saving as html in word |>|>doesnt do that very well. |>|> |>|>Is this possible somehow? I guess I want a catalogaware MSWord Product |>|>or something... but is there any way to provide a search interface? |>|> |>|>/Magnus Heino |>|> |>|>_______________________________________________ |>|>Zope maillist - Zope@zope.org |>|>http://lists.zope.org/mailman/listinfo/zope |>|> |>|>(Related lists - please, no cross posts or HTML encoding! |>|> |>|>To receive general Zope announcements, see: |>|>http://lists.zope.org/mailman/listinfo/zope-announce |>|> |>|>For developer-specific issues, zope-dev@zope.org - |>|>http://lists.zope.org/mailman/listinfo/zope-dev ) |>|> |> |> |>_______________________________________________ |>Zope maillist - Zope@zope.org |>http://lists.zope.org/mailman/listinfo/zope |> No cross posts or HTML encoding! |>(Related lists - |> http://lists.zope.org/mailman/listinfo/zope-announce |> http://lists.zope.org/mailman/listinfo/zope-dev ) |>
Magnus,
I have a set of external methods that do just what you want, The documents are converted into a XMLDocument put into Zope and are then searchable via ZCatalog.
The script runs on NT since it uses a com interface to Word to do the work.
Works beautifully though.
http://media-1.ml.uwcm.ac.uk/test/frameset to see it.
If you would like a look at the scripts, mail me privately.
HTH
Phil phil@wigwamweb.net
|>-----Original Message----- |>From: magnus@vuab.net [mailto:magnus@vuab.net] |>Sent: Wednesday, November 03, 1999 1:27 PM |>To: zope@zope.org |>Subject: [Zope] MSWord Product... |> |> |> |>Hi. |> |>I have a bunch of MS Word 97 files that I want to put onto the web, into |>Zope. They can be converted to html, pdf or remain as doc, I dont really |>care. But I want to keep the layout, fonts etc. Saving as html in word |>doesnt do that very well. |> |>Is this possible somehow? I guess I want a catalogaware MSWord Product |>or something... but is there any way to provide a search interface? |> |>/Magnus Heino
Another option is the word->html conversion program. I don't have the url but just search on (www.freshmeat.net) The results I have seen are reasonably good, probably not perfect though. If layout it critical I would suggesting converting to PDF using the Adobe Acrobat (unfortuntely this is quite expensive :( ) Benno
At 05:01 AM 11/4/99 +1100, Ben Leslie wrote:
The results I have seen are reasonably good, probably not perfect though. If layout it critical I would suggesting converting to PDF using the Adobe Acrobat (unfortuntely this is quite expensive :( )
There is another way to do that. Print to file using a Postscript driver from Word, then load the result into GhostScript/GSView (free, open source) and generate a PDF. Voila. It's a bit less convenient than Acrobat, but for many files it actually runs faster than Acrobat.
Date sent: Wed, 03 Nov 1999 13:29:03 -0500 To: benno@sesgroup.net, Phil Harris <phil@wigwamweb.net> From: "Phillip J. Eby" <pje@telecommunity.com> Subject: Re: [Zope] MSWord Product... Copies to: magnus@vuab.net, zope@zope.org
At 05:01 AM 11/4/99 +1100, Ben Leslie wrote:
The results I have seen are reasonably good, probably not perfect though. If layout it critical I would suggesting converting to PDF using the Adobe Acrobat (unfortuntely this is quite expensive :( )
There is another way to do that. Print to file using a Postscript driver from Word, then load the result into GhostScript/GSView (free, open source) and generate a PDF. Voila. It's a bit less convenient than Acrobat, but for many files it actually runs faster than Acrobat.
And anyway the double step postscript->pdf is generally assumed produce the best results for Acrobat as well. The direct conversion word->acrobat is just convenient. Note also that if you want to access/catalog the pdf contents as well you could use an existing python product (not zope related), called prescript. I do not have the url handy at the moment, but it should easily be found on the python website (www.python.org) Rik
http://www.nzdl.org/cgi-bin/gw?a=page&p=Prescript
-----Original Message----- From: zope-admin@zope.org [mailto:zope-admin@zope.org]On Behalf Of Rik Hoekstra Sent: Wednesday, November 03, 1999 13:26 To: Phillip J. Eby; Zope@zope.org Subject: Re: [Zope] MSWord Product...
Date sent: Wed, 03 Nov 1999 13:29:03 -0500 To: benno@sesgroup.net, Phil Harris <phil@wigwamweb.net> From: "Phillip J. Eby" <pje@telecommunity.com> Subject: Re: [Zope] MSWord Product... Copies to: magnus@vuab.net, zope@zope.org
At 05:01 AM 11/4/99 +1100, Ben Leslie wrote:
The results I have seen are reasonably good, probably not
perfect though.
If layout it critical I would suggesting converting to PDF using the Adobe Acrobat (unfortuntely this is quite expensive :( )
There is another way to do that. Print to file using a Postscript driver from Word, then load the result into GhostScript/GSView (free, open source) and generate a PDF. Voila. It's a bit less convenient than Acrobat, but for many files it actually runs faster than Acrobat.
And anyway the double step postscript->pdf is generally assumed produce the best results for Acrobat as well. The direct conversion word->acrobat is just convenient.
Note also that if you want to access/catalog the pdf contents as well you could use an existing python product (not zope related), called prescript. I do not have the url handy at the moment, but it should easily be found on the python website (www.python.org)
Rik
participants (6)
-
Ben Leslie -
Loren Stafford -
Magnus Heino -
Phil Harris -
Phillip J. Eby -
Rik Hoekstra