RE: [Zope] XML documents and International character set
Joachim wrote:
I played a little bit with the XMLDocument, this looks very interesting and opens a lot of possibilities, especially since you can use ZCatalog with XML-documents. That brings the question of international charactersets up. I tried to insert an "ö" into one of the example slides and get a parse error when hitting change. I can replace the "ö" by "ö", but how will that be handled by ZCatalog ?
Yes, this is currently a weakness in the indexing machinery -- it won't handle international characters. --Paul
On Sun, 22 Aug 1999, Paul Everitt wrote:
Joachim wrote:
I played a little bit with the XMLDocument, this looks very interesting and opens a lot of possibilities, especially since you can use ZCatalog with XML-documents. That brings the question of international charactersets up. I tried to insert an "ö" into one of the example slides and get a parse error when hitting change. I can replace the "ö" by "ö", but how will that be handled by ZCatalog ?
Yes, this is currently a weakness in the indexing machinery -- it won't handle international characters.
Do XML documents in general support 8-bit chars ? Gruß Joachim Schmitz WWW-Consultant email: js@ac-copy.net tel: +49-241-89491-0 fax: +49-241-89491-29
Do XML documents in general support 8-bit chars ?
http://www.w3.org/TR/1998/REC-xml-19980210 == 2.2 Characters A parsed entity contains text, a sequence of characters, which may represent markup or character data. A character is an atomic unit of text as specified by ISO/IEC 10646 [ISO/IEC 10646]. Legal characters are tab, carriage return, line feed, and the legal graphic characters of Unicode and ISO/IEC 10646. The use of "compatibility characters", as defined in section 6.8 of [Unicode], is discouraged. Character Range [2] Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF] -- Petri Lankoski Let us gather hallusinations our private minds kreivi@iki.fi Let us witness the reincarnation of the Sun http://www.iki.fi/~kreivi/ fon PGP: http://www.iki.fi/~kreivi/pgp.txt
participants (3)
-
Joachim Schmitz -
Paul Everitt -
Petri Lankoski