Word 2003 as XML editor -- how to preserve white space (incl. CR/LF)?

J

Jon

Everyone,

I've been experimenting with using Word 2003 as an XML editor: to
read, edit, and save XML documents valid to a schema I'm developing
(this schema is NOT WordML.)

Unfortunately, when saving the document, Word 2003, by default,
normalizes white space which effectively makes the document one single
line (which is not wanted). In addition, it takes markup like this
on input:

<paragraph>This <emph>is</emph> a paragraph.</paragraph>

and saves it as:

<paragraph>This<emph>is</emph>a paragraph.</paragraph>

Which is pretty serious mangling for document use purposes.

I'm under the understanding that I need to apply an XSLT transform to
assure the white space (including CR/LF) is preserved when I save the
document as XML data. Not having worked with XSLT, can someone send me
a copy of an XSLT that will work? I'm under the impression it will
only be a few lines long.

Other requirements are that the transform must keep the encoding of
the original document (either UTF-8 or UTF-16) and make no changes
whatsoever to the markup.

Thanks.

Jon Noring


(p.s., has Word 2007 corrected some of the problems in Word 2003 for
XML editing use, such as the white space problem described above?)
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top