Is there a way to strip HTML?

W

WalterH

I have a good number of messages that I have imported that were HTML,
but imported without HTML. Is there an easy way to strip all of the
erroneous HTML code out? All of the messages in question appear as
such, which make it quite hard to read (and this is one of the easier
ones). Any help would be appreciated. See below for an example:

<HTML dir=ltr><HEAD>

<META content="MSHTML 6.00.2900.3059" name=GENERATOR></HEAD>
<BODY>
<DIV><FONT face=Arial color=#000000 size=2>Hi guys </FONT></DIV>
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2>I wanted to draw your attention to one
more issue with regard to the EasyScript/Retail interface.&nbsp;
Thanks for working to&nbsp;populate the EasySCript OEF with teh dosage
form information.&nbsp; This seems to be workign correctly and flowign
form the order sentences.&nbsp; The new (and bigger) glitch is that
during testing non of this information flows to teh e-Rx in retail
medmanager.&nbsp; We need to figure out a way to transfer this data in
a consices and easily identifiable way to the e-Rx.&nbsp; </FONT></
DIV>
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2>I'll log this as a solution issue today.</
FONT></DIV>
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2>Jarrid</FONT></DIV></BODY></HTML>
 
D

Diane Ross

Is there an easy way to strip all of the
erroneous HTML code out?

Not an easy way, but it works.

<http://www.zubrag.com/tools/html-tags-stripper.php>
=============== Script options ===============
Check out this page for some options: (unix)

http://www.w3.org/Tools/html2things.html

Extract text inside tags of a string (MacScripter forums)

<http://macscripter.net/viewtopic.php?id=17180>

Batch text conversion with textutil (not sure if this one works in Leopard
or SL)
<http://www.macosxhints.com/article.php?story=20060309220909384>

Remove HTML E - Translate HTML coded message into plain text (this is also
an older one. Not sure if DecodeQP works in newer OS)
<http://mac.softpedia.com/get/HTML-Tools/Remove-HTML-E.shtml>

Let us know if you find one that's easy to use. I'll post the results on the
Entourage Help Page.
 
T

Terry R.

On 11/11/2009 12:26 PM On a whim, WalterH pounded out on the keyboard
I have a good number of messages that I have imported that were HTML,
but imported without HTML. Is there an easy way to strip all of the
erroneous HTML code out? All of the messages in question appear as
such, which make it quite hard to read (and this is one of the easier
ones). Any help would be appreciated. See below for an example:

<HTML dir=ltr><HEAD>

<META content="MSHTML 6.00.2900.3059" name=GENERATOR></HEAD>
<BODY>
<DIV><FONT face=Arial color=#000000 size=2>Hi guys</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2>I wanted to draw your attention to one
more issue with regard to the EasyScript/Retail interface.&nbsp;
Thanks for working to&nbsp;populate the EasySCript OEF with teh dosage
form information.&nbsp; This seems to be workign correctly and flowign
form the order sentences.&nbsp; The new (and bigger) glitch is that
during testing non of this information flows to teh e-Rx in retail
medmanager.&nbsp; We need to figure out a way to transfer this data in
a consices and easily identifiable way to the e-Rx.&nbsp;</FONT></
DIV>
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2>I'll log this as a solution issue today.</
FONT></DIV>
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2>Jarrid</FONT></DIV></BODY></HTML>

Hi Walter,

I don't know how many you have, but one way is to select all the text as
you did above, paste it into a text document and save it with a .HTM
extension. Double click on the save file and it should open in your
browser. Select all the text and copy it, then paste it back into a text
document. The HTML will be removed and only the plain text is copied.


Terry R.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top