Data extraction from HTML source using C++

D

David F

Anybody knows about C++ sources to extract raw text data from (X)HTML and
XML source pages?

Aby relevant info is greatly appreciated.
 
P

Peter Aitken

David F said:
Anybody knows about C++ sources to extract raw text data from (X)HTML and
XML source pages?

Aby relevant info is greatly appreciated.

Microsoft has an XML parser and toolkit that you can download. I'm sure you
could call it from C++. I don't remember the details but if you search on
the msdn website you should be able to find it. If you're using C++ in .Net
then the .Net framework itself has powerful XML tools.
 
Top