Extracting Embedded Objects from MS PowerPoint documents

I

InfoSeeker

I am posting to this group due to the following excellent thread
already in existence here:

http://groups.google.co.uk/groups?h...24NaWYdEHA.3476%40tk2msftngp13.phx.gbl&rnum=1

While the above thread provided the basis to implement a solution to
extract embedded objects from Word and Excel I need some assistance
with respect to PowerPoint. While Excel and Word do differ in their
object embedding technique (to a lesser extent) the same pinciples of
extraction apply but PowerPoint appears to use a very different
internal storage method for embedded objects. Unlike the other two
document types, PowerPoint, does not appear to use Storage objects for
the seperate embedded objects and I assume they are all "mixed" into
the main document stream? However I am hoping there is some
"structure" to this stream that I can use to extract the objects?

As well as extracting other office documents that maybe embedded in
the PowerPoint document I also wish to extract any embedded zip files.
So far I have done this in Word and Excel by content "sniffing" the
Ole10Native stream of a CLSID_Package storage object. However with
PowerPoint it is not immeadiately obvious where to start?

Thanks in advance for any information that can be offered on this
topic. For reference, although largely irrelevant, this is all to be
part of a C++ application.

Regards,
Tim.
 
M

math512

Hi,

I have the same problem.

With POI HSLF api (java), i could extract embedded object, but they are
compressed. I don't know compress format.

Thanks in advance for any information that can be offered on this topic.

Regards,

Mathieu
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top