J
James Knowles
Hi,
I am trying to build a document indexing tool for a .NET application. I am
fine with the indexing bit and but I cannot find any real information on
Converting word documents into Text. I do not want to use the word object and
application as I will be indexing around about 40,000 word documents. So I
want to be able to read the file directly and extract the text and index.
Does anyone know where I can find out about the word file format or can point
me in a direction were I can find out more information on this.
Thanks for any help,
James
I am trying to build a document indexing tool for a .NET application. I am
fine with the indexing bit and but I cannot find any real information on
Converting word documents into Text. I do not want to use the word object and
application as I will be indexing around about 40,000 word documents. So I
want to be able to read the file directly and extract the text and index.
Does anyone know where I can find out about the word file format or can point
me in a direction were I can find out more information on this.
Thanks for any help,
James