Letters get changed when copied from a PDF document

Ponnurangam

Hi,

I have a PDF document that contains Greek letters.

When I try to copy text from it into a Word document (Microsoft Office Word
2003), some letters are replaced by other ones. (Font: Times New Roman)

Here are two of them:

(1) "U+03AC: Greek Small Letter Alpha With Tonos" in the PDF is replaced by
"U+00DC: Latin Capital Letter U With Diaeresis" in Word.

(2) "U+0394: Greek Capital Letter Delta" in the PDF is replaced by
"U+00C4: Latin Capital Letter A With Diaeresis" in Word.
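
A possible explanation for this particular pair of substitutions (an assumption, not something confirmed in this thread): each Greek letter has the same byte value in the Greek code page Windows-1253 as its Latin replacement has in the Western code page Windows-1252. If the copied text passes through single bytes and is then read as Western text, exactly this swap would appear. A minimal Python sketch of that mismatch, with the helper names `misread` and `repair` being hypothetical:

```python
# Assumption: the pasted text passed through a single-byte Greek encoding
# (Windows-1253) and was then interpreted as Western (Windows-1252).
GREEK = "cp1253"
WESTERN = "cp1252"

def misread(text: str) -> str:
    """Encode Greek text as cp1253 bytes, then misinterpret them as cp1252."""
    return text.encode(GREEK).decode(WESTERN)

def repair(text: str) -> str:
    """Reverse the misinterpretation for text that round-trips cleanly."""
    return text.encode(WESTERN).decode(GREEK)

# The two letters from the post: alpha with tonos and capital delta.
for ch in "\u03ac\u0394":
    wrong = misread(ch)
    byte = ch.encode(GREEK)[0]
    print(f"U+{ord(ch):04X} -> byte 0x{byte:02X} -> U+{ord(wrong):04X}")
```

`repair` only works where every character exists in both code pages; anything else raises a `UnicodeEncodeError` or `UnicodeDecodeError`, so it is a diagnostic aid rather than a general fix.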


Thanks
Ponnurangam
 
Graham Mayor

How are you copying? Word has no facility to translate anything to or from
PDF.

--
<>>< ><<> ><<> <>>< ><<> <>>< <>><<>
Graham Mayor - Word MVP


<>>< ><<> ><<> <>>< ><<> <>>< <>><<>
 
Ponnurangam

I am copying the text from the PDF to the clipboard and then pasting it into
Word.

Thanks
Ponnurangam
 
Graham Mayor

Try Edit > Paste Special rather than a simple paste, but my guess is that you
may not get any further with this. PDF is essentially a graphics format, and
the conversion of graphics formats back to text is somewhat hit and miss.
Unlike OCR software, where you can at least train the software to use
particular characters in some circumstances, here you don't have that
option.

 
Ponnurangam

I tried Paste Special. It didn't work. Can you suggest some OCR software that
I can use to scan text from an image?

Thanks
Ponnurangam
 
Charles Kenyon

I have had good results taking PDF files and using PaperPort's print driver
to convert them to a PaperPort file and then OCR to Word. Retyping may be
easier.
 
Graham Mayor

Abbyy Finereader works reasonably well and they now have a separate module
available dedicated to reading PDF files. See if the demo works on your
file. http://www.abbyy.com/pdftransformer/

 
Ponnurangam

Hi,

First, thanks very much for all your replies.

Abbyy Finereader also does the same replacement of characters that I
mentioned earlier. Any other ideas?

Also, I couldn't find any OCR software for the Greek language. Do you know of
any?

Thanks
Ponnurangam
 
Ponnurangam

Hi,

Can you tell me how to do that?

I mean, how do I convert from PDF to a PaperPort file using PaperPort's print
driver and then OCR to Word?

Also, I couldn't find any OCR software for the Greek language. Do you know of
any?

Thanks
Ponnurangam
 
Graham Mayor

Just a thought - try setting the Windows language to Greek whilst
converting.

 
Ponnurangam

Hi,

Setting the language to Greek while converting with Abbyy Finereader didn't
work.

Thanks
Ponnurangam
 
Graham Mayor

If you like you could send me a copy of the pdf, via the link on my web
site, with confirmation of where the problem occurs and I'll try some
alternatives.

 
Ponnurangam

Hi,

I found a solution for this. It seems my PC doesn't have the "Times New Roman
Dual Greek" font. I need to install it.

Thanks very much
Ponnurangam
 
Suzanne S. Barnhill

TNR Unicode does contain the Basic Greek character subset, which should
include most of the needed glyphs (including all the ones you mention).
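
To double-check where the two characters from this thread sit in Unicode, their code points can be looked up with Python's standard `unicodedata` module. The range U+0370 to U+03FF used below is the Unicode "Greek and Coptic" block; treating it as equivalent to the font's "Basic Greek" subset is an assumption, and `in_greek_block` is a hypothetical helper name. The sketch says nothing about which glyphs a particular installed font actually contains.

```python
import unicodedata

# Assumption: "Basic Greek" corresponds to the Greek and Coptic block.
GREEK_BLOCK = range(0x0370, 0x0400)

def in_greek_block(ch: str) -> bool:
    """Return True if the character's code point lies in U+0370..U+03FF."""
    return ord(ch) in GREEK_BLOCK

# The two Greek letters from the PDF and the two Latin letters seen in Word.
for ch in ("\u03ac", "\u0394", "\u00dc", "\u00c4"):
    name = unicodedata.name(ch)
    print(f"U+{ord(ch):04X} {name} greek_block={in_greek_block(ch)}")
```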
 
