Unique issue with PDF to WORD .doc conversion with Acrobat Pro - any ideas?
i have been unable solve following issue when converting (save as...) pdf documents microsoft word .doc using numerous methods. either issue fixed in acrobat pro itself, or in ms word - posting adobe forums first.
preface: attempting use converted .doc file translation applications/software. google translator toolkit use most, other translators having same issue .doc file. --the source pdfs product information drug manufacturers in various countries need have translated english. not have access source documents, not provide own source docs obvious reasons.
also: cannot use google translator toolkit translate pdfs directly - if that, attempt translate pdf , export in .html file, not exact spacing of sentences correctly, leads errors in translating - key things such "can take alcohol" , "do not take alcohol". that's out!
i not having problems resultant .doc file in ms word itself. looks right, spacing matches original pdf source perfectly, prints correctly, etc... reference here on product info sheet austria in german:
the problem: screenshot google translator toolkit - right side of image - spacing in lettering .doc file uploading not being read correctly, resulting in untranslated gibberish. (note: isn't problem translation applications or software -- having issue .doc files converted .pdf - issue isn't present old .doc file wasn't converted .pdf) -- it's got kind of embedded data in .doc file cannot isolate!!)
my settings in adobe pro (convert pdf .doc):
page layout: flowing text (this prevents resultant .doc having of text boxes, don't work in translators)
include comments: true
include images: true
run ocr if needed: true
notes:
-i have run ocr text recognition on source pdf files in it's specific language.
-i have edited accessibilty of pdf , have run tag recognition , quick checks (to see if solved issue, did not - tagged or untagged, same problems!)
-i have exported .doc pdf using ms word's function, results in great looking tagged pdf. re-saved new pdf .doc - same issue.
-i have tried saving pdf in of other formats translators accept. have different issues. 1 works consistently saving .txt (plain)... best .doc .doc conversion, original spacing. (i not spending hours reformatting .txt translation in word)...
i can't seem find spacing data in .doc file!!!! (changing fonts, sizes, margins -- doesnt fix either). have tried many methods...
any thoughts on other things try in adobe pro (or word)?
---
edit: here's additional tidbit of info may key this... there's kind of coding in .doc adobe pro converted source pdf doesnt display in word, being seen translation programs....... have no idea these are, want remove them!
message edited by: kaotikadc
i suggest @ fonts being used. may font issue not being read translation program.
More discussions in Creating PDFs
adobe



Comments
Post a Comment