Pages in topic: < [1 2 3 4] > | Tips for PDF translation Thread poster: Mauricio Coitiño
| | Get a PDF translated in 5 steps | Aug 24, 2009 |
Yaotl Altan wrote:
Pablo:
I've downloaded a Solid Converter PDF trial version and it works really fine!
Yes, SolidConverter PDF does most of the job, if you use the real page convertig method. The problem is that MS-Word neither some MS-Word based CAT tools does not see the text in textboxes. Did you try the method that I suggested with the werecat macro?
BR | | | Jande Australia Local time: 13:40 Danish to English + ... Tricks for translating pdf's | Aug 24, 2009 |
First method:
Print out the pdf.
or
Arrange your screen to see both the source pdf and final (word) file.
Start translating.
Second method:
You can take a snapshot of the pdf or do a print screen (Print Scrn) and then paste it into a word document.
With two word files (if you managed to get your pdf into a word document)
Open both documents source and final document and use the "window" "compare side... See more First method:
Print out the pdf.
or
Arrange your screen to see both the source pdf and final (word) file.
Start translating.
Second method:
You can take a snapshot of the pdf or do a print screen (Print Scrn) and then paste it into a word document.
With two word files (if you managed to get your pdf into a word document)
Open both documents source and final document and use the "window" "compare side by side with" feature in word for windows.
I usually find that my pdf translations are as good as my word ones. I love to copy the pictures and edit them in bitmap, so the final document is a carbon copy of the original, except in another langauage.
It is easier to remember to translate headers, footers and diagrams, if you do pdf's.
As the final document should look like the first, you can check to see if you've missed any sentances by comparing paragraphs, tables etc. Especially if you use the side by side feature. ▲ Collapse | | | My way of dealing with PDFs | Feb 12, 2010 |
Hopefully, I have few now. But I used to process many, so I have an algorithm.
Here we go:
1) Take a PDF file (the easiest step ).
2) Remove password protection, if any, using tools available on the Net.
3) Feed it to some OCR tool (I use FineReader 8 and 9 -- there is difference*).
4) Recognize and save as RTF.
5) Open in WordPad and save as DOC (thus you remove all unnecessary formatting... See more Hopefully, I have few now. But I used to process many, so I have an algorithm.
Here we go:
1) Take a PDF file (the easiest step ).
2) Remove password protection, if any, using tools available on the Net.
3) Feed it to some OCR tool (I use FineReader 8 and 9 -- there is difference*).
4) Recognize and save as RTF.
5) Open in WordPad and save as DOC (thus you remove all unnecessary formatting).
6) Open in Word and format the source to match the original (to eliminate the bugs that might affect segmentation).
7) Translate using a CAT tool.
8) Check.
9) Double check.
10) Print to PDF using, say DoPDF driver and...
... let the client be happy with PDF!
* In FR8, unlike FR9, you can disable "formatting using frames" (so skip Step 5). FR9 is good at recognizing very small fonts, dirty paper and wavy lines (but you might get frames).
[Редактировалось 2010-02-12 22:20 GMT] ▲ Collapse | |
|
|
Brian Young United States Local time: 18:40 Danish to English PDF conversion | Feb 13, 2010 |
My outsourcers now use a program called Able2docpro to convert PDF's to word. It works extremely well, and retains all of the original formatting.
If I have to work with a PDF I just open it in a window, and then open a word doc in a new window. I size the windows so that they both appear on the same screen; the PDF on the top, the word doc on the bottom. This has really worked well for me.
I agree that converting PDF to word is not what I would call DTP.
I always try to get t... See more My outsourcers now use a program called Able2docpro to convert PDF's to word. It works extremely well, and retains all of the original formatting.
If I have to work with a PDF I just open it in a window, and then open a word doc in a new window. I size the windows so that they both appear on the same screen; the PDF on the top, the word doc on the bottom. This has really worked well for me.
I agree that converting PDF to word is not what I would call DTP.
I always try to get the original documents, but sometimes they are simply not available. In those cases I translate the language, and let someone else do the DTP, if that is required to replicate complicated formatting, or the insertion of graphics. I don't know how to do that type of work, and I don't even care to learn.
As far as OCR'ing the PDF, then you need both documents, because very often the OCR either misses too many things, or it does not recognize certain characters. You need the PDF to check the accuracy of the OCR'ed version.
I work on a mac, using Office for mac. I don't think Able2docpro will run on a mac, but I have not looked into that. It works great on a PC. ▲ Collapse | | | Yaotl Altan Mexico Local time: 20:40 Member (2006) English to Spanish + ... | sebdea Romania Local time: 04:40 My recommandation | Apr 28, 2010 |
Yaotl Altan wrote:
Thank you, efreitag, but which is better? Abbyy Fine Reader 9 or Abbyy PDF Transfomer? The first costs 400 USD and the second 60 USD. Am I missing something? ![](https://cfcdn.proz.com/images/bb/smiles/icon_smile.gif) [Edited at 2009-08-22 21:50 GMT]
I kindly recommend you to buy Fine Reader 9. it is more powerfull then Transformer. Still, Fine Reader has its own limitations. I have trouble when I have images inserted in the middle of text, or tables with a complex structure.
Seba | | | sebdea Romania Local time: 04:40 PDF conversion is DTP (in my opinion) | Apr 28, 2010 |
Hi,
I work as DTP operator for a translation company based in Timisoara. Clients usually are sending PDF files to be translated. If the clients need back only a PDF file with translation I use InFix PDF Editor. Great tool. But most of the clients are sending low res PDF files and want back an editable file (mostly ask for MS Word). In this case I do DTP job. I use Abby Fine Reader 9. Convert files into MS Word and do layout to match exactly with the original.
My real pr... See more Hi,
I work as DTP operator for a translation company based in Timisoara. Clients usually are sending PDF files to be translated. If the clients need back only a PDF file with translation I use InFix PDF Editor. Great tool. But most of the clients are sending low res PDF files and want back an editable file (mostly ask for MS Word). In this case I do DTP job. I use Abby Fine Reader 9. Convert files into MS Word and do layout to match exactly with the original.
My real problem appears when the client need an estimation for translation and for DTP. In this case the translators ask me for an editable file (they get a low res PDF from client) to import it into CAT Tools for word-counting. This is the challenge! What shall be done? I usually convert PDF into MS Word without formatting the doc. But this procedure is time consuming. And there are cases when clients don't work with us after they receive the estimation.
When PDF files are at a high resolution and are clean I use InFix PDF Editor to export PDF into XML file. The translators import the file into CAT Tools and do word counting for price estimation to the clients. But not always works so fine.
So, my question to you all is as follows. What has to be done to avoid working several hours for nothing?
Thank you, in advance.
Seba
[Edited at 2010-04-28 10:51 GMT] ▲ Collapse | |
|
|
Mauricio Coitiño Uruguay Local time: 23:40 English to Spanish + ... TOPIC STARTER No need to convert PDF for wordcount | Apr 28, 2010 |
There are tools for PDF wordcount, and you don't even need to download them. Just google them. Online Wordcount and Abracadabra are just examples.
Even if you don't use this tools, you just need to convert the file (it iakes a few seconds, unless it's a hundred-page doc) and then use any wordcount tool for Word files. You can leave the program working while you do something else, right?
Regards,
Mauricio | | | Vito Smolej Germany Local time: 03:40 Member (2004) English to Slovenian + ... SITE LOCALIZER
dont fall for it.
Or, in dire straits, quote a reasonable bozo price - pi as a blot factor is realistic. | | | sebdea Romania Local time: 04:40 You are very wrong - No offence | Apr 29, 2010 |
Mauricio Coitiño wrote:
There are tools for PDF wordcount, and you don't even need to download them. Just google them. Online Wordcount and Abracadabra are just examples.
Even if you don't use this tools, you just need to convert the file (it iakes a few seconds, unless it's a hundred-page doc) and then use any wordcount tool for Word files. You can leave the program working while you do something else, right?
Regards,
Mauricio
They need a more detailed word count (e.g. Repetitions found - Pretranslated - Partially Translated - 100-95% - 94-85% - 84-75% - 74-50% - Remaining not translated units - Total)
I do believe that kind of word count you will never get it with an online so called word count software.
Best regards,
Seba | | | Mauricio Coitiño Uruguay Local time: 23:40 English to Spanish + ... TOPIC STARTER No offence: just a bit of accuracy | Apr 29, 2010 |
Dear Seba,
Let's not confuse word count with text analysis. My post dealt with the former, and that may be enough for certain projects.
Like you, I haven't used these tools yet, but I wouldn't dare to say they can't analyse. Just give them a try.
Regards,
Mauricio | |
|
|
sebdea Romania Local time: 04:40 A minor misunderstanding | May 4, 2010 |
Mauricio Coitiño wrote:
Dear Seba,
Let's not confuse word count with text analysis. My post dealt with the former, and that may be enough for certain projects.
Like you, I haven't used these tools yet, but I wouldn't dare to say they can't analyse. Just give them a try.
Regards,
Mauricio
My mistake. There is a problem with our PM. They always ask for wordcount. As you said is text analysis. Anyway. I will try use those tools but I am concerned about low resolution PDF. Those files are giving me a headache. | | | Catherine Muir Australia Local time: 13:40 Indonesian to English + ... In memoriam Any current advice on converting image-based PDFs to MS Word? | Aug 15, 2010 |
I have just stumbled upon this thread and, because almost all my jobs arrive as image-based PDFs, I am interested to know which conversion software gets the general nod of approval these days. I downloaded the trial version of ABBYY Fine Reader V.10, on the advice of a client, and it seems to work well, but it's expensive to buy. Is this still considered to be the best software for converting image-based PDFs, or is there something better out there now? | | | Update in InFix | Aug 15, 2010 |
It all depends whether one actually want to convert a PDF into a DOC just to translate it and rebuild the original PDF pub translated, or if the text from the PDF is needed for other purposes, i.e. a newly designed pub.
For translation of a "distilled" PDF, InFix Pro is almost there. Apart from a few minor fixes it still needs, it's working! I delivered a translated PDF last week done... See more It all depends whether one actually want to convert a PDF into a DOC just to translate it and rebuild the original PDF pub translated, or if the text from the PDF is needed for other purposes, i.e. a newly designed pub.
For translation of a "distilled" PDF, InFix Pro is almost there. Apart from a few minor fixes it still needs, it's working! I delivered a translated PDF last week done with it. A short demo video is available at http://www.iceni.com/infixTutorials/Translation/translate.htm
Most - possibly all - of the features I'd want are already there. For the first time I finally read through the whole 150-page manual, and discovered them. ▲ Collapse | | | Pages in topic: < [1 2 3 4] > | To report site rules violations or get help, contact a site moderator: You can also contact site staff by submitting a support request » Tips for PDF translation CafeTran Espresso | You've never met a CAT tool this clever!
Translate faster & easier, using a sophisticated CAT tool built by a translator / developer.
Accept jobs from clients who use Trados, MemoQ, Wordfast & major CAT tools.
Download and start using CafeTran Espresso -- for free
Buy now! » |
| Pastey | Your smart companion app
Pastey is an innovative desktop application that bridges the gap between human expertise and artificial intelligence. With intuitive keyboard shortcuts, Pastey transforms your source text into AI-powered draft translations.
Find out more » |
|
| | | | X Sign in to your ProZ.com account... | | | | | |