Importing csv with for Chinese Thread poster: Omer Shani
|
Omer Shani Israel Local time: 15:17 Member (2012) English to Hebrew + ...
Hello supporters colanguages!
I want to import a csv file and apply the multilingual delimited filter and making sure that the Chinese characters (both Simplified & Traditional) will be exported properly and not as Gibbrish.
In the Comma or tab-seperated options, which codepages should I choose for Codepage if not included and for Export codepage from the dropdown list offered for each variant?
If there are other configuration that I should apply, please let me know
Thank... See more Hello supporters colanguages!
I want to import a csv file and apply the multilingual delimited filter and making sure that the Chinese characters (both Simplified & Traditional) will be exported properly and not as Gibbrish.
In the Comma or tab-seperated options, which codepages should I choose for Codepage if not included and for Export codepage from the dropdown list offered for each variant?
If there are other configuration that I should apply, please let me know
Thanks! ▲ Collapse | | |
esperantisto Local time: 16:17 Member (2006) English to Russian + ... SITE LOCALIZER
Use UTF-8. Just always use UTF-8. | | |
Omer Shani Israel Local time: 15:17 Member (2012) English to Hebrew + ... TOPIC STARTER
esperantisto wrote:
Use UTF-8. Just always use UTF-8.
I always use UTF-8.
I tried using UTF-32 and I tried using specific Chinese-Simplified export settings (Chinese Simplified 2312, 2312-80 etc) and still - Gibbrish. | | |
esperantisto Local time: 16:17 Member (2006) English to Russian + ... SITE LOCALIZER
Omer Shani wrote:
I always use UTF-8.
I tried using UTF-32 and I tried using specific Chinese-Simplified export settings (Chinese Simplified 2312, 2312-80 etc) and still - Gibbrish.
Where, at which point do you see gibberish? Share a sample file. | |
|
|
Omer Shani Israel Local time: 15:17 Member (2012) English to Hebrew + ... TOPIC STARTER Screenshot of the exported file | Jan 28, 2021 |
esperantisto wrote:
Omer Shani wrote:
I always use UTF-8.
I tried using UTF-32 and I tried using specific Chinese-Simplified export settings (Chinese Simplified 2312, 2312-80 etc) and still - Gibbrish.
Where, at which point do you see gibberish? Share a sample file.
I see Gibberish after exporting the file into my PC. Within memoQ, the text looks just fine.
It is not a local font issue, for other exported files, on Excel & Word look just fine when I open them.
You can see here an example how do I see it after exporting
| | |
Michael Beijer United Kingdom Local time: 13:17 Member Dutch to English + ...
~
[Edited at 2021-01-28 14:22 GMT] | | |
esperantisto Local time: 16:17 Member (2006) English to Russian + ... SITE LOCALIZER Excel or what? | Jan 28, 2021 |
It is not clear how you open the file. I suspect, it is Excel. A very dumb program, like other garbage from Microsoft. Try LibreOffice or Apache OpenOffice, they open files in UTF-8 without problems. | | |
Omer Shani Israel Local time: 15:17 Member (2012) English to Hebrew + ... TOPIC STARTER
Yes, I knoe csv or Excel are not the best formats, but that is what the client is using. I need to align to their demands and not offer them another format which is not recognized from their end | |
|
|
esperantisto Local time: 16:17 Member (2006) English to Russian + ... SITE LOCALIZER Program, not format | Jan 29, 2021 |
I was talking about the particular program that you use, not about the file format. Once again: try opening the file in a good program such as LibreOffice/Apache OpenOffice or any good text editor (and no, Windows Notepad is not a good text editor). The screenshot provided implies that the file is opened in a wrong encoding, which is a fault of Excel or whatever program you use.
[Адрэдагавана 2021-01-29 07:01 GMT] | | |
Sam Abramov Russian Federation Local time: 20:17 English to Russian + ...
Well... I have lots of Asian characters inside my translation. Haven't been an issue since around 2010.
Most latest versions of Excel and MS Word are able to deal with most non-latin fonts surprisingly well.
Wouldn't it be an easy solution for you to just break segments into latin/asian pieces?
You can do it on the fly in MemoQ 8.7. | | |