[solved] loadString, system() & Unicode

daniel · April 4, 2018, 3:53pm

Hello hello,

I’m having a bit of trouble with Utf-8 encoded files. Let’s make it a very simple example:

CI_LOG_I(loadString(loadAsset("test.yaml"));

And test.yaml:

ÄÄää

If I save my file as a latin1 variant (i.e. ISO-8869-15) I see the correct output in my Visual Studio Output. If I save it as UTF-8 or UTF-8 with BOM I get something like “Ã„Ã„Ã”. (Tested with vim and Sublime)

Now I’m not too sure how Visual Studio Console Output handles Unicode. So maybe that is the problem. If I make TextBox and use my loaded string it works like a charm either way (fair enough!).

However: I need to pipe my loaded content at some point to system() for some ugly behind-the-scenes font rendering mumbo jumbo. In general system() should be fine with this and don’t care too much about the encoding. I end up getting the very same “Ã„Ã„Ã” rendered though which makes me think that how I (or cinder or c++ or…) handles the encoding might be the problem.

I’d be grateful for any ideas or hints on this, thank you!

// edit:
So it basically seems that at some point my UTF-8 encoded string is treated as latin-1. For example I get the same kind of characters if I save a html file as unicode and ommit the Content-Type: Charset=Utf-8 header.

// edit2:
I’m on windows

andrewfb · April 5, 2018, 2:59am

I believe this is actually a bug in Cinder’s console() function (which wraps OutputDebugString() on MSW). I’ve made a pull request here:

I’d be curious if that addresses your use case as well.

daniel · April 5, 2018, 8:31am

Yes! It did indeed and also helped me with my other problem.

To pass on unicoded arguments to a system call I now use this:

_wsystem( msw::toWideString(cmd).c_str() );

Thank you so much, Andrew.

Topic		Replies	Views
UTF8 encoding and Cinder and cross-platform Using Cinder	2	1470	June 30, 2016
Filepath looks fine in UTF8 BOM, but doesn't render. UTF8 changes the string :S	1	797	October 10, 2017
Yet another UTF-8 Question - loading files with Cyrillic paths Using Cinder	4	657	February 14, 2019
Windows text encoding	1	891	February 20, 2017
[Solved] Handling invalid UTF-8 characters Using Cinder	4	1159	June 20, 2018

[solved] loadString, system() & Unicode

Related topics