I’m having a bit of trouble with Utf-8 encoded files. Let’s make it a very simple example:
If I save my file as a latin1 variant (i.e. ISO-8869-15) I see the correct output in my Visual Studio Output. If I save it as UTF-8 or UTF-8 with BOM I get something like “Ã„Ã„Ã”. (Tested with vim and Sublime)
Now I’m not too sure how Visual Studio Console Output handles Unicode. So maybe that is the problem. If I make TextBox and use my loaded string it works like a charm either way (fair enough!).
However: I need to pipe my loaded content at some point to system() for some ugly behind-the-scenes font rendering mumbo jumbo. In general system() should be fine with this and don’t care too much about the encoding. I end up getting the very same “Ã„Ã„Ã” rendered though which makes me think that how I (or cinder or c++ or…) handles the encoding might be the problem.
I’d be grateful for any ideas or hints on this, thank you!
So it basically seems that at some point my UTF-8 encoded string is treated as latin-1. For example I get the same kind of characters if I save a html file as unicode and ommit the Content-Type: Charset=Utf-8 header.
I’m on windows