I'm having some problem with getting Python to handle my unicode text correctly.
I've boiled it down to the following:
>>>print 'Høst'
Høst
>>>print u'Høst'
HÃ,st
>>>u = u'Høst'
>>>u
u'H\xf8st'
sys.stdout.encoding says that it's using UTF-8, which is most likely why the first, non-unicode, print works. If I just need to print something, then this would be fine. However I'm constructing an xml document, from data in a SQL Server and then it really need to be real unicode.
My data looks like it's perfectly good unicode data, u'H\xf8st' look right to me, so why does Python keep outputting it as 'HÃ,st'?
