Python SyntaxError: Non-UTF-8 [duplicate]

Question

I converted my Python script to a Mac.app (via py2app). I try to run it and get the following error:

SyntaxError: Non-UTF-8 code starting with '\xcf' in file 
py2app/dist/myapp.app/Contents/MacOS/myapp on line 1, but no encoding declared; see 
http://python.org/dev/peps/pep-0263/ for details

I visited the PEP website and added the following to the first two lines of my script:

#!/usr/bin/python
# -*- coding: utf-8 -*-

I have also put my code into various online tools (such as this one) to check whether there are any non-UTF-8 characters but I'm not getting any issues.

I did copy some text from an Excel file however there were no special symbols that I was aware of.

The script is approx 800 lines so is there a way of identifying the problem that doesn't involve manually scanning the script line-by-line?

EDIT

Not exactly a fix, but converting my script into an executable instead of a .app has fixed the issue and it now runs correctly.

Maybe see if you have any of these characters: tripleee.github.io/8bit/#cf — tripleee
– tripleee, Commented Sep 29, 2019 at 11:16
If you copy/pasted into an on-line UTF-8 checker then your editor talking to your browser by way of copy/paste buffer probably converted the text on the way. — tripleee
– tripleee, Commented Sep 29, 2019 at 11:17
Try iconv -t utf-16le file >/dev/null and examine the error output. — tripleee
– tripleee, Commented Sep 29, 2019 at 11:19
@tripleee no error output with that command... every check I've run so far has told me it's all UTF-8 — DDiran
– DDiran, Commented Sep 29, 2019 at 11:21
Can you add a hex dump of py2app/dist/myapp.app/Contents/MacOS/myapp please? (MacOS has a command xxd; maybe prune the output if it's excessive.) — tripleee
– tripleee, Commented Sep 29, 2019 at 11:27

Giacomo Catenazzi · Accepted Answer · 2019-09-30 09:34:49Z

Python 3 uses UTF-8 as default encoding. This simplify the codes you get from Internet (and other packages). \xcf in UTF-8 is valid only if the byte before has predefined values, which it is not the case: Non-UTF8 code starting mean this, it is not a valid start (first byte) of UTF8 codepoint encoding.

As you see in the comment, you may convert the file into UTF-8, many times you can ignore the initial encoding (often such errors are from comments, e.g. author name). you may convert it, e.g. on options in Saving As on your original editor.

As an alternate way, you can specify the encoding on the first few lines of your code, see PEP-263 on how to do it. Note: Python has hardcoded byte strings to check [because it has not idea of encoding], so try to copy exactly the string as in such document. I think such line # -*- coding: latin-1 -*- should be ok, but this could misinterpret some characters, so test your program. If you do no know the original encoding, the easier way it is to convert original source (because you should in any case check all strings in the source code, and check if you guessed the correct encoding).

i added this and its works, i speaking spanish and use vscode editor #!/usr/bin/python # -*- coding: latin-1 -*-

Collectives™ on Stack Overflow

Python SyntaxError: Non-UTF-8 [duplicate]

1 Answer 1

1 Comment

Linked

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

1 Comment

Linked

Related