-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issues with UTF-8 encoded files #21
Comments
Chocolatey (the exe) has a command |
Thanks. @ferventcoder: how would you use |
It's in the rewritten Chocolatey that has not been released yet (about 70% of the way there on existing features) - see our kickstarter for a preview - https://www.kickstarter.com/projects/ferventcoder/chocolatey-the-alternative-windows-store-like-yum The kickstarter is to support improvements to infrastructure, addition of pro/business licenses for features that cost $$ to provide, and the longer term future of Chocolatey itself. |
Ah I see, thanks. On Tue, Nov 4, 2014 at 5:32 PM, Rob Reynolds [email protected]
|
This issue was originally reported here: https://stackoverflow.com/questions/26693809/creating-a-chocolatey-package-using-the-quick-start-guide-fails-the-utf-8-test-e
Summary: Warmup does not seem to to parse UTF-8 encoded files correctly.
Consider the follwing comment with non-ASCII characters in the nuspecs of the Chocolatey templates repository:
After a
warmup chocolatey some-package
, the comment insome-package.nuspec
becomes to:As you can see, all non-ASCII characters are replaced with
?
. This is a similar issue that affects many Microsoft products, e.g. notepad.exe, PowerShell, cmd.exe or Visual Studio. A BOM would force it to parse it as UTF-8, but of course we all know that a BOM for UTF-8 is deprecated, not recommended by the Unicode standard and could cause issues with programs that do not expect a BOM. Microsoft is probably the only larger company which still ignores this part of the standard, even more than a decade after UTF-8 became a de-facto standard for text files. 😠This issue is probably also a cause why some packages on chocolatey.org have broken Unicode characters.
The solution here would be to integrate a character encoding detection library to detect all common character encodings and then output the nuspecs as UTF-8 without BOM.
The text was updated successfully, but these errors were encountered: