YamlParser.ReadScalarAsString lockup when encountering some rare unicode characters #133

Open
Amethyst-szs opened this issue Dec 10, 2024 · 0 comments

Comments

@Amethyst-szs

Problem:

I've been parsing a large database of text in different languages and ran into a set of Unicode characters that causes YamlParser.ReadScalarAsString to lock up permanently, without throwing an exception or reporting any other kind of error.

Example:

The initial culprit was "２", the full-width digit two [U+FF12]. Some surface-level testing showed that the other full-width digits cause the same parsing lockup. All other text I tried ran without issue.

Priority:

This isn't a high-priority fix for me, since I can just clean up the data before passing it into the YamlParser, but I thought it was worth documenting in case it's a sign of a larger Unicode handling issue. Thanks, take care!
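For anyone hitting the same lockup, the pre-cleaning workaround mentioned above could look roughly like the sketch below. It is written in Python only to illustrate the idea and is not part of the YAML library. The helper names `find_fullwidth_digits` and `replace_fullwidth_digits` are hypothetical, and it assumes that swapping full-width digits (U+FF10 to U+FF19) for their ASCII equivalents is acceptable for your data.

```python
# Hypothetical pre-cleaning helpers; nothing here belongs to the YAML library.
# Assumption: replacing full-width digits with ASCII digits is acceptable.

FULLWIDTH_ZERO = 0xFF10  # U+FF10 FULLWIDTH DIGIT ZERO

# Translation table: code points U+FF10..U+FF19 -> ASCII '0'..'9'.
FULLWIDTH_DIGITS = {FULLWIDTH_ZERO + i: ord("0") + i for i in range(10)}


def find_fullwidth_digits(text):
    """Return (index, character) pairs for every full-width digit in text."""
    return [(i, ch) for i, ch in enumerate(text) if ord(ch) in FULLWIDTH_DIGITS]


def replace_fullwidth_digits(text):
    """Return text with full-width digits replaced by ASCII digits."""
    return text.translate(FULLWIDTH_DIGITS)


sample = "価格: \uff12\uff10\uff12\uff14円"
offenders = find_fullwidth_digits(sample)   # where the problem characters sit
cleaned = replace_fullwidth_digits(sample)  # safe to hand to the parser
```

A targeted table is used here rather than full NFKC normalization because NFKC would also rewrite other compatibility characters (full-width letters, ligatures, and so on), which may not be wanted in a multilingual text database. If broader normalization is fine, .NET's `string.Normalize(NormalizationForm.FormKC)` would cover these digits as well.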
