-
-
Notifications
You must be signed in to change notification settings - Fork 105
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Emoji character in domains not recognized correctly (punycode) #231
Comments
In case you're wondering,
as it only supports
Apparently, |
icu4j is using the ICU license. This seems to be a tweaked version of the X11 License which is compatible with our own LGPL. |
Yeah as you wrote, they use the ICU license which is deemed compatible with GPL. So it should be ok, no? |
Yeah, that's good. I was just making a note here as I worked through figuring out how I want to integrate it. No action needed. |
Ok, this is a lot more complex than I originally thought looking at the bug report. TL;DR there's no good way for us to support this type of encoding from the The good news is that there's an existing API which will handle this correctly:
This API works a bit differently because it doesn't accept a full URL, but that's the same thing that makes the IDN conversion work as expected. For example, to get https you would use:
In order to support this from the Does this unblock your use case for the time being? |
Well, if I understand correctly, the My use case is not blocked as I use a different library to detect URLs in strings and the library can do URL normalization, including using ICU (I did the PR), so I simply pass dispatch the normalized URL. I created this issue because I believe that other users of Dispatch might have the same problem and because I think this library should be able to handle URL normalization properly (not only emojis, also other characters, as illustrated here), which the current |
How inconvenient this is largely depends on the application.
Yep, I hear you. I'm not closing the issue, just pointing out that this isn't going to be a quick, drop-in fix like I originally thought. This will take some time to get right. |
leads to
Its punycode [1] conversion [2] (
https://xn--i-7iq.ws/
) works as expected.https://i❤.ws/
points to a domain registrar and is SFW."https://i❤.ws/" visualization when not between backticks.
Opening a new issue because I cannot reopen an issue closed by a collaborator.
[1] https://en.wikipedia.org/wiki/Punycode
[2] used converter: https://www.punycoder.com/
The text was updated successfully, but these errors were encountered: