-
Notifications
You must be signed in to change notification settings - Fork 239
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add remove emoji #111
Comments
Hey @rezasugiarto, thank you for opening an issue! 🎉 It sounds like a good idea. Can you please provide us with an example with a sample code? We will need to make sure the solution is fast enough and correct. If you haven't done yet, please read CONTRIBUTING.md. Also, you might find this python toolkit useful: python-ftfy. Regards, |
I have code that does this! It is not very fast, but it works. I also have code that tokenizes emails and telephone numbers. Would you be intested in a PR to include these? |
Hi, sorry I haven't done it yet. maybe we can collaborate on it @polvoazul |
Hey @polvoazul, thank you for your comment. Can you give us further information regarding your code? Yes, they might be useful. Regarding tokenizing emails and telephone numbers, for now we are simply using a regular expression but we were considering switch to spacy #131. What's and how does it works your solution? |
I believe the import / new dependency of the |
Ohh, i think i missed it. It would be perfect i guess |
Hi, do you think its great to have remove emoticon in preprocessing? Its on top of this one. Would help on it.
The text was updated successfully, but these errors were encountered: