Allow regex-based stopwords #368

Open
asongtoruin opened this issue May 2, 2018 · 1 comment

Comments

@asongtoruin

This is less an issue than a feature request: would it be feasible to allow regex-based stopwords? If a body of text has some consistent part you want to remove (e.g. stage directions, Twitter usernames, email addresses), it might be easier to remove it with a pattern in the stopwords than to catch each instance individually.

Happy to have a look at this myself over the next few weeks if it's something that you think would be helpful.

Thanks!
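
To make the request concrete, here is a minimal sketch of how pattern-based stopwords could work, using only Python's standard `re` module. The function name `filter_tokens` and the `stopword_patterns` parameter are hypothetical, chosen for illustration; they are not part of any existing API in this project.

```python
import re


def filter_tokens(tokens, stopwords=(), stopword_patterns=()):
    """Drop tokens that are literal stopwords or fully match any regex pattern.

    Hypothetical helper for illustration only; not an existing library function.
    """
    literal = {word.lower() for word in stopwords}
    compiled = [re.compile(p, re.IGNORECASE) for p in stopword_patterns]
    kept = []
    for token in tokens:
        if token.lower() in literal:
            continue  # plain stopword, compared case-insensitively
        if any(rx.fullmatch(token) for rx in compiled):
            continue  # the whole token matches one of the patterns
        kept.append(token)
    return kept


tokens = "thanks @alice and @bob mail me at carol@example.com about the play".split()
patterns = [
    r"@\w+",                  # Twitter-style usernames
    r"[^@\s]+@[^@\s]+\.\w+",  # rough email shape, not a full validator
]
result = filter_tokens(tokens, stopwords={"and", "the", "at"}, stopword_patterns=patterns)
print(result)  # ['thanks', 'mail', 'me', 'about', 'play']
```

Using `fullmatch` rather than `search` means a pattern only removes a token when it covers the whole token, so a pattern like `@\w+` does not also discard ordinary words that happen to contain a matching fragment. Multi-word spans such as stage directions would need to be stripped from the raw text before tokenizing instead.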

@amueller
Owner

amueller commented May 2, 2018 via email
