Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle non-ASCII characters #38

Open
ashley-evans opened this issue Oct 21, 2021 · 0 comments
Open

Handle non-ASCII characters #38

ashley-evans opened this issue Oct 21, 2021 · 0 comments
Labels

Comments

@ashley-evans
Copy link
Owner

Value Added

More accurate occurrence results

Description

Currently non-ASCII characters can be found by getKeyPhrases within find-keyphrases, however, countKeyPhrases is unable to then count the number of occurrences of that key phrase/word on the page.

Therefore, the occurrence counting functionality should be updated such that it can handle these characters.

Acceptance Criteria

AC01

  • If a keyword/phrase has been found that includes non-ASCII characters then the countKeyPhrases function must be able to count the number of occurrences of that keyword/phrase

AC02

  • The number of occurrences must be stored in dynamodb as before
@ashley-evans ashley-evans added the 2 label Oct 21, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant