Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Patterns containing multiple synsets are not being matched correctly #61

Open
philip-schrodt opened this issue Dec 10, 2018 · 0 comments

Comments

@philip-schrodt
Copy link
Contributor

In AFP_SPA_19940921.0205_7.0, the verb pattern HACER &ARTICULO &DETENCION is matched, but the" &DETENCION" part does not occur in the sentence. This leads to another potential issue: HACER...&ARTICULO is a very common combination of words—the synset &ARTICULO just contains the Spanish articles [EL, LA, LAS, LOS, UN, UNO, UNA…] and consequently is likely to match inappropriately in a large number of cases: this might partially account for the high number of false positives we are currently seeing in Spanish but not in English or Arabic. More generally, there are 1579 patterns in CAMEO.spanish.verpatterns.181009.txt containing two or more synsets, so if this part of the code isn't working a lot of incorrect pattern matches are being generated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant