NER & Parsing not working for new language #82
In case someone comes across this in the future: The issue is that the whole stanza pipeline is integrated as the tokenizer in the spacy pipeline (which is a bit unexpected) and you're not running the tokenizer when you call:
Starting with a text,
With
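To illustrate the point above, here is a toy model in plain Python. It is not spaCy or spacy-stanza code: `ToyDoc`, `ToyPipeline`, and `annotating_tokenizer` are hypothetical names, and the sketch assumes that a pipeline skips its tokenizer when it is handed an already-built document instead of a string. Under that assumption, a pipeline whose tokenizer produces all the annotations returns an unannotated document without raising any error.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Union


@dataclass
class ToyDoc:
    """Hypothetical document: a list of words plus entity annotations."""
    words: List[str]
    ents: List[str] = field(default_factory=list)


def annotating_tokenizer(text: str) -> ToyDoc:
    """Hypothetical tokenizer that also annotates, standing in for the
    wrapped stanza pipeline. Capitalised words are treated as 'entities'."""
    words = text.split()
    ents = [w for w in words if w[:1].isupper()]
    return ToyDoc(words, ents)


class ToyPipeline:
    """Toy pipeline in which the tokenizer carries every annotation."""

    def __init__(self, tokenizer: Callable[[str], ToyDoc]) -> None:
        self.tokenizer = tokenizer
        # No further components: all annotations come from the tokenizer.
        self.components: List[Callable[[ToyDoc], ToyDoc]] = []

    def __call__(self, text_or_doc: Union[str, ToyDoc]) -> ToyDoc:
        if isinstance(text_or_doc, ToyDoc):
            # Assumption: a pre-built document bypasses the tokenizer.
            doc = text_or_doc
        else:
            doc = self.tokenizer(text_or_doc)
        for component in self.components:
            doc = component(doc)
        return doc


nlp = ToyPipeline(annotating_tokenizer)

# Starting from raw text, the tokenizer runs and the entities are set.
from_text = nlp("Ada lives in Paris")

# Starting from a pre-built document, the tokenizer is skipped silently,
# so the document comes back without entities.
from_doc = nlp(ToyDoc(["Ada", "lives", "in", "Paris"]))
```

In this sketch `from_text.ents` contains the two capitalised words, while `from_doc.ents` stays empty, which mirrors the symptom described in the issue: no entities and no error message.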
I am currently trying to import stanza's NER and dependency parsing for Arabic into spacy.
As mentioned in a different issue, there seems to be a problem with "mwt" when importing the named entities into the spacy object. The Arabic pipeline is no different, so I have to deal with the same problem.
To deal with this I thought of the following workaround:
So far everything works fine.
But once I call the nlp pipeline, the returned object does not have entities, and `has_annotation` is also False. There are no error messages, so I don't know what I am doing wrong.
But it seems like the stanza pipeline is not even called. It isn't because of `tokenize_pretokenized` either. Are there just missing error messages, and is it the same problem as #32?
Minimal working example (without entities). Translation is: "I am hungry. I am going home."