-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
High level helper functions for extracting typesystem independent annotations #87
Comments
Hi @aggarwalpiush :) The issue is that cassis is meant to be a generic type-system agnostic library. I.e. it should support any UIMA type system. In fact, we have users which use e.g. the cTAKES type system and may not work at all with DKPro Core. So we would need some way of @jcklie and I have beed throwing around a number of ideas, e.g.
... so the wrapper approach seems to us the most promising one for the moment. Also, cassis doesn't need to be extended to support it. That said ...
This is something which I think would be really nice to have. |
1 similar comment
Hi @aggarwalpiush :) The issue is that cassis is meant to be a generic type-system agnostic library. I.e. it should support any UIMA type system. In fact, we have users which use e.g. the cTAKES type system and may not work at all with DKPro Core. So we would need some way of @jcklie and I have beed throwing around a number of ideas, e.g.
... so the wrapper approach seems to us the most promising one for the moment. Also, cassis doesn't need to be extended to support it. That said ...
This is something which I think would be really nice to have. |
Wouldn't that also be type system specific?
|
If we imagine TOKEN and LEMMA to be type name string constants - no. |
How would cassis know what feature to use for |
In Python, one would normally just use a list comprehension for that, e.g.
|
For |
This would somewhat diminish the usefulness, as many types beyond token would not return useful results. If we use an accessor, couldn't it decide to return different feature values depending on the type? |
It probably could, but it could be confusing. E.g. if |
|
There is a way to access feature values, e.g. as @jcklie illustrated:
Right now, |
I will track the extension mechanism in #83 and the extension methods you want here so that we do not mix up the issues. |
Hi,
We are trying to extract token's text_strings and pos tags from cas objects. Also, different type systems lead to return different pos tags formats. @zesch Please correct me here if I am wrong.
It would be great to have some helper functions (some are shown in the below examples) that could solve these requests.
For example for the given
cas
object:We hope to see these helper functions as part of this API.
Thanks!!
The text was updated successfully, but these errors were encountered: