Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add XPOS for verbs and numbers #88

Closed
nschneid opened this issue Jun 12, 2023 · 2 comments
Closed

add XPOS for verbs and numbers #88

nschneid opened this issue Jun 12, 2023 · 2 comments
Milestone

Comments

@nschneid
Copy link
Contributor

  • Knowing the form of the verb is important for detecting clause-level constructions like passives
  • Knowing the difference between numbers and other N/D words can be useful, and the distinction is made in PTB and UD (including the nummod relation distinct from det)
  • Rather than implement a full set of morphosyntactic features, an incremental step is to import fine-grained POS tags. UD has XPOS tags which for English follow the PTB tagset.
    • Can go directly in .cgel file with :xpos feature, alongside the existing lexical categories
  • For now, limit to CD (number), MD (modal aux), and the following for all other verbs: VB (plain form), VBD (preterite), VBG (gerund-participial form), VBN (past participle), VBP (present tense except 3sg), VBZ (present 3sg)
    • For verbs, CGEL makes some finer-grained distinctions, like modal present vs. preterite, and incorporates negation clitics. And the tags collapse some of the paradigm cells for be. But these further distinctions are groupings of closed-class lexemes, so it isn't strictly necessary to have finer-grained tags.
    • Importing all XPOS tags might cause confusion as there are significant boundary differences for categories like P and D in CGEL versus other schemes (as we explored here)
@nschneid
Copy link
Contributor Author

merged in #93

@nschneid
Copy link
Contributor Author

nschneid commented Sep 8, 2024

We also have :xpos "LS" for list item markers.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant