spacy_ud

Transforms spaCy English parser output into a Universal Dependencies (UD) document.

The spaCy English parser produces a spaCy document (SpacyDoc).

This code transforms SpacyDocs into UdDocs.
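As a rough illustration of one thing such a transformation involves, the sketch below renumbers tokens the way UD's CoNLL-U format expects: spaCy indexes tokens from 0 and makes the root token its own head, while CoNLL-U numbers tokens from 1 and gives the root a head of 0. The names `to_ud_rows`, `make_tokens` and `LABEL_RENAMES` are hypothetical and are not taken from spacy_ud. The stand-in tokens only mimic the `i`, `text`, `dep_` and `head` attributes of spaCy tokens so the snippet runs without a model installed.

```python
from types import SimpleNamespace

# Illustrative subset of label renames (spaCy English label -> UD label).
# This is an assumption for the example, not the repository's mapping.
LABEL_RENAMES = {
    "ROOT": "root",
    "dobj": "obj",
    "nsubjpass": "nsubj:pass",
    "auxpass": "aux:pass",
}


def to_ud_rows(tokens):
    """Return (id, form, head, deprel) tuples using CoNLL-U style numbering."""
    rows = []
    for tok in tokens:
        # spaCy: the root is its own head. CoNLL-U: the root has head 0.
        head = 0 if tok.head.i == tok.i else tok.head.i + 1
        deprel = LABEL_RENAMES.get(tok.dep_, tok.dep_)
        rows.append((tok.i + 1, tok.text, head, deprel))
    return rows


def make_tokens(spec):
    """Build stand-in tokens from (text, head_index, dep) triples."""
    toks = [
        SimpleNamespace(i=i, text=text, dep_=dep, head=None)
        for i, (text, _, dep) in enumerate(spec)
    ]
    for tok, (_, head_index, _) in zip(toks, spec):
        tok.head = toks[head_index]
    return toks


tokens = make_tokens([
    ("She", 1, "nsubj"),
    ("reads", 1, "ROOT"),
    ("books", 1, "dobj"),
])
for row in to_ud_rows(tokens):
    print(*row, sep="\t")
```

A real conversion has to do more than rename labels and shift indices (for example, restructuring prepositional phrases), so treat this only as a picture of the indexing difference.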

Usage:

If you already have a spaCy document (the output of the spaCy parser), convert it to a UdDoc with:

ud_doc = spacy_to_ud_doc(spacy_doc)

To print a UdDoc, use:

ud_doc.print_doc()

To create a spaCy doc, first create an instance of SpacyParser:

spr = SpacyParser()

Then parse a sentence to get a spaCy doc:

spacy_doc = spr.nlp('Spacy parser is pretty good')

To print a spaCy doc, use:

print_spacy_doc(spacy_doc)

For more information, see the comments in main.py.

ud_sentences.txt contains 197 sentences, taken mostly from the examples at https://universaldependencies.org/u/dep/all.html

Together they should cover most of the dependency relations used in English.
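One way to check a coverage claim like this is to compare the relation labels seen in a set of parses against the 37 universal relations defined by UD v2. The sketch below assumes you have already collected the labels into a list; `missing_relations`, `base_relation` and the sample `observed` list are hypothetical and do not come from this repository or its data files.

```python
# The 37 universal dependency relations of UD v2.
UD_RELATIONS = {
    "acl", "advcl", "advmod", "amod", "appos", "aux", "case", "cc",
    "ccomp", "clf", "compound", "conj", "cop", "csubj", "dep", "det",
    "discourse", "dislocated", "expl", "fixed", "flat", "goeswith",
    "iobj", "list", "mark", "nmod", "nsubj", "nummod", "obj", "obl",
    "orphan", "parataxis", "punct", "reparandum", "root", "vocative",
    "xcomp",
}


def base_relation(deprel):
    """Strip a subtype, e.g. 'nsubj:pass' -> 'nsubj'."""
    return deprel.split(":", 1)[0]


def missing_relations(deprels):
    """Return the universal relations that never occur in deprels, sorted."""
    seen = {base_relation(d) for d in deprels}
    return sorted(UD_RELATIONS - seen)


# Made-up sample labels, standing in for labels read from real parses.
observed = ["nsubj", "root", "obj", "nsubj:pass", "aux:pass", "punct"]
missing = missing_relations(observed)
covered = len(UD_RELATIONS) - len(missing)
print(f"{covered} of {len(UD_RELATIONS)} relations covered")
print("missing:", ", ".join(missing))
```

Some universal relations, such as `clf`, are not expected in English at all, which fits the "most" in the claim above.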

ud_sentences_parses_curated.txt contains Stanza parses that were manually curated to conform to UD, because Stanza's output is not always correct.

spacy_ud was developed using these sentences.