Overview of Classical Greek Syntactical Treebanks

NT software (coarse-grained and/or idiosyncratic formatting): Accordance, Cascadia, Global Bible Initiative, Lowfat, OpenText

https://jonathanrobie.biblicalhumanities.org/blog/2017/12/20/treebanks-for-ancient-greek/

Ancient Greek and Latin Dependency Treebanks (AGDT; medium-grained, established international standard for Classics)

Gorman Trees (> 500K tokens)

https://github.com/perseids-publications/gorman-trees/tree/master/public/xml

Perseus DL (> 500K tokens; overlaps with Gorman Trees)

https://github.com/PerseusDL/treebank_data/tree/master/v2.1/Greek/texts

Pedalion Trees (ca. 119K tokens)

https://github.com/perseids-publications/pedalion-trees/tree/master/public/xml

Sematia -> PapyGreek Treebanks (ca. 6K tokens; documentary papyri)

https://zenodo.org/records/5074307

https://papygreek.hum.helsinki.fi/annotated/export_data

GlauX (> 2M tokens; automated, semi-automated, and manual)

https://github.com/perseids-publications/glaux-trees/
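
For orientation, AGDT-style XML can be read with the Python standard library. The sketch below assumes the usual AGDT layout, in which `sentence` elements contain `word` elements carrying `form`, `lemma`, `postag`, `relation`, and `head` attributes. The three-word sample is hand-written for illustration and is not taken from any of the treebanks listed here. The function name `read_sentences` is hypothetical, and individual files in these repositories may deviate from this layout (for example, extra attributes or empty `head` values).

```python
import xml.etree.ElementTree as ET

# Hand-written illustrative sample in AGDT-style XML (not real treebank data).
SAMPLE_XML = """<treebank>
  <sentence id="1">
    <word id="1" form="ὁ" lemma="ὁ" postag="l-s---mn-" relation="ATR" head="2"/>
    <word id="2" form="ἄνθρωπος" lemma="ἄνθρωπος" postag="n-s---mn-" relation="SBJ" head="3"/>
    <word id="3" form="λέγει" lemma="λέγω" postag="v3spia---" relation="PRED" head="0"/>
  </sentence>
</treebank>"""


def read_sentences(xml_text):
    """Return a list of sentences, each a list of word dictionaries."""
    root = ET.fromstring(xml_text)
    sentences = []
    for sentence in root.iter("sentence"):
        words = []
        for word in sentence.iter("word"):
            words.append({
                "id": int(word.get("id")),
                "form": word.get("form"),
                "lemma": word.get("lemma"),
                "postag": word.get("postag"),
                "relation": word.get("relation"),
                # Assumption: a missing or empty head is treated as 0 (the root).
                "head": int(word.get("head") or 0),
            })
        sentences.append(words)
    return sentences


sentences = read_sentences(SAMPLE_XML)
for word in sentences[0]:
    print(word["id"], word["form"], word["relation"], word["head"])
```

To read a downloaded file instead of the inline sample, `ET.parse(path).getroot()` can stand in for `ET.fromstring(xml_text)`.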

PROIEL treebanks (ca. 2M tokens; fine-grained; some overlap with UD-formatting)

https://github.com/proiel/proiel-treebank

Universal Dependencies (UD) Treebanks (fine-grained; established international standard for Computational Linguistics)

PROIEL UD

v2.12 of all UD treebanks at http://hdl.handle.net/11234/1-5150

UD from PROIEL description: https://github.com/UniversalDependencies/UD_Ancient_Greek-PROIEL/tree/master

UD from AGDT description: https://github.com/UniversalDependencies/UD_Ancient_Greek-Perseus/tree/master

PTNK UD (34K tokens from LXX)

https://github.com/UniversalDependencies/UD_Ancient_Greek-PTNK/tree/master
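
UD treebanks are distributed in the CoNLL-U format: one token per line, ten tab-separated columns, `#` comment lines, and a blank line between sentences. As a minimal sketch, the following reads that format without external libraries. The sample sentence is hand-written for illustration and is not drawn from any of the treebanks above, and the function name `parse_conllu` is hypothetical. Multiword-token ranges (IDs such as `1-2`) and empty nodes (IDs such as `1.1`) are skipped, which is one possible choice rather than the only one.

```python
# The ten CoNLL-U columns, in order.
FIELDS = ["id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc"]

# Hand-written illustrative sample (not real treebank data).
SAMPLE_CONLLU = (
    "# sent_id = example-1\n"
    "# text = ὁ ἄνθρωπος λέγει\n"
    "1\tὁ\tὁ\tDET\t_\t_\t2\tdet\t_\t_\n"
    "2\tἄνθρωπος\tἄνθρωπος\tNOUN\t_\t_\t3\tnsubj\t_\t_\n"
    "3\tλέγει\tλέγω\tVERB\t_\t_\t0\troot\t_\t_\n"
    "\n"
)


def parse_conllu(text):
    """Return a list of sentences, each a list of token dictionaries."""
    sentences, current = [], []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):
            continue  # sentence-level comment or metadata
        cols = line.split("\t")
        # Keep only ordinary tokens: ten columns and a plain integer ID.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        current.append(dict(zip(FIELDS, cols)))
    if current:
        sentences.append(current)
    return sentences


sentences = parse_conllu(SAMPLE_CONLLU)
token_count = sum(len(s) for s in sentences)
print(len(sentences), "sentence(s),", token_count, "token(s)")
```

Counting tokens this way gives a rough check against the token figures quoted in this overview, bearing in mind that treebanks differ in how they count punctuation and multiword tokens.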