License:
Das Referenzkorpus Mittelhochdeutsch ist lizenziert unter einer Creative Commons Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International Lizenz.
No change is made on the corpus. This code is intended to parse the corpus.
- Go to https://www.linguistics.rub.de/rem/access/index.html.
- Click on "CORA-XML AKS .TAR.XZ" or "CORA-XML ALS .ZIP"
- Click on "Herunterladen".
- Uncompress the dowloaded file.
- You have a folder, named rem-corraled-20161222 (2019-09-18) with a list of XML files which are annotated texts.
The available code will parse individual XML files.