V4.0.0 - Initial release of GUM series 4
NOTE: the version in top level folders is numbered 4.0.0nr for "no reddit". The full version 4.0.0 can be generated automatically, see README_reddit.md
on compiling reddit data.
- Added 25 new documents in four new genres
- New genres: academic, bio, fiction, reddit
- Total: 101 documents
- The 6 reddit documents are only included without token data in _/build/src/
- Build bot now generates Universal Dependencies parses and morphology from gold Stanford Dependencies
- Numerous error corrections and build bot bug fixes