Minor updates; n-grams are now the standard
This release includes some minor updates:
- Ability to import vocabulary without CPT-4
- Lowered minimum heap size to 1GB so Usagi will run on 32-bit Java
- Mapping files are now stored and loaded as UTF-8, so additional information can contain Chinese characters
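
To illustrate the UTF-8 item in the list above, here is a minimal sketch of writing and reading a file with an explicit charset in Java. It is not Usagi's actual code: the class `Utf8MappingFile`, its methods, and the sample row are hypothetical and only show why naming the charset matters (without it, older Java versions fall back to the platform default encoding, which may not be able to represent Chinese characters).

```java
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper, not part of Usagi: round-trips one line through a UTF-8 file.
public class Utf8MappingFile {

    // Write a single line, naming UTF-8 explicitly instead of relying on the platform default.
    public static void write(Path file, String line) throws IOException {
        try (BufferedWriter writer = Files.newBufferedWriter(file, StandardCharsets.UTF_8)) {
            writer.write(line);
            writer.newLine();
        }
    }

    // Read the first line back, again decoding as UTF-8.
    public static String readFirstLine(Path file) throws IOException {
        try (BufferedReader reader = Files.newBufferedReader(file, StandardCharsets.UTF_8)) {
            return reader.readLine();
        }
    }

    // Write the line to a temporary file, read it back, and clean up.
    public static String roundTrip(String line) {
        try {
            Path file = Files.createTempFile("mapping", ".csv");
            try {
                write(file, line);
                return readFirstLine(file);
            } finally {
                Files.deleteIfExists(file);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Made-up row; the Chinese characters are written as Unicode escapes
        // so the source file itself stays ASCII.
        String row = "12345,\u9AD8\u8840\u538B";
        System.out.println(row.equals(roundTrip(row)));
    }
}
```

Because both the writer and the reader name the same charset, the text survives the round trip regardless of the operating system's default encoding.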
Using n-grams for term similarity has now become the standard.
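
For readers unfamiliar with the idea, the following is a rough sketch of n-gram-based term similarity. It is an assumption-laden illustration, not Usagi's implementation: it uses character trigrams and Jaccard overlap, and the class `NGramSimilarity` and its methods are hypothetical names chosen for this example.

```java
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration of n-gram similarity; Usagi's own scoring may differ.
public class NGramSimilarity {

    // Collect the distinct character n-grams of a lowercased term.
    public static Set<String> nGrams(String term, int n) {
        Set<String> grams = new HashSet<>();
        String normalized = term.toLowerCase(Locale.ROOT);
        for (int i = 0; i + n <= normalized.length(); i++) {
            grams.add(normalized.substring(i, i + n));
        }
        return grams;
    }

    // Jaccard similarity: number of shared n-grams divided by the number of distinct n-grams overall.
    public static double similarity(String a, String b, int n) {
        Set<String> gramsA = nGrams(a, n);
        Set<String> gramsB = nGrams(b, n);
        if (gramsA.isEmpty() && gramsB.isEmpty()) {
            return 0.0; // both terms are shorter than n, so there is nothing to compare
        }
        Set<String> shared = new HashSet<>(gramsA);
        shared.retainAll(gramsB);
        Set<String> all = new HashSet<>(gramsA);
        all.addAll(gramsB);
        return (double) shared.size() / all.size();
    }

    public static void main(String[] args) {
        // Terms that share most of their spelling score high; unrelated terms score low.
        System.out.println(similarity("hypertension", "hypertensive", 3));
        System.out.println(similarity("hypertension", "diabetes", 3));
    }
}
```

The appeal of n-grams over whole-word matching is that small spelling differences, such as a changed suffix, only affect a few n-grams, so closely related terms still score as similar.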