This example demonstrates how to train the word segmentation model on the dataset provided in the paper "Learning to Discover, Ground, and Use Words with Segmental Neural Language Models" by Kazuya Kawakami, Chris Dyer, and Phil Blunsom.
A segmental neural language model (SNLM) is instantiated from the library of standard models, a custom training loop is defined, and the training loss is reported for each epoch. For an explanation of this approach, see the Understanding WordSeg doc.
This implementation is not affiliated with DeepMind and has not been verified by the authors.
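The custom training loop follows the usual Swift for TensorFlow pattern: compute the loss and its gradient with valueWithGradient(at:), apply the update with an optimizer, and report the loss for each epoch. The sketch below illustrates that pattern with a hypothetical stand-in classifier and synthetic data rather than the SNLM itself, whose lattice-based loss is more involved; the type SimpleClassifier, the tensor shapes, and the hyperparameters shown are illustrative assumptions, not values taken from this example.

import TensorFlow

// Hypothetical stand-in model; the real example trains an SNLM instead.
struct SimpleClassifier: Layer {
    var dense = Dense<Float>(inputSize: 4, outputSize: 3)

    @differentiable
    func callAsFunction(_ input: Tensor<Float>) -> Tensor<Float> {
        dense(input)
    }
}

var model = SimpleClassifier()
var optimizer = Adam(for: model, learningRate: 0.01)

// Synthetic data, for illustration only.
let features = Tensor<Float>(randomNormal: [32, 4])
let labels = Tensor<Int32>(zeros: [32])

for epoch in 1...10 {
    // Differentiate the loss with respect to the model, then apply the update.
    let (loss, gradient) = valueWithGradient(at: model) { model -> Tensor<Float> in
        softmaxCrossEntropy(logits: model(features), labels: labels)
    }
    optimizer.update(&model, along: gradient)
    print("Epoch \(epoch): loss \(loss)")
}

The example's own loop applies this same pattern to the SNLM, using the datasets loaded from the paths passed on the command line.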
To begin, you'll need the latest version of Swift for TensorFlow installed. Make sure you've added the correct version of swift to your path.
To train the model to accuracy using the full datasets published in the paper, run:
cd swift-models
swift run -c release WordSeg
To train the model using a smaller, unrealistic sample dataset, run:
swift run -c release WordSeg \
--training-path Examples/WordSeg/smalldata.txt \
--validation-path Examples/WordSeg/smalldata.txt
To train the model on your own dataset, run:
swift run -c release WordSeg \
--training-path path/to/training_data.txt \
[ --validation-path path/to/validation_data.txt \
[ --test-path path/to/test_data.txt ]]
To view a list of all configurable parameters and their defaults, run:
swift run -c release WordSeg --help