GitHub - HotGiardiniera/HMM-POS-Tagger

Hidden Markov Model Part of Speech Tagger using the Viterbi algorithm
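As a rough illustration of the technique named above, here is a minimal Viterbi sketch for a bigram HMM. It is not the code in viterbi.py: the function name, the dictionary layout, the `floor` value used for unseen events, and the toy probabilities are all assumptions made for this example.

```python
import math

def viterbi(words, tags, start_p, trans_p, emit_p, floor=1e-12):
    """Return the most likely tag sequence for `words` under a bigram HMM.

    Probabilities are plain nested dicts (an assumed layout); unseen
    events fall back to `floor` so that log() is always defined.
    """
    if not words:
        return []

    def lp(p):
        return math.log(p if p > 0 else floor)

    # best[i][t]: log-probability of the best path ending in tag t at word i
    best = [{t: lp(start_p.get(t, 0)) + lp(emit_p[t].get(words[0], 0)) for t in tags}]
    back = [{}]
    for i in range(1, len(words)):
        best.append({})
        back.append({})
        for t in tags:
            # Pick the previous tag that maximizes the path score into t
            prev, score = max(
                ((p, best[i - 1][p] + lp(trans_p[p].get(t, 0))) for p in tags),
                key=lambda pair: pair[1],
            )
            best[i][t] = score + lp(emit_p[t].get(words[i], 0))
            back[i][t] = prev

    # Follow the back-pointers from the best final tag
    path = [max(tags, key=lambda t: best[-1][t])]
    for i in range(len(words) - 1, 0, -1):
        path.append(back[i][path[-1]])
    path.reverse()
    return path

# Toy model with made-up numbers, for illustration only
TAGS = ["DT", "NN", "VB"]
START_P = {"DT": 0.8, "NN": 0.1, "VB": 0.1}
TRANS_P = {
    "DT": {"DT": 0.05, "NN": 0.9, "VB": 0.05},
    "NN": {"DT": 0.1, "NN": 0.1, "VB": 0.8},
    "VB": {"DT": 0.6, "NN": 0.3, "VB": 0.1},
}
EMIT_P = {
    "DT": {"the": 0.9},
    "NN": {"dog": 0.5, "barks": 0.1},
    "VB": {"barks": 0.6, "dog": 0.05},
}

print(viterbi(["the", "dog", "barks"], TAGS, START_P, TRANS_P, EMIT_P))
# ['DT', 'NN', 'VB']
```

Working in log space avoids numeric underflow on long sentences, which is why the sketch adds log-probabilities instead of multiplying raw probabilities.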

Requirements: Python 3.x

Training:

Invoked with ./train_set.py -f <path to training file>

Other arguments can be seen through ./train_set.py --help
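Training an HMM tagger generally comes down to counting tag-to-tag transitions and tag-to-word emissions. The sketch below shows that idea only. The training file format and the internals of train_set.py are not described here, so the input is assumed to be already-parsed sentences of (word, tag) pairs, and `count_hmm`, `normalize`, and the `<s>` start marker are hypothetical names.

```python
from collections import Counter, defaultdict

START = "<s>"  # hypothetical pseudo-tag marking the start of a sentence

def count_hmm(tagged_sentences):
    """Count transitions (previous tag -> tag) and emissions (tag -> word)."""
    transitions = defaultdict(Counter)
    emissions = defaultdict(Counter)
    for sentence in tagged_sentences:
        prev = START
        for word, tag in sentence:
            transitions[prev][tag] += 1
            emissions[tag][word] += 1
            prev = tag
    return transitions, emissions

def normalize(counts):
    """Turn nested counts into relative-frequency estimates."""
    return {
        outer: {inner: n / sum(row.values()) for inner, n in row.items()}
        for outer, row in counts.items()
    }

# Tiny made-up corpus, for illustration only
corpus = [
    [("the", "DT"), ("dog", "NN"), ("barks", "VB")],
    [("the", "DT"), ("cat", "NN")],
]
transitions, emissions = count_hmm(corpus)
print(normalize(emissions)["NN"])  # {'dog': 0.5, 'cat': 0.5}
```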

Viterbi:

Invoked with ./viterbi.py

With no arguments it assumes there is a file in the invoking directory called 'WSJ_24.words'

You can specify a sentence file with the '-s' argument, as in ./viterbi.py -s <path_to_sentence_file>

All sentence files are assumed to contain one word per line, with sentences separated by a blank line
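To make the expected input layout concrete, here is a small reader sketch. It assumes that a blank line marks the boundary between sentences, which is one reading of the description above; `read_sentences` is a hypothetical helper and not a function from this repository.

```python
def read_sentences(lines):
    """Group an iterable of lines into sentences (lists of words).

    Each non-empty line is one word; a blank line ends the current sentence.
    """
    sentences, current = [], []
    for line in lines:
        word = line.strip()
        if word:
            current.append(word)
        elif current:
            sentences.append(current)
            current = []
    if current:  # file did not end with a blank line
        sentences.append(current)
    return sentences

sample = ["The\n", "dog\n", "barks\n", "\n", "It\n", "runs\n"]
print(read_sentences(sample))
# [['The', 'dog', 'barks'], ['It', 'runs']]
```

Because the function accepts any iterable of lines, an open file object can be passed directly, for example `with open(path) as f: sentences = read_sentences(f)`.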

Other arguments can be seen through ./viterbi.py --help

Note: This program expects two JSON files in the same directory: trainingDataPOS.json and trainingDataWORD.json
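A quick pre-flight check along these lines can confirm that both files are present before running the tagger. This is a sketch only: `missing_training_files` is a hypothetical helper, and the directory to check (the current one by default) is an assumption.

```python
from pathlib import Path

# File names taken from the note above
REQUIRED = ("trainingDataPOS.json", "trainingDataWORD.json")

def missing_training_files(directory="."):
    """Return the required training files that are absent from `directory`."""
    base = Path(directory)
    return [name for name in REQUIRED if not (base / name).is_file()]

missing = missing_training_files()
if missing:
    print("Missing training data:", ", ".join(missing))
```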

Files:

.gitignore
README.md
State.py
WSJ_24.pos
__init__.py
train_set.py
viterbi.py
