Skip to content

This repo stores the work I participated in the URP, COMETH, at RPI.

Notifications You must be signed in to change notification settings

shikanchen/language-modeling-cometh

Repository files navigation

language-modeling-cometh

This repo stores the work I participated in the URP, COMETH, at RPI.

preprocess corpus

Preprocess the corpus for fine-tuning

defineCorpus(publishers, passages, documents, output)

Fine-tune bert model

Use 10%, 10% and 80% propotions of the corpus for Test, Validation and Training, respectively to fine-tune the pretrained bert model

fineTuneEmbedding(publishers, output)

Combine with Flair Transformer

Combine the fine-tuned bert models with Flair Transformer

combineModelFlair(publishers, documents, ignores, output)

Embedding

Use Flair embedding for its Transformer

saveEmbedding(embeddingVectors, output)

About

This repo stores the work I participated in the URP, COMETH, at RPI.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages