NLP Topic Modeling


In this repository, I've run some topic modeling experiments on the NIPS papers dataset.



Requirements:

  • Python 3.x
  • pandas
  • nltk
  • gensim
  • pyLDAvis



Usage

To prepare the corpus and train the LDA model, first extract data/papers.tar.xz into the data folder, then run:

python topic_modeling_train.py
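
If no command-line tool for .tar.xz archives is at hand, the extraction step can also be done from Python with the standard library. This is a minimal sketch; the helper name `extract_archive` is hypothetical and not part of this repository.

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir):
    """Extract a .tar.xz archive into dest_dir and return the member names."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, mode="r:xz") as tar:
        names = tar.getnames()
        # Use the safer "data" extraction filter where this Python version has it.
        if hasattr(tarfile, "data_filter"):
            tar.extractall(dest, filter="data")
        else:
            tar.extractall(dest)
    return names
```

From the repository root, calling `extract_archive("data/papers.tar.xz", "data")` should place the archive contents in the data folder.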

After training, you can view the visualization results in the lda_vis.html file.

To test the model:

python topic_modeling_test.py

There is also a Jupyter notebook containing the complete code for the training and testing process.