- Fix dependencies (#117, #123)
- Fix scikit-learn version (#106)
- Fix preprocessing when num_processes is not None (#96)
- Fix numpy version (#104)
- Fix requirements (#102)
- Add parameter for setting the number of processes for gensim coherence (#91)
- Fix pandas error
- Fix gensim requirements (#87)
- Improve preprocessing (#70)
- Bug fix CTM num_topics (#76)
- Add top_words parameter to CTM model (#84)
- Add seed parameter to CTM (#65)
- Update some requirements
- Add testing for Python 3.9 and remove testing for Python 3.6
- Minor fixes
- Update metadata of Italian datasets
- Fix dataset encoding (#57)
- Fix word embeddings topic coherence (#58)
- Fix dataset name BBC_News (#59)
- Fix KL Divergence in diversity metrics (#51, #52)
- Bug fix optimizer evaluation with additional metrics (#46)
- Bug fix Coherence with word embeddings (#43, #45)
- ETM now supports different formats of word embeddings (#36)
- Bug fix similarity measures (#41)
- Minor fixes
- Bug fix preprocessing (#26)
- Bug fix ctm (#28)
- Bug fix weirbo_centroid (#31)
- Added new Italian datasets
- Minor fixes
- Gensim migration from 3.8 to >=4.0.0
- Fixed unwanted sorting of documents
- Fixed gensim version (#22)
- Added per-topic KL-uniform significance
- Handling multilabel classification
- Fixed preprocessing when dataset is not split (#17)
- Added regularization hyperparameter to NMF_scikit
- Added similarity metrics
- Fixed handling of stopwords in preprocessing
- Fixed coherence and diversity metrics
- Added new metrics tests
- Fixed CTM training when only training dataset is used
- Dashboard bugs fixed
- Minor bug fixes
- Added new tests for TM training
- Added parameter num_samples to CTM, NeuralLDA and ProdLDA
- Bug fix AVITM
- Bug fix info dataset
- Tomotopy's LDA implementation should now work
- Bug fix dataset download
- CTM is no longer verbose
- New classification metrics
- Vocabulary downloader fix
- Dataset downloader fix
- New metrics initialization (dictionaries are no longer supported as input)
- Optimization, dataset and dashboard bug fixes
- Refactoring
- Updated README and documentation
- Dataset preprocessing now also produces an indexes.txt file containing the indexes of the documents
- Eval metrics bug fixes
- BBC News added in the correct format
- Bug fixes
- New dataset format
- First release on PyPI.