You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
That's our ML beast written in Python. Finally, load the model:
from ast2vec import Topics; model = Topics().load("model.asdf")
That's it. All the keywords are there. model.matrix is the sparse matrix of keyword -> topic, model.tokens is the keyword list. It is important to notice that those "tokens" are splitted and stemmed as given in the paper; the processing code is in ast2vec/uast_ids_to_bag.py No need to extract any ASTs to get the identifiers, using a regular syntax highlighter is enough.
Source{d} has a repository that automatically suggests topics based on sourcecode.
Repository: https://github.com/src-d/tmsc
Paper on arxiv: https://arxiv.org/abs/1704.00135
Blogpost about paper: https://blog.sourced.tech/post/github_topic_modeling
People to ping about this: https://twitter.com/tmarkhor, https://twitter.com/francesc
The text was updated successfully, but these errors were encountered: