Could you provide a sample embedding vocab files to run the socre.py? #10

lahpan · 2022-03-31T07:37:37Z

Hi,

Thanks for your fascinating work. I am trying to reproduce your results for my own project, but I found it hard to run the socre.py code. In your example call,
python3 code/score.py --entities KG --entities_file {dest_to_entities_file} --clustering_algo GMM --dataset reuters --vocab {dest_to_vocab_file} --num_topics 20 50 --doc_info WGT--rerank tf
You did not specify the type of embedding and vocab files, so I kept running into problem.
Following is the code from your preprocess.py.
def create_global_vocab(vocab_files): vocab_list = set(line.split()[0] for line in open(vocab_files[0])) for vocab in vocab_files: vocab_list = vocab_list & set(line.split()[0] for line in open(vocab)) return vocab_list
My question is what kind of vocab files(txt, pkl,csv etc.) can have open(vocab_files[0]) ?

It will be helpful you can provide a sample embedding and vocab file so we can run the demo.sh code in the bin.

Thanks in advance for your help.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Could you provide a sample embedding vocab files to run the socre.py? #10

Could you provide a sample embedding vocab files to run the socre.py? #10

lahpan commented Mar 31, 2022

Could you provide a sample embedding vocab files to run the socre.py? #10

Could you provide a sample embedding vocab files to run the socre.py? #10

Comments

lahpan commented Mar 31, 2022