Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

error when running python preprocess_reddit.py --data-dir data/ #6

Open
Blossomone opened this issue Aug 7, 2021 · 0 comments
Open

Comments

@Blossomone
Copy link

I run the command "python preprocess_reddit.py --data-dir data/" to get reddit data.
But I get errors Cannot open exception db file for reading: C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier-0.0.1-py3.6.egg-tmp\rouge_papier\rouge_data/WordNet-2.0.exc.db
I get stuck by this error for several days. Can you guide me to solve it ?

(datannsum) C:\Users\17674\Desktop\MultiModalSummary\summarization-datasets-master>python preprocess_reddit.py --data-dir data/
1145229 / 1145229
Writing train abstracts to: data\reddit\human-abstracts\train
Writing valid abstracts to: data\reddit\human-abstracts\valid
Writing test abstracts to: data\reddit\human-abstracts\test
Writing train extracts to: data\reddit\human-extracts\train
Writing valid extracts to: data\reddit\human-extracts\valid
Writing test extracts to: data\reddit\human-extracts\test
SpawnPoolWorker-2: Ready!
SpawnPoolWorker-7: Ready!
SpawnPoolWorker-8: Ready!
SpawnPoolWorker-3: Ready!
SpawnPoolWorker-4: Ready!
SpawnPoolWorker-5: Ready!
SpawnPoolWorker-6: Ready!
SpawnPoolWorker-1: Ready!
Cannot open exception db file for reading: C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier-0.0.1-py3.6.egg-tmp\rouge_papier\rouge_data/WordNet-2.0.exc.db
Cannot open exception db file for reading: C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier-0.0.1-py3.6.egg-tmp\rouge_papier\rouge_data/WordNet-2.0.exc.db
Cannot open exception db file for reading: C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier-0.0.1-py3.6.egg-tmp\rouge_papier\rouge_data/WordNet-2.0.exc.db
Cannot open exception db file for reading: C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier-0.0.1-py3.6.egg-tmp\rouge_papier\rouge_data/WordNet-2.0.exc.db
Cannot open exception db file for reading: C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier-0.0.1-py3.6.egg-tmp\rouge_papier\rouge_data/WordNet-2.0.exc.db
Cannot open exception db file for reading: C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier-0.0.1-py3.6.egg-tmp\rouge_papier\rouge_data/WordNet-2.0.exc.db
Cannot open exception db file for reading: C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier-0.0.1-py3.6.egg-tmp\rouge_papier\rouge_data/WordNet-2.0.exc.db
Cannot open exception db file for reading: C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier-0.0.1-py3.6.egg-tmp\rouge_papier\rouge_data/WordNet-2.0.exc.db
multiprocessing.pool.RemoteTraceback:
"""
Traceback (most recent call last):
  File "C:\Users\17674\anaconda3\envs\datannsum\lib\multiprocessing\pool.py", line 119, in worker
    result = (True, func(*args, **kwds))
  File "C:\Users\17674\Desktop\MultiModalSummary\summarization-datasets-master\preprocess_reddit.py", line 148, in worker
    ext_labels = {"id": story_id, "labels": get_labels(example, ext_paths)}
  File "C:\Users\17674\Desktop\MultiModalSummary\summarization-datasets-master\preprocess_reddit.py", line 114, in get_labels
    remove_stopwords=True, length=75)
  File "C:\Users\17674\anaconda3\envs\datannsum\lib\site-packages\rouge_papier-0.0.1-py3.6.egg\rouge_papier\generate.py",line 15, in compute_extract
    length_unit=length_unit, remove_stopwords=remove_stopwords), None
  File "C:\Users\17674\anaconda3\envs\datannsum\lib\site-packages\rouge_papier-0.0.1-py3.6.egg\rouge_papier\generate.py",line 127, in compute_greedy_sequential_extract
    length_unit=length_unit, remove_stopwords=remove_stopwords)
  File "C:\Users\17674\anaconda3\envs\datannsum\lib\site-packages\rouge_papier-0.0.1-py3.6.egg\rouge_papier\wrapper.py", line 58, in compute_rouge
    output = check_output(" ".join(args), shell=True).decode("utf8")
  File "C:\Users\17674\anaconda3\envs\datannsum\lib\subprocess.py", line 356, in check_output
    **kwargs).stdout
  File "C:\Users\17674\anaconda3\envs\datannsum\lib\subprocess.py", line 438, in run
    output=stdout, stderr=stderr)
subprocess.CalledProcessError: Command 'perl C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier-0.0.1
-py3.6.egg-tmp\rouge_papier\data\ROUGE-1.5.5.pl -e C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier
-0.0.1-py3.6.egg-tmp\rouge_papier\rouge_data -a -n 1 -x -d -m -s -l 75 -r 1000 -f A -z SPL C:\Users\17674\AppData\Local\Temp\tmpyt4kke18\tmpcd6ql1xn' returned non-zero exit status 255.
"""

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "preprocess_reddit.py", line 262, in <module>
    main()
  File "preprocess_reddit.py", line 221, in main
    pool)
  File "preprocess_reddit.py", line 255, in make_dataset
    for i, _ in enumerate(pool.imap(worker, story_iter()), 1):
  File "C:\Users\17674\anaconda3\envs\datannsum\lib\multiprocessing\pool.py", line 735, in next
    raise value
subprocess.CalledProcessError: Command 'perl C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier-0.0.1
-py3.6.egg-tmp\rouge_papier\data\ROUGE-1.5.5.pl -e C:\Users\17674\AppData\Local\Python-Eggs\Python-Eggs\Cache\rouge_papier
-0.0.1-py3.6.egg-tmp\rouge_papier\rouge_data -a -n 1 -x -d -m -s -l 75 -r 1000 -f A -z SPL C:\Users\17674\AppData\Local\Temp\tmpyt4kke18\tmpcd6ql1xn' returned non-zero exit status 255.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant