Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

"No such file" when running download.sh; "UnicodeDecodeError" when running featurize.py #6

Open
maoredman opened this issue Feb 11, 2018 · 2 comments

Comments

@maoredman
Copy link

maoredman commented Feb 11, 2018

Hi, when I ran the data/download.sh script, the command
cat augmented_data/augmented_zips.zip.z* > augmented_train.json.zip
raised an error:
cat: augmented_data/augmented_zips.zip.z*: No such file or directory

I then changed augmented_zips.zip.z*
to
cat augmented_data/augmented_zips.z01 augmented_data/augmented_zips.z02 augmented_data/augmented_zips.z03 augmented_data/augmented_zips.z04 augmented_data/augmented_zips.z05 augmented_data/augmented_zips.z06 augmented_data/augmented_zips.z07 augmented_data/augmented_zips.z08 augmented_data/augmented_zips.z09 augmented_data/augmented_zips.z10 augmented_data/augmented_zips.zip > augmented_train.json.zip

But am unable to run featurize.py successfully afterwards, encountering UnicodeDecodeError: 'utf-8' codec can't decode bytes in position 15-16: invalid continuation byte when running line 139 of vocab.py (gensim.models.KeyedVectors.load_word2vec_format(path, binary=True)) during "Building word embedding matrix..."

Is there any advice on what modifications I can make?
Thanks!

@maoredman maoredman changed the title Possible small error in data/download.sh data/download.sh error -- augmented_zips.zip.z*: No such file or directory Feb 11, 2018
@maoredman maoredman changed the title data/download.sh error -- augmented_zips.zip.z*: No such file or directory data/download.sh error: augmented_zips.zip.z*, No such file or directory Feb 11, 2018
@maoredman maoredman changed the title data/download.sh error: augmented_zips.zip.z*, No such file or directory "No such file" when running download.sh; "UnicodeDecodeError" when running featurize.py Feb 11, 2018
@Kotorinyanya
Copy link

Kotorinyanya commented Apr 3, 2018

I have the same problem, is there nobody up to fix this?

@lan2720
Copy link

lan2720 commented Apr 11, 2018

@maoredman The exact same error when running featurize.py. Did you fix it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants