Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

decode problem in csv2parquet #4

Open
caijun opened this issue Oct 16, 2016 · 2 comments
Open

decode problem in csv2parquet #4

caijun opened this issue Oct 16, 2016 · 2 comments

Comments

@caijun
Copy link

caijun commented Oct 16, 2016

Hi, I am using csv2parquet to convert csv into parquet, but I get following decode problem.

./csv2parquet all_cities.csv all_cities.parquet
Traceback (most recent call last):
  File "./csv2parquet", line 329, in <module>
    csv_source = CsvSource(args.csv_input, args.column_map, args.types)
  File "./csv2parquet", line 145, in __init__
    self.headers = self._init_headers()
  File "./csv2parquet", line 150, in _init_headers
    return next(csv_data)
  File "/usr/local/Cellar/python3/3.5.2_3/Frameworks/Python.framework/Versions/3.5/lib/python3.5/codecs.py", line 321, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xcc in position 164: invalid continuation byte
@caijun
Copy link
Author

caijun commented Oct 16, 2016

After I used the iconv command to convert the file encoding from CP936 to UTF-8, I encountered another problem.

FATAL: Drill script failed with error code 2.  To troubleshoot, run
with --debug and inspect files script, script_stderr and script_stdout.

@titidutarn
Copy link

I got the same problem
solve: add "sudo" when entering your command line

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants