Hive Support #16

Open
oclarms opened this issue Nov 28, 2017 · 2 comments

@oclarms

oclarms commented Nov 28, 2017

Read and write InputFormat/OutputFormat and SerDe information from/to the Hive Metastore.
Read and write data from/to a Hive table or partition.
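
As a rough illustration of the metadata involved, the sketch below models the three class names that a Hive table's storage information records (InputFormat, OutputFormat, SerDe) and the `key=value` directory layout Hive uses for partitions. `StorageDescriptor`, `KNOWN_FORMATS`, and `partition_path` are hypothetical names made up for this example; they are not part of this project or of Hive's API. The Java class names are the ones Hive ships for text and Parquet tables.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class StorageDescriptor:
    """Hypothetical stand-in for the per-table storage info kept in the Hive Metastore."""
    input_format: str
    output_format: str
    serde: str


# Class names Hive uses for two common storage formats.
KNOWN_FORMATS: Dict[str, StorageDescriptor] = {
    "textfile": StorageDescriptor(
        input_format="org.apache.hadoop.mapred.TextInputFormat",
        output_format="org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat",
        serde="org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe",
    ),
    "parquet": StorageDescriptor(
        input_format="org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat",
        output_format="org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat",
        serde="org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe",
    ),
}


def partition_path(table_location: str, partition_spec: Dict[str, str]) -> str:
    """Build the directory of one partition under the table location.

    Partition keys are emitted in dict insertion order. Hive also escapes
    special characters in partition values; this sketch does not.
    """
    parts = [f"{key}={value}" for key, value in partition_spec.items()]
    return "/".join([table_location.rstrip("/")] + parts)
```

For example, `partition_path("/user/hive/warehouse/logs", {"dt": "2017-11-28"})` yields the directory a reader or writer would target for that partition.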

@advancedxy
Collaborator

Do you have any plans or thoughts on supporting Hive IO formats?

If we are running on Spark, we can delegate the read path to Spark. However, when running locally or writing data to Hive, we would have to implement all the Hive IO formats in C++, which would be quite a complex project.

Another option would be to delegate this work to HCatalog, which looks like a good direction.
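
The trade-off above could be expressed as a small dispatch rule. This is only a sketch of the reasoning in this comment, assuming HCatalog handles every case Spark cannot; `Backend` and `choose_hive_backend` are hypothetical names, not existing code in this project.

```python
from enum import Enum


class Backend(Enum):
    """Hypothetical set of components that could perform Hive IO."""
    SPARK = "spark"
    HCATALOG = "hcatalog"


def choose_hive_backend(running_on_spark: bool, is_write: bool) -> Backend:
    """Pick who performs Hive IO.

    Reads on Spark are delegated to Spark. Local runs and all writes go
    to HCatalog, which avoids reimplementing the Hive IO formats in C++.
    """
    if running_on_spark and not is_write:
        return Backend.SPARK
    return Backend.HCATALOG
```

Keeping the decision in one function would make it easy to revisit later, for instance if writes on Spark could also be delegated.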

@oclarms
Author

oclarms commented Nov 29, 2017

Yes, HCatalog is a good idea. But we need to test whether HCatalog supports Parquet files, because sometimes we want to specify the data storage format.
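
For context, in Hive itself the storage format is chosen with a `STORED AS` clause at table creation. The helper below is a minimal sketch of how a caller-specified format might be turned into that DDL; `create_table_ddl` and `SUPPORTED_FORMATS` are hypothetical names, and whether HCatalog honors each format still needs the testing mentioned above.

```python
from typing import Dict

# Storage format keywords accepted by Hive's STORED AS clause.
SUPPORTED_FORMATS = {"TEXTFILE", "SEQUENCEFILE", "RCFILE", "ORC", "PARQUET", "AVRO"}


def create_table_ddl(table: str, columns: Dict[str, str], stored_as: str) -> str:
    """Render a CREATE TABLE statement with an explicit storage format.

    Hypothetical helper: it does no identifier validation and does not
    talk to Hive; it only builds the statement text.
    """
    fmt = stored_as.upper()
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported storage format: {stored_as}")
    cols = ", ".join(f"`{name}` {col_type}" for name, col_type in columns.items())
    return f"CREATE TABLE {table} ({cols}) STORED AS {fmt}"
```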


2 participants