-
Notifications
You must be signed in to change notification settings - Fork 654
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Core feature] Add support for duckdb with structured dataset #2865
Comments
Hi Team, I was new to Flyte, Shall I take this task and try this integration of flytekit and DuckDB? Thanks in Advance |
@rajagurunath, please go ahead! Do you want me to assign the issue to you? |
Thanks a lot, @samhita-alla, please assign this issue to me! |
@rajagurunath, hey I'm going to unassign you as my team will work on this issue. I hope you haven't started working on it. Sorry! Please feel free to leave a comment on the other hacktoberfest-labeled issues we have: #2917. |
Hi @samhita-alla I would like to take up the issue |
sure No problem @samhita-alla Thanks, yeah kind of started with an initial exploration of the code base and saving the data frame to duckdb, etc, will have a look at the list and pick other issues. Please let me know @shivaylamba, for any help needed from my side! |
Motivation: Why do you think this is important?
duckdb is a serverless DB deployed at GCP. we could integrate it with Flyte, and write task output, like
pandas.DataFrame
orarrow.Tables
, to duckdb transparently.Goal: What should the final outcome look like, ideally?
Add a duckDB plugin in flytekit, and add a structured dataset encoder and decoder.
Here is an example to add custom encoder/decoder in flytekit-plugin
Describe alternatives you've considered
No response
Propose: Link/Inline OR Additional context
No response
Are you sure this issue hasn't been raised already?
Have you read the Code of Conduct?
The text was updated successfully, but these errors were encountered: