Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Input Source - Delta Lake #30

Open
Chase-Edwards opened this issue Feb 5, 2023 · 3 comments
Open

Input Source - Delta Lake #30

Chase-Edwards opened this issue Feb 5, 2023 · 3 comments
Labels
good first issue Good for newcomers help wanted Extra attention is needed

Comments

@Chase-Edwards
Copy link

Delta Lake is a common OSS table format that would be useful to support with Quokka.

@marsupialtail
Copy link
Owner

It is in progress. In fact if you look at setup.py I already included the optional dependencies.

@marsupialtail
Copy link
Owner

Contributions very welcome -- it shouldn't be that different from a regular list of parquet inputs.

https://marsupialtail.github.io/quokka/tutorial/
https://github.com/marsupialtail/quokka/blob/master/pyquokka/dataset.py#L29

@marsupialtail marsupialtail added help wanted Extra attention is needed good first issue Good for newcomers labels Feb 6, 2023
@SemyonSinchenko
Copy link

SemyonSinchenko commented Aug 29, 2023

Hello! Why just not to use delta-rs library? Of course, it is possible to implement it from scratch, but it would make maintenance harder. Of course, it requires to have this dependency on all the nodes, but I see that with iceberg you used side-dependency instead of writing reader from scratch.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

3 participants