Skip to content

Commit

Permalink
Add Parquet example data to new ParquetExamples directory (#113)
Browse files Browse the repository at this point in the history
* Add Parquet example data to new ParquetExamples directory

* Create README

* Align table in README
  • Loading branch information
alexpeters1208 authored Oct 10, 2024
1 parent 82cfc16 commit 2239b54
Show file tree
Hide file tree
Showing 21 changed files with 67 additions and 0 deletions.
25 changes: 25 additions & 0 deletions ParquetExamples/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,25 @@
# Parquet Example Data

Parquet data can be stored in many different ways - with or without metadata, in key-value partitioned directories, or in flat partitioned directories. This directory contains a simple dataset stored in all of these different formats. These are used to demonstrate Deephaven's ability to read various kinds of Parquet data.


## The `grades` dataset

The `grades` dataset is a simple synthetic dataset used to demonstrate the various ways to store Parquet data. It looks like this:

| Name | Class | Test1 | Test2 |
| ------ | ------- | ----- | ----- |
| Ashley | Math | 92 | 94 |
| Jeff | Math | 78 | 88 |
| Rita | Math | 87 | 81 |
| Zach | Math | 74 | 70 |
| Ashley | Science | 87 | 91 |
| Jeff | Science | 90 | 83 |
| Rita | Science | 99 | 95 |
| Zach | Science | 80 | 78 |
| Ashley | History | 82 | 88 |
| Jeff | History | 87 | 92 |
| Rita | History | 84 | 85 |
| Zach | History | 76 | 78 |

This dataset can be partitioned by `Name` or `Class` to demonstrate the different ways to store Parquet data.
3 changes: 3 additions & 0 deletions ParquetExamples/grades/grades.parquet
Git LFS file not shown
3 changes: 3 additions & 0 deletions ParquetExamples/grades_flat/history.parquet
Git LFS file not shown
3 changes: 3 additions & 0 deletions ParquetExamples/grades_flat/math.parquet
Git LFS file not shown
3 changes: 3 additions & 0 deletions ParquetExamples/grades_flat/science.parquet
Git LFS file not shown
Binary file added ParquetExamples/grades_flat_meta/_common_metadata
Binary file not shown.
Binary file added ParquetExamples/grades_flat_meta/_metadata
Binary file not shown.
3 changes: 3 additions & 0 deletions ParquetExamples/grades_flat_meta/history.parquet
Git LFS file not shown
3 changes: 3 additions & 0 deletions ParquetExamples/grades_flat_meta/math.parquet
Git LFS file not shown
3 changes: 3 additions & 0 deletions ParquetExamples/grades_flat_meta/science.parquet
Git LFS file not shown
Git LFS file not shown
Git LFS file not shown
Git LFS file not shown
Git LFS file not shown
Git LFS file not shown
Git LFS file not shown
Binary file added ParquetExamples/grades_kv_meta/_common_metadata
Binary file not shown.
Binary file added ParquetExamples/grades_kv_meta/_metadata
Binary file not shown.
Binary file added ParquetExamples/grades_meta/_common_metadata
Binary file not shown.
Binary file added ParquetExamples/grades_meta/_metadata
Binary file not shown.
3 changes: 3 additions & 0 deletions ParquetExamples/grades_meta/grades.parquet
Git LFS file not shown

0 comments on commit 2239b54

Please sign in to comment.