Parquet4S

Parquet4s is a simple I/O for Parquet. Allows you to easily read and write Parquet files in Scala.

Use just a Scala case class to define the schema of your data. No need to use Avro, Protobuf, Thrift, or other data serialisation systems. You can use generic records if you don't want to use the case class, too.

Compatible with files generated with Apache Spark. However, unlike in Spark, you do not have to start a cluster to perform I/O operations.

Based on official Parquet library, Hadoop Client and Shapeless (Shapeless is not in use in a version for Scala 3).

As it is based on Hadoop Client, you can connect to any Hadoop-compatible storage like AWS S3 or Google Cloud Storage.

Integrations for Akka Streams, Pekko Streams, and FS2.

Released for Scala 2.12.x, 2.13.x and 3.3.x.

Documentation

Documentation is available at here.

Contributing

Do you want to contribute? Please read the contribution guidelines.

Name		Name	Last commit message	Last commit date
Latest commit History 474 Commits
.circleci		.circleci
.github		.github
akkaPekko/src		akkaPekko/src
akkaPekkoBenchmarks/src/main/scala/com/github/mjakubowski84/parquet4s		akkaPekkoBenchmarks/src/main/scala/com/github/mjakubowski84/parquet4s
core/src		core/src
coreBenchmarks/src/main/scala/com/github/mjakubowski84/parquet4s		coreBenchmarks/src/main/scala/com/github/mjakubowski84/parquet4s
examples/src/main		examples/src/main
fs2/src		fs2/src
fs2Benchmarks/src/main/scala/com/github/mjakubowski84/parquet4s		fs2Benchmarks/src/main/scala/com/github/mjakubowski84/parquet4s
project		project
scalapb/src		scalapb/src
site/src/main/resources/docs		site/src/main/resources/docs
testkit/src/main/scala/com/github/mjakubowski84/parquet4s/testkit		testkit/src/main/scala/com/github/mjakubowski84/parquet4s/testkit
.gitignore		.gitignore
.java-version		.java-version
.sbtopts		.sbtopts
.scalafmt.conf		.scalafmt.conf
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
build.sbt		build.sbt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Parquet4S

Documentation

Contributing

Sponsors

About

Releases

Packages

Languages

License

jessekempf-vsco/parquet4s

Folders and files

Latest commit

History

Repository files navigation

Parquet4S

Documentation

Contributing

Sponsors

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages