v0.15: Experimental new CSV-, and Geographic integrations and many other fixes #991
Jolanrensen
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
This release contains several new features, tons of fixes and two new exciting experimental new integrations:
GeoDataFrame
class for working with geographical data (from GeoJson/Shapefile) and plotting it with Kandy. See below for more information.BigInteger
support:Just like we support the
BigDecimal
numbers, DataFrame now also supportsBigInteger
in parsing, converting, statistics, column arithmetics, etc.Parsing and converting
String
columns to other types is now faster. We also introduce the new experimentalParserOptions.useFastDoubleParser
setting which uses FastDoubleParser for faster and more flexibleDouble
parsing.New Experimental CSV integration
DataFrame's CSV parsing has been based on Apache Commons CSV from the beginning. While this has been sufficient for most applications, it had some issues like running out of memory, performance, and our API lacking in clarity, documentation, and completeness.
For DataFrame 0.15, we introduce a new separate package
org.jetbrains.kotlinx:dataframe-csv
which tries to solve all these issues at once. It's based on Deephaven-CSV which makes it faster and more memory efficient. And since we built it from the ground up, we made sure the API was complete, predictable, and documented carefully.To try it yourself, explicitly add the dependency
org.jetbrains.kotlinx:dataframe-csv
to your project. In notebooks you can addenableExperimentalCsv=true
to the %use-magic, like%use dataframe(enableExperimentalCsv=true)
.Use the new
DataFrame.readCsv()
/DataFrame.readTsv()
/DataFrame.readDelim()
functions over the oldDataFrame.readCSV()
ones.We happily await your feedback!
New Experimental Geo integration
Kandy v0.8 introduces geo-plotting which allows you to visualize geospatial/geographical data using the awesome Kandy DSL. To make working with this geographical data (from GeoJson/Shapefile) easier, we happily accepted the GeoDataFrame PR from the Kandy team.
To try it yourself, explicitly add the dependency
org.jetbrains.kotlinx:dataframe-geo
to your project or notebook (with the repositorymaven("https://repo.osgeo.org/repository/release")
) and useGeoDataFrame.readGeoJson()
orGeoDataFrame.readShapeFile()
to get started!Features
BigInteger
support by @Jolanrensen in FullBigInteger
support #972Compiler Plugin
Fixes
Anycol.isValueColumn
etc. for smart-casting by @Jolanrensen in Adding contracts forAnycol.isValueColumn
etc. for smart-casting #882DataFrame.read("", delimiter =)
by @Jolanrensen in DeprecatingDataFrame.read("", delimiter =)
#902Any
by changing visibility check by @Jolanrensen in Fixed local classes being inferred asAny
by changing visibility check #929describe()
fixes by @Jolanrensen indescribe()
fixes #937startsWith
andendsWith
in CS DSL to Error by @Jolanrensen in Bumped deprecations ofstartsWith
andendsWith
in CS DSL to Error #978Docs and Examples
New Contributors
Full Changelog: v0.14.2...v0.15.0
This discussion was created from the release v0.15: Experimental new CSV-, and Geographic integrations and many other fixes.
Beta Was this translation helpful? Give feedback.
All reactions