Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SEDONA-226] Support reading and writing GeoParquet file metadata #740

Merged
merged 1 commit into from
Dec 30, 2022

Conversation

Kontinuation
Copy link
Member

@Kontinuation Kontinuation commented Dec 29, 2022

Did you read the Contributor Guide?

Is this PR related to a JIRA ticket?

What changes were proposed in this PR?

This patch makes GeoParquetFileFormat aware of the geo metadata in GeoParquet files. We removed the fieldGeometry option for loading GeoParquet files, and file metadata would be written when saving dataframes as GeoParquet files.

This patch also resolves several bugs related to GeoParquet support:

How was this patch tested?

Unit tests were added to test this patch. We've also tested this patch manually on Spark standalone clusters running various versions of Spark.

Interoperability with the latest GeoPandas (0.11.1) was tested manually.

Did this PR include necessary documentation updates?

  • Yes, I have updated the documentation update.

@Kontinuation Kontinuation marked this pull request as ready for review December 29, 2022 13:38
Honor the geo metadata when loading GeoParquet files, instead of letting the user explicitly specify the name of the geometry column.
@jiayuasu jiayuasu merged commit 8ddd353 into apache:master Dec 30, 2022
@Kontinuation Kontinuation deleted the geoparquet-metadata branch August 23, 2023 15:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants