feat: preserve Arrow Date(MILLISECOND) columns through Spark roundtrip by summaryzb · Pull Request #464 · lance-format/lance-spark

summaryzb · 2026-04-21T08:59:09Z

Preserve Arrow Date(MILLISECOND) columns through Spark read-write roundtrips by carrying the original Arrow date unit in Spark field metadata. Without this change, Date(MILLISECOND) columns in Lance datasets created by PyArrow or other non-Spark sources would be silently downgraded to Date(DAY) when written back through Spark, breaking interoperability with downstream consumers that expect millisecond-unit dates.

Change-Id: I832408fa941c1ebdf59a61307bfcda4e183ba64f
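
The idea in the description can be illustrated with a minimal, dependency-free sketch. This is not the code from this PR: the metadata key `arrow.date.unit`, the two dataclasses, and the helper names are all hypothetical stand-ins for the Arrow field, the Spark field, and the schema converters. It assumes only that Spark's date type counts days since the Unix epoch and that Arrow Date(MILLISECOND) counts milliseconds since the epoch.

```python
from dataclasses import dataclass, field

MILLIS_PER_DAY = 86_400_000

# Hypothetical metadata key; the key actually used by the PR is not shown here.
DATE_UNIT_KEY = "arrow.date.unit"


@dataclass
class ArrowDateField:
    """Stand-in for an Arrow date field with an explicit unit."""
    name: str
    unit: str  # "DAY" or "MILLISECOND"


@dataclass
class SparkDateField:
    """Stand-in for a Spark date field; Spark has one date type (days)."""
    name: str
    metadata: dict = field(default_factory=dict)


def arrow_to_spark(f: ArrowDateField) -> SparkDateField:
    # Read path: Spark cannot express the unit in its type,
    # so the unit is recorded in the field metadata instead.
    return SparkDateField(f.name, {DATE_UNIT_KEY: f.unit})


def spark_to_arrow(f: SparkDateField) -> ArrowDateField:
    # Write path: restore the recorded unit. With no metadata the
    # converter can only fall back to DAY, which is the silent
    # downgrade the description warns about.
    return ArrowDateField(f.name, f.metadata.get(DATE_UNIT_KEY, "DAY"))


def days_to_millis(days: int) -> int:
    # Value conversion for a field whose restored unit is MILLISECOND.
    return days * MILLIS_PER_DAY


def millis_to_days(millis: int) -> int:
    # Value conversion when reading a MILLISECOND column as Spark days.
    return millis // MILLIS_PER_DAY


original = ArrowDateField("event_date", "MILLISECOND")
roundtripped = spark_to_arrow(arrow_to_spark(original))
print(roundtripped.unit)
```

In this sketch the unit survives only because the metadata travels with the Spark field; a field that reaches the write path without it is written as Date(DAY).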

feat: preserve Arrow Date(MILLISECOND) columns through Spark roundtrip

5700de9

Change-Id: I832408fa941c1ebdf59a61307bfcda4e183ba64f

github-actions Bot added the enhancement New feature or request label Apr 21, 2026

summaryzb wants to merge 1 commit into lance-format:main from summaryzb:date_type
