feat(table): Support Dynamic Partition Overwrite #482
Conversation
Signed-off-by: dttung2905 <ttdao.2015@accountancy.smu.edu.sg>
table/transaction.go
Outdated
	// Check that all partition fields use identity transforms
	currentSpec := t.meta.CurrentSpec()
	for field := range currentSpec.Fields() {
		if _, ok := field.Transform.(iceberg.IdentityTransform); !ok {
			return fmt.Errorf("%w: dynamic overwrite does not support non-identity-transform fields in partition spec: %s",
				ErrInvalidOperation, field.Name)
		}
	}
is this defined in the spec? Or is this just a NotYetImplemented thing?
	if tbl.NumRows() == 0 {
		return nil
	}
shouldn't this overwrite the partition with an empty partition?
Correct me if I'm wrong, but the Spark writer handles this quite similarly: https://github.com/apache/iceberg/blob/0651b8913d27c3b1c9aca4a9609bec521905fb36/spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/source/SparkWrite.java#L297-L305
wdyt 🤔 ?
	var allDataFiles []iceberg.DataFile
	for df, err := range dataFiles {
		if err != nil {
			return err
		}
		allDataFiles = append(allDataFiles, df)
	}

	partitionsToOverwrite := make(map[string]struct{})
	for _, df := range allDataFiles {
		partitionKey := fmt.Sprintf("%v", df.Partition())
		partitionsToOverwrite[partitionKey] = struct{}{}
	}
you can probably merge these loops
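To illustrate the suggestion, here is a minimal self-contained sketch of the merged single pass. The `dataFile` type and `collect` helper are stand-ins invented for this example (the real code iterates an `iceberg.DataFile` sequence and would also propagate the per-item error):

```go
package main

import "fmt"

// dataFile is a hypothetical stand-in for iceberg.DataFile; only the
// partition representation matters for this sketch.
type dataFile struct {
	partition string
}

// collect gathers the data files and the set of partition keys to
// overwrite in one pass, instead of one loop to collect files and a
// second loop to build the partition set.
func collect(dataFiles []dataFile) ([]dataFile, map[string]struct{}) {
	allDataFiles := make([]dataFile, 0, len(dataFiles))
	partitionsToOverwrite := make(map[string]struct{})
	for _, df := range dataFiles {
		allDataFiles = append(allDataFiles, df)
		partitionsToOverwrite[fmt.Sprintf("%v", df.partition)] = struct{}{}
	}
	return allDataFiles, partitionsToOverwrite
}

func main() {
	files := []dataFile{{"dt=2024-01-01"}, {"dt=2024-01-02"}, {"dt=2024-01-01"}}
	all, parts := collect(files)
	fmt.Println(len(all), len(parts)) // 3 files, 2 distinct partitions
}
```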
table/transaction.go
Outdated
		return err
	}

	deleteProducer := t.updateSnapshot(fs, snapshotProps).mergeOverwrite(nil)
shouldn't this use the commitUUID?
table/transaction.go
Outdated
	partitionExpr := partitionExprs[0]
	for _, expr := range partitionExprs[1:] {
		partitionExpr = iceberg.NewAnd(partitionExpr, expr)
	}
this is already handled via NewAnd. You can do: partitionExpr := iceberg.NewAnd(partitionExprs[0], partitionExprs[1], partitionExprs[2:]...)
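A minimal sketch of the variadic fold that the reviewer is describing, using toy expression types invented for this example (`booleanExpression`, `andExpr`, `pred`, and `newAnd` are stand-ins for the real iceberg types; the point is that a variadic signature absorbs the manual accumulation loop):

```go
package main

import "fmt"

// Toy stand-ins so the sketch is self-contained.
type booleanExpression interface{ String() string }

type andExpr struct{ left, right booleanExpression }

func (a andExpr) String() string {
	return fmt.Sprintf("(%s AND %s)", a.left, a.right)
}

type pred string

func (p pred) String() string { return string(p) }

// newAnd mirrors a variadic NewAnd: it folds any number of extra
// expressions into a left-deep AND tree, so the caller passes the
// whole slice in one call instead of looping.
func newAnd(left, right booleanExpression, rest ...booleanExpression) booleanExpression {
	folded := booleanExpression(andExpr{left, right})
	for _, e := range rest {
		folded = andExpr{folded, e}
	}
	return folded
}

func main() {
	exprs := []booleanExpression{pred("a=1"), pred("b=2"), pred("c=3")}
	// The call shape suggested in the review (requires len(exprs) >= 2):
	combined := newAnd(exprs[0], exprs[1], exprs[2:]...)
	fmt.Println(combined) // ((a=1 AND b=2) AND c=3)
}
```

The same shape applies to the OR fold discussed below.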
table/transaction.go
Outdated
	result := expressions[0]
	for _, expr := range expressions[1:] {
		result = iceberg.NewOr(result, expr)
	}
same comment as above, iceberg.NewOr already handles an arbitrary number of arguments so you don't have to do this loop manually
table/transaction.go
Outdated
	func parsePartitionKey(partitionKey string, fieldNames []string) []interface{} {
		// Simple parsing for demonstration - assumes a format like "field1=value1/field2=value2"
		parts := strings.Split(partitionKey, "/")
		values := make([]interface{}, len(fieldNames))
we have the schema, so we can use the field names to look up each field's type and know what type to parse the strings into
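A self-contained sketch of schema-driven parsing, assuming the partition key format `"field1=value1/field2=value2"` from the code above. The `fieldType` enum and the field-name-to-type map are hypothetical stand-ins for a lookup on the real `iceberg.Schema`:

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// fieldType is a hypothetical stand-in for the schema's field types.
type fieldType int

const (
	typeInt64 fieldType = iota
	typeString
	typeBool
)

// parsePartitionKey converts "field1=value1/field2=value2" into typed
// values, using the schema-derived type of each field instead of
// leaving everything as a string.
func parsePartitionKey(partitionKey string, fieldTypes map[string]fieldType) (map[string]any, error) {
	values := make(map[string]any)
	for _, part := range strings.Split(partitionKey, "/") {
		name, raw, ok := strings.Cut(part, "=")
		if !ok {
			return nil, fmt.Errorf("malformed partition segment: %q", part)
		}
		typ, ok := fieldTypes[name]
		if !ok {
			return nil, fmt.Errorf("unknown partition field: %q", name)
		}
		switch typ {
		case typeInt64:
			v, err := strconv.ParseInt(raw, 10, 64)
			if err != nil {
				return nil, err
			}
			values[name] = v
		case typeBool:
			v, err := strconv.ParseBool(raw)
			if err != nil {
				return nil, err
			}
			values[name] = v
		default:
			values[name] = raw
		}
	}
	return values, nil
}

func main() {
	types := map[string]fieldType{"id": typeInt64, "region": typeString}
	vals, err := parsePartitionKey("id=42/region=eu", types)
	fmt.Println(vals["id"], vals["region"], err) // 42 eu <nil>
}
```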
table/transaction.go
Outdated
	switch t := typ.(type) {
	case iceberg.PrimitiveType:
		switch t {
		case iceberg.PrimitiveTypes.Int32:
			if v, ok := value.(int32); ok {
				return iceberg.EqualTo(term, v)
			}
		case iceberg.PrimitiveTypes.Int64:
			if v, ok := value.(int64); ok {
				return iceberg.EqualTo(term, v)
			}
		case iceberg.PrimitiveTypes.Float32:
			if v, ok := value.(float32); ok {
				return iceberg.EqualTo(term, v)
			}
		case iceberg.PrimitiveTypes.Float64:
			if v, ok := value.(float64); ok {
				return iceberg.EqualTo(term, v)
			}
		case iceberg.PrimitiveTypes.String:
			if v, ok := value.(string); ok {
				return iceberg.EqualTo(term, v)
			}
		case iceberg.PrimitiveTypes.Bool:
			if v, ok := value.(bool); ok {
				return iceberg.EqualTo(term, v)
			}
		}
	}
the types and casting should be handled for you once the expression is bound, so you shouldn't need the iceberg.Type; just do a switch on value.(type) and call iceberg.EqualTo(term, v)
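A minimal sketch of the simplification being suggested: switch on the value's dynamic Go type and build the predicate, leaving type checking to expression binding. `equalTo` and `buildEqual` are hypothetical stand-ins invented for this example (the real generic `iceberg.EqualTo` returns an expression, not a string):

```go
package main

import "fmt"

// equalTo is a hypothetical stand-in for a generic EqualTo; here it
// just renders the predicate as text.
func equalTo[T any](term string, v T) string {
	return fmt.Sprintf("%s = %v", term, v)
}

// buildEqual switches on the literal's dynamic Go type directly, with
// no nested switch on the iceberg.Type: binding the resulting
// expression against the schema handles type checks and casts.
func buildEqual(term string, value any) (string, error) {
	switch v := value.(type) {
	case int32:
		return equalTo(term, v), nil
	case int64:
		return equalTo(term, v), nil
	case float32:
		return equalTo(term, v), nil
	case float64:
		return equalTo(term, v), nil
	case string:
		return equalTo(term, v), nil
	case bool:
		return equalTo(term, v), nil
	default:
		return "", fmt.Errorf("unsupported literal type %T", value)
	}
}

func main() {
	expr, err := buildEqual("id", int64(7))
	fmt.Println(expr, err) // id = 7 <nil>
}
```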
table/transaction.go
Outdated
	}

	// deleteFileByFilter performs a delete operation with the given filter and snapshot properties.
	func (t *Transaction) deleteFileByFilter(ctx context.Context, filter iceberg.BooleanExpression, snapshotProps iceberg.Properties) error {
I'm also working on a complete delete API (CoW) in #518 that can delete at the row level and the file level based on a predicate.
Hopefully we won't need this method once the full delete API is supported.
Signed-off-by: dttung2905 <ttdao.2015@accountancy.smu.edu.sg>
No description provided.