[iceberg] support migrating iceberg table which had suffered schema evolution #5083

LsomeYeah · 2025-02-14T02:25:43Z

Purpose

Linked issue: close #xxx

PRs #4639 and #4878 added support for migrating Iceberg tables managed by hadoop-catalog or hive-catalog to Paimon. This PR adds support for migrating Iceberg tables that have undergone one or more schema evolutions.

Paimon stores a schema-id in each DataFileMeta so that it can read data files written under an earlier schema. During migration, we extract the schema-id used by each Iceberg data file and record it in the corresponding Paimon DataFileMeta, which lets Paimon handle the schema evolution case.
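
To illustrate why a per-file schema-id is enough, here is a minimal sketch. It assumes columns are matched between schemas by a stable field id. `Field`, `FileMeta`, `indexMapping` and `project` are hypothetical names made up for this example; they are not the Paimon or Iceberg classes touched by this PR.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch only: these types are NOT Paimon's or Iceberg's real classes.
class SchemaEvolutionSketch {

    /** A column with a stable field id; a rename changes only the name. */
    static final class Field {
        final int id;
        final String name;

        Field(int id, String name) {
            this.id = id;
            this.name = name;
        }
    }

    /** Stand-in for per-file metadata: the path plus the id of the schema the file was written with. */
    static final class FileMeta {
        final String path;
        final long schemaId;

        FileMeta(String path, long schemaId) {
            this.path = path;
            this.schemaId = schemaId;
        }
    }

    /** For each column of the current schema, its position in the file's schema, or -1 if absent. */
    static int[] indexMapping(List<Field> current, List<Field> fileSchema) {
        Map<Integer, Integer> positionById = new HashMap<>();
        for (int i = 0; i < fileSchema.size(); i++) {
            positionById.put(fileSchema.get(i).id, i);
        }
        int[] mapping = new int[current.size()];
        for (int i = 0; i < current.size(); i++) {
            mapping[i] = positionById.getOrDefault(current.get(i).id, -1);
        }
        return mapping;
    }

    /** Reshapes a row read from an old file into the current schema; missing columns become null. */
    static Object[] project(Object[] fileRow, int[] mapping) {
        Object[] out = new Object[mapping.length];
        for (int i = 0; i < mapping.length; i++) {
            out[i] = mapping[i] < 0 ? null : fileRow[mapping[i]];
        }
        return out;
    }

    public static void main(String[] args) {
        Map<Long, List<Field>> schemas = new HashMap<>();
        schemas.put(0L, Arrays.asList(new Field(1, "id"), new Field(2, "name")));
        // Schema 1: "name" was renamed to "full_name" and "age" was added.
        schemas.put(1L, Arrays.asList(
                new Field(1, "id"), new Field(2, "full_name"), new Field(3, "age")));

        // The migrated file remembers that it was written with schema 0.
        FileMeta oldFile = new FileMeta("data-0.parquet", 0L);
        int[] mapping = indexMapping(schemas.get(1L), schemas.get(oldFile.schemaId));
        Object[] row = project(new Object[] {7, "alice"}, mapping);
        System.out.println(Arrays.toString(row)); // prints [7, alice, null]
    }
}
```

In this sketch, a file without a recorded schema-id would have to be read with the latest schema, and the two-column row above could not be mapped correctly. That is the failure the per-file schema-id avoids.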

Tests

IcebergMigrateTest#testDeleteColumn
IcebergMigrateTest#testRenameColumn
IcebergMigrateTest#testAddColumn
IcebergMigrateTest#testReorderColumn
IcebergMigrateTest#testUpdateColumn
IcebergMigrateTest#testMigrateWithRandomIcebergEvolution

API and Format

Documentation

LsomeYeah added 2 commits February 14, 2025 10:15

[core] support iceberg schema evolution

759f6fb

[core] allow define options for target table

902bbd0

tsreaper approved these changes Feb 27, 2025

tsreaper merged commit 7210cf6 into apache:master Feb 27, 2025
12 checks passed
