HIVE-29287: Iceberg: Variant Shredding support #6152

deniskuzZ · 2025-10-23T14:45:56Z

What changes were proposed in this pull request?

Support for variant shredding, enabling Hive to write shredded variant data into Iceberg tables.

Ideally, this should follow the approach described in the reader/writer API proposal for Iceberg V4, where an execution engine provides the shredded writer schema.

As an interim solution, this PR introduces a writer that infers the shredded schema from the sample record captured before the Parquet writer is initialized.

Why are the changes needed?

Enables data skipping (predicate pushdown)

Does this PR introduce any user-facing change?

No

How was this patch tested?

variant_type_shredding.q

deniskuzZ · 2025-10-31T13:58:02Z

same thing as apache/iceberg#14297

deniskuzZ · 2025-10-31T15:25:36Z

iceberg/iceberg-handler/src/test/results/positive/variant_type_shredding.q.out

+                TableScan
+                  alias: tbl_shredded_variant
+                  filterExpr: (UDFToDouble(variant_get(data, '$.age')) > 25.0D) (type: boolean)
+                  Statistics: Num rows: 3 Data size: 1020 Basic stats: COMPLETE Column stats: NONE


PPD is not supported here, would be addressed in a separate JIRA

sonarqubecloud · 2025-11-01T12:30:00Z

Quality Gate passed

Issues
27 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

kokila-19 · 2025-11-18T11:51:09Z

I tested variant_type_shredding.q by removing 'variant.shredding.enabled'='true' from the table properties, and the qtest still passes without any failures.
This test verifies that basic INSERT/SELECT operations succeed with VARIANT columns but not actual shredding in the file.
This testing is not possible in qtest.

so maybe we can add a JUnit test (e.g., TestVariantShredding) that:
Writes VARIANT data with variant.shredding.enabled=true and false
Opens the resulting Parquet files via ParquetFileReader
Asserts that the typed_value field is present/absent accordingly

deniskuzZ marked this pull request as draft October 23, 2025 14:46

asf-ci-hive added the tests pending label Oct 23, 2025

deniskuzZ changed the title ~~[DRAFT] HIVE-29287: Variant Shredding~~ [DRAFT] HIVE-29287: Iceberg: Variant Shredding Oct 23, 2025

asf-ci-hive added tests unstable and removed tests pending labels Oct 23, 2025

deniskuzZ force-pushed the HIVE-29287 branch from 3fdd44c to 1d0c300 Compare October 31, 2025 14:13

deniskuzZ changed the title ~~[DRAFT] HIVE-29287: Iceberg: Variant Shredding~~ HIVE-29287: Iceberg: Variant Shredding support Oct 31, 2025

asf-ci-hive added tests pending and removed tests unstable labels Oct 31, 2025

deniskuzZ marked this pull request as ready for review October 31, 2025 14:14

HIVE-29287: Variant Shredding support

213a62f

deniskuzZ force-pushed the HIVE-29287 branch from 1d0c300 to 213a62f Compare October 31, 2025 15:24

deniskuzZ commented Oct 31, 2025

View reviewed changes

deniskuzZ requested a review from ayushtkn October 31, 2025 15:37

sonar

a38cb67

asf-ci-hive added tests passed tests pending tests unstable and removed tests pending tests passed tests unstable labels Oct 31, 2025

asf-ci-hive added tests passed and removed tests pending labels Nov 1, 2025

deniskuzZ requested a review from difin November 6, 2025 16:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

HIVE-29287: Iceberg: Variant Shredding support #6152

HIVE-29287: Iceberg: Variant Shredding support #6152

Uh oh!

deniskuzZ commented Oct 23, 2025 •

edited

Loading

Uh oh!

deniskuzZ commented Oct 31, 2025

Uh oh!

deniskuzZ Oct 31, 2025 •

edited

Loading

Uh oh!

sonarqubecloud bot commented Nov 1, 2025

Uh oh!

kokila-19 commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

HIVE-29287: Iceberg: Variant Shredding support #6152

Are you sure you want to change the base?

HIVE-29287: Iceberg: Variant Shredding support #6152

Uh oh!

Conversation

deniskuzZ commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

deniskuzZ commented Oct 31, 2025

Uh oh!

deniskuzZ Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud bot commented Nov 1, 2025

Quality Gate passed

Uh oh!

kokila-19 commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

deniskuzZ commented Oct 23, 2025 •

edited

Loading

deniskuzZ Oct 31, 2025 •

edited

Loading