Column projection added along with test function by Arijit6258 · Pull Request #19449 · apache/druid

Arijit6258 · 2026-05-11T21:01:04Z

Fixes #19267 .

Description

Added column projection to determine which columns to read from iceberg table. This change will help greatly to improve read efficiency for use cases where whole table scan is not intended.

Key changed/added classes in this PR

IcebergCatalog
IcebergInputSource
IcebergInputSourceTest

This PR has:

been self-reviewed.
a release note entry in the PR description.
added unit tests or modified existing tests to cover new code paths, ensuring the threshold for code coverage is met.

FrankChen021

Severity	Findings
P0	0
P1	0
P2	1
P3	0
Total	1

Reviewed 3 of 3 changed files.

This is an automated review by Codex GPT-5.5

FrankChen021 · 2026-05-12T13:19:19Z

+            .map(Types.NestedField::name)
+            .filter(columnsFilter::apply)
+            .collect(Collectors.toList());
+        tableScan = tableScan.select(new ArrayList<>(projectedColumns));


[P2] Projection is discarded before data is read

This selects projected columns on the Iceberg TableScan, but the method only returns task.file().location() afterward. The projected FileScanTask schema is discarded, and IcebergInputSource builds the warehouse delegate from the same raw file paths, so Druid's Parquet reader still opens the original files without the Iceberg projection. The new test also manually projects with Parquet.read(...).project(...), so it would pass even if this select had no effect. To make column projection work, the projected schema/split information needs to be carried into the reader path or pruning needs to happen in the delegate input format.

Arijit6258 added 2 commits May 12, 2026 01:58

Column projection added along with test function

02feaca

Removing functions not needed.

2ea2cbf

FrankChen021 reviewed May 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Column projection added along with test function#19449

Column projection added along with test function#19449
Arijit6258 wants to merge 2 commits into
apache:masterfrom
Arijit6258:Column-Projection-In-Druid-Iceberg-Extension

Arijit6258 commented May 11, 2026

Uh oh!

FrankChen021 left a comment

Uh oh!

FrankChen021 May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Arijit6258 commented May 11, 2026

Description

Key changed/added classes in this PR

Uh oh!

FrankChen021 left a comment

Choose a reason for hiding this comment

Uh oh!

FrankChen021 May 12, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants