Skip to content

Conversation

@mbutrovich
Copy link
Contributor

@mbutrovich mbutrovich commented Feb 2, 2026

Which issue does this PR close?

N/A.

Rationale for this change

apache/iceberg-rust#2100 adds Parquet metadata caching to iceberg-rust's ArrowReader. We want to test that in Comet CI. Note that this builds on top of #3349's changes and should not be merged.

What changes are included in this PR?

Point iceberg-rust dependency at feature branch for apache/iceberg-rust#2100 and run cargo update.

How are these changes tested?

Existing tests.

…oned columns) and run a representative test.
1. findAllIcebergSplitData() collected perPartitionByLocation (all partitions' data)
2. This map was captured in the createCometExecIter closure
3. ZippedPartitionsRDD serialized that closure to every task
4. Each task received ALL partitions' data (925 bytes to both tasks)

Instead we now use CometIcebergSplitRDD which puts per-partition data in Partition objects.
… columns), fixes TestRuntimeFiltering Iceberg Java tests with column renames.

CometIcebergSplitRDD registers subqueries so native code can look them up, fixes TestViews Iceberg Java tests with rewritten filter.
… assertion at index lookup, and defensive fallback if future Spark behavior changes.
# Conflicts:
#	.github/workflows/iceberg_spark_test.yml
@codecov-commenter
Copy link

codecov-commenter commented Feb 2, 2026

Codecov Report

❌ Patch coverage is 81.06796% with 78 lines in your changes missing coverage. Please review.
✅ Project coverage is 60.17%. Comparing base (f09f8af) to head (5519093).
⚠️ Report is 918 commits behind head on main.

Files with missing lines Patch % Lines
...e/spark/sql/comet/CometIcebergNativeScanExec.scala 61.53% 10 Missing and 20 partials ⚠️
...n/scala/org/apache/comet/rules/CometScanRule.scala 21.73% 11 Missing and 7 partials ⚠️
.../comet/serde/operator/CometIcebergNativeScan.scala 89.79% 13 Missing and 2 partials ⚠️
...n/scala/org/apache/spark/sql/comet/operators.scala 87.67% 2 Missing and 7 partials ⚠️
...cala/org/apache/spark/sql/comet/CometExecRDD.scala 93.33% 3 Missing and 3 partials ⚠️
Additional details and impacted files
@@             Coverage Diff              @@
##               main    #3365      +/-   ##
============================================
+ Coverage     56.12%   60.17%   +4.05%     
- Complexity      976     1502     +526     
============================================
  Files           119      174      +55     
  Lines         11743    16348    +4605     
  Branches       2251     2713     +462     
============================================
+ Hits           6591     9838    +3247     
- Misses         4012     5129    +1117     
- Partials       1140     1381     +241     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@mbutrovich mbutrovich closed this Feb 2, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants