What is the problem the feature request solves?
We would like Comet to fully support complex types (arrays, structs, and maps). This issue is for tracking all of the individual issues.
Google doc: https://docs.google.com/document/d/1eiDFEScPjxBMahJW6lmBI8JjVlI6CwhiJgkTSsTvPVY/edit?usp=sharing
Implement new native scans based on DataFusion's DataSourceExec
We now have new native_datafusion and native_iceberg_compat scans that use DataFusion's DataSourceExec, which already supports complex types.
We need to fix the remaining Spark SQL test failures:
Known issues:
native_datafusion/native_iceberg_compat scans case sensitive #1574
ParquetEncryptionITCase fails with native_iceberg_compat #1488
Other scan-related work
These items may not be relevant to all users, but for some environments, there is more work required to allow the new ParquetExec scans to be used. Comet's current default native_comet scan is JVM-based and leverages Hadoop data source functionality that is not available in DataFusion.
Supporting expressions that operate on complex types
to_json
Performance
Testing
native_datafusion and native_iceberg_compat with all supported Java, Spark, and Scala versions #1486
native_datafusion and native_iceberg_compat with Spark SQL tests #1489
Older / related issues:
Describe the potential solution
No response
Additional context
No response