You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<!--
Thanks for opening a pull request!
-->
<!-- In the case this PR will resolve an issue, please replace
${GITHUB_ISSUE_ID} below with the actual Github issue id. -->
<!-- Closes #${GITHUB_ISSUE_ID} -->
# Rationale for this change
Adds support for Bodo DataFrame library, which is a drop in replacement
for Pandas that accelerates and scales Python code automatically by
applying query, compiler and HPC optimizations.
# Are these changes tested?
Added integration test.
# Are there any user-facing changes?
Adds `Table.to_bodo()` function. Example code:
```python
df = table.to_bodo() # equivalent to `bodo.pandas.read_iceberg_table(table)`
df = df[df["trip_distance"] >= 10.0]
df = df[["VendorID", "tpep_pickup_datetime", "tpep_dropoff_datetime"]]
print(df)
```
<!-- In the case of user-facing changes, please add the changelog label.
-->
Bodo is optimized to take advantage of Iceberg features such as hidden partitioning and various statistics for efficient reads.
1571
+
1526
1572
### Daft
1527
1573
1528
1574
PyIceberg interfaces closely with Daft Dataframes (see also: [Daft integration with Iceberg](https://docs.daft.ai/en/stable/io/iceberg/)) which provides a full lazily optimized query engine interface on top of PyIceberg tables.
0 commit comments