@@ -496,17 +496,29 @@ You can find more information about the Hugging Face Dataset Hub in the [documen
## Out-of-core
- ### [Bodo](https://bodo.ai/)
+ ### [Bodo](https://github.com/bodo-ai/Bodo)
- Bodo is a high-performance Python computing engine that automatically parallelizes and
- optimizes your code through compilation using HPC (high-performance computing) techniques.
- Designed to operate with native pandas dataframes, Bodo compiles your pandas code to execute
- across multiple cores on a single machine or distributed clusters of multiple compute nodes efficiently.
- Bodo also makes distributed pandas dataframes queryable with SQL.

- The community edition of Bodo is free to use on up to 8 cores. Beyond that, Bodo offers a paid
- enterprise edition. Free licenses of Bodo (for more than 8 cores) are available
- [upon request](https://www.bodo.ai/contact) for academic and non-profit use.
+ Bodo is a high-performance compute engine for Python data processing.
+ Using an auto-parallelizing just-in-time (JIT) compiler, Bodo simplifies scaling pandas
+ workloads from laptops to clusters without major code changes.
+ Under the hood, Bodo relies on MPI-based high-performance computing (HPC) technology, making it
+ both easier to use and often much faster than alternatives.
+ Bodo also provides a SQL engine that can query distributed pandas dataframes efficiently.
+
+ ```python
+ import pandas as pd
+ import bodo
+
+ @bodo.jit
+ def process_data():
+     df = pd.read_parquet("my_data.pq")
+     df2 = pd.DataFrame({"A": df.apply(lambda r: 0 if r.A == 0 else (r.B // r.A), axis=1)})
+     df2.to_parquet("out.pq")
+
+ process_data()
+ ```
+
### [Cylon](https://cylondata.org/)
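A note on the `process_data` example added in this diff: the row-wise expression guards against division by zero by returning 0 whenever column `A` is 0, and otherwise floor-divides `B` by `A`. The sketch below is a plain-pandas check of that logic only, without Bodo or Parquet files. The helper name `safe_floor_div` and the sample values are hypothetical and not part of the PR.

```python
import pandas as pd


def safe_floor_div(df: pd.DataFrame) -> pd.DataFrame:
    """Apply the same row-wise expression as the diff's example, in plain pandas."""
    # 0 where A is 0, otherwise the floor division B // A.
    result = df.apply(lambda r: 0 if r.A == 0 else (r.B // r.A), axis=1)
    return pd.DataFrame({"A": result})


# Hypothetical in-memory sample; the diff reads "my_data.pq" instead.
sample = pd.DataFrame({"A": [0, 2, 3], "B": [5, 7, 9]})
out = safe_floor_div(sample)
print(out["A"].tolist())  # [0, 3, 3]
```

Running the same function body under `@bodo.jit`, as the diff does, should give the same values if Bodo supports this `apply` pattern as the PR's example implies; that has not been verified here.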