Skip to content

Commit 0774547

Browse files
committed
remove progress bar and other small issues
1 parent a41eb2e commit 0774547

File tree

2 files changed

+11
-10
lines changed

2 files changed

+11
-10
lines changed

docs/_freeze/posts/ibisml/index/execute-results/html.json

+3-3
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

docs/posts/ibisml/index.qmd

+8-7
Original file line numberDiff line numberDiff line change
@@ -1,25 +1,24 @@
11
---
22
title: "Using IbisML and DuckDB for a Kaggle competition: credit risk model stability"
33
author: "Jiting Xu"
4-
date: "2024-08-15"
4+
date: "2024-08-21"
55
categories:
66
- blog
7-
- DuckDB
7+
- duckdb
88
- machine learning
99
- feature engineering
10-
execute:
11-
freeze: auto
1210
---
1311

1412
## Introduction
15-
In this post, we'll demonstrate how to use Ibis and IbisML end-to-end for the
13+
In this post, we'll demonstrate how to use Ibis and [IbisML](https://github.com/ibis-project/ibis-ml)
14+
end-to-end for the
1615
[credit risk model stability Kaggle competition](https://www.kaggle.com/competitions/home-credit-credit-risk-model-stability).
1716

1817
1. Load data and perform feature engineering on DuckDB backend using IbisML
1918
2. Perform last-mile ML data preprocessing on DuckDB backend using IbisML
2019
3. Train two models using different frameworks:
2120
* An XGBoost model within a scikit-learn pipeline.
22-
* A neural network with PyTorch and PyTorch Lightning
21+
* A neural network with PyTorch and PyTorch Lightning.
2322

2423
The aim of this competition is to predict which clients are more likely to default on their
2524
loans by using both internal and external data sources.
@@ -93,6 +92,8 @@ ibis.options.interactive = True
9392
Set the backend for computing:
9493
```{python}
9594
con = ibis.duckdb.connect()
95+
# remove the black bars from duckdb's progress bar
96+
con.raw_sql("set enable_progress_bar = false")
9697
# DuckDB is the default backend for Ibis
9798
ibis.set_backend(con)
9899
```
@@ -612,7 +613,7 @@ Calculate all the days difference between any date columns and the column `date_
612613
#| code-summary: "Show code to calculate days difference between date columns and date_decision"
613614
date_cols = [col_name for col_name in df_train.columns if col_name[-1] == "D"]
614615
days_to_decision_expr = {
615-
# Difference in days
616+
# difference in days
616617
f"{col}_date_decision_diff": (
617618
_.date_decision.epoch_seconds() - getattr(_, col).epoch_seconds()
618619
)

0 commit comments

Comments
 (0)