You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+62-62
Original file line number
Diff line number
Diff line change
@@ -21,6 +21,17 @@
21
21
22
22
Mambular is a Python library for tabular deep learning. It includes models that leverage the Mamba (State Space Model) architecture, as well as other popular models like TabTransformer, FTTransformer, TabM and tabular ResNets. Check out our paper `Mambular: A Sequential Model for Tabular Deep Learning`, available [here](https://arxiv.org/abs/2408.06291). Also check out our paper introducing [TabulaRNN](https://arxiv.org/pdf/2411.17207) and analyzing the efficiency of NLP inspired tabular models.
23
23
24
+
<h3>⚡ What's New ⚡</h3>
25
+
<ul>
26
+
<li>Individual preprocessing: preprocess each feature differently, use pre-trained models for categorical encoding</li>
27
+
<li>Extract latent representations of tables</li>
28
+
<li>Use embeddings as inputs</li>
29
+
<li>Define custom training metrics</li>
30
+
</ul>
31
+
32
+
33
+
34
+
24
35
<h3> Table of Contents </h3>
25
36
26
37
-[🏃 Quickstart](#-quickstart)
@@ -30,7 +41,6 @@ Mambular is a Python library for tabular deep learning. It includes models that
30
41
-[🛠️ Installation](#️-installation)
31
42
-[🚀 Usage](#-usage)
32
43
-[💻 Implement Your Own Model](#-implement-your-own-model)
33
-
-[Custom Training](#custom-training)
34
44
-[🏷️ Citation](#️-citation)
35
45
-[License](#license)
36
46
@@ -103,6 +113,7 @@ pip install mamba-ssm
103
113
<h2> Preprocessing </h2>
104
114
105
115
Mambular simplifies data preprocessing with a range of tools designed for easy transformation of tabular data.
116
+
Specify a default method, or a dictionary defining individual preprocessing methods for each feature.
106
117
107
118
<h3> Data Type Detection and Transformation </h3>
108
119
@@ -116,6 +127,7 @@ Mambular simplifies data preprocessing with a range of tools designed for easy t
116
127
-**Polynomial Features**: Automatically generates polynomial and interaction terms for numerical features, enhancing the ability to capture higher-order relationships.
117
128
-**Box-Cox & Yeo-Johnson Transformations**: Performs power transformations to stabilize variance and normalize distributions.
118
129
-**Custom Binning**: Enables user-defined bin edges for precise discretization of numerical data.
130
+
-**Pre-trained Encoding**: Use sentence transformers to encode categorical features.
If you prefer to setup custom training, preprocessing and evaluation, you can simply use the `mambular.base_models`.
336
-
Just be careful that all basemodels expect lists of features as inputs. More precisely as list for numerical features and a list for categorical features. A custom training loop, with random data could look like this.
337
385
338
-
```python
339
-
import torch
340
-
import torch.nn as nn
341
-
import torch.optim as optim
342
-
from mambular.base_models import Mambular
343
-
from mambular.configs import DefaultMambularConfig
0 commit comments