sap-ai-research
/

contexttab

foundation-model

Model card Files Files and versions Community

marekpolewczyk commited on 5 days ago

Commit

a19ec95

·

verified ·

1 Parent(s): 8067157

Update README.md

Files changed (1) hide show

README.md +19 -3

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ tags:
 ## Description
-Implementation of the deep learning model with the inference pipeline described in the paper "ConTextTab: A Semantics-Aware Tabular In-Context Learner".
 ![logo](./ConTextTab_architecture.png)
 ## Abstract
@@ -25,6 +25,8 @@ Tabular in-context learning (ICL) has recently achieved state-of-the-art (SOTA)
 ## Requirements
 The requirements are detailed in the `requirements.txt` file for Python 3.11 version.
 Local development installation:
@@ -53,7 +55,7 @@ X, y = load_breast_cancer(return_X_y=True)
 X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.5, random_state=42)
 # Initialize a classifier
-clf = ConTextTabClassifier(bagging=1, max_context_size=2048)
 clf.fit(X_train, y_train)
@@ -82,7 +84,7 @@ y = df.target.astype(float)
 X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.5, random_state=42)
 # Initialize the regressor
-regressor = ConTextTabRegressor(bagging=1, max_context_size=2048)
 regressor.fit(X_train, y_train)
@@ -93,6 +95,20 @@ r2 = r2_score(y_test, predictions)
 print("R² Score:", r2)
 ```
 ## Known Issues
 No known issues

 ## Description
+Implementation of the deep learning model with the inference pipeline described in the paper ["ConTextTab: A Semantics-Aware Tabular In-Context Learner"](https://arxiv.org/abs/2506.10707).
 ![logo](./ConTextTab_architecture.png)
 ## Abstract
 ## Requirements
+This project uses model checkpoints available on https://huggingface.co/sap-ai-research/contexttab that are automatically downloaded when running the model.
 The requirements are detailed in the `requirements.txt` file for Python 3.11 version.
 Local development installation:
 X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.5, random_state=42)
 # Initialize a classifier
+clf = ConTextTabClassifier(bagging=1, max_context_size=2048, test_chunk_size=1000)
 clf.fit(X_train, y_train)
 X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.5, random_state=42)
 # Initialize the regressor
+regressor = ConTextTabRegressor(bagging=1, max_context_size=2048, test_chunk_size=1000)
 regressor.fit(X_train, y_train)
 print("R² Score:", r2)
 ```
+## Citations
+If you use this model in your research or want to refer to our work, please cite:
+```
+@inproceedings{
+spinaci2025contexttab,
+title={ConTextTab: A Semantics-Aware Tabular In-Context Learner},
+author={Marco Spinaci and Marek Polewczyk and Maximilian Schambach and Sam Thelin},
+booktitle={1st ICML Workshop on Foundation Models for Structured Data},
+year={2025},
+url={https://openreview.net/forum?id=MmKuX9ZvM3}
+}
+```
 ## Known Issues
 No known issues