Install#


pip install skrub -U

Deep learning dependencies

Deep-learning based encoders like TextEncoder require installing optional dependencies to use them. The following will install torch, transformers, and sentence-transformers.

$ pip install skrub[transformers] -U

conda install -c conda-forge skrub

Deep learning dependencies

Deep-learning based encoders like TextEncoder require installing optional dependencies to use them. The following will install torch, transformers, and sentence-transformers.

$ conda install -c conda-forge skrub[transformers]

mamba install -c conda-forge skrub

Deep learning dependencies

Deep-learning based encoders like TextEncoder require installing optional dependencies to use them. The following will install torch, transformers, and sentence-transformers.

$ mamba install -c conda-forge skrub[transformers]

Advanced Usage for Contributors#

1. Fork the project#

To contribute to the project, you first need to fork skrub on GitHub.

That will enable you to push your commits to a branch on your fork.

2. Clone your fork#

Clone your forked repo to your local machine:

git clone https://github.com/<YOUR_USERNAME>/skrub
cd skrub

Next, add the upstream remote (i.e. the official skrub repository). This allows you to pull the latest changes from the main repository:

git remote add upstream https://github.com/skrub-data/skrub.git

Verify that both the origin (your fork) and upstream (official repo) are correctly set up:

git remote -v

3. Setup your environment#

Now, setup a development environment. For example, you can use conda to create a virtual environment:

conda create -n skrub python=3.10 # or any later python version
conda activate skrub

Install the local package in editable mode with development dependencies:

pip install -e ".[dev, lint, test]"

Enable pre-commit hooks to ensure code style consistency:

pre-commit install

Optionally, configure Git to ignore certain revisions in git blame and IDE integrations. These revisions are listed in .git-blame-ignore-revs:

git config blame.ignoreRevsFile .git-blame-ignore-revs

4. Run the tests#

To ensure your environment is correctly set up, run the test suite:

pytest -s skrub/tests

Testing should take about 5 minutes. If no errors or failures are found, your environment is ready for development!

Now that you’re set up, review our implementation guidelines and start coding!


Deep learning dependencies

Deep-learning based encoders like TextEncoder require installing optional dependencies to use them. The following will install torch, transformers, and sentence-transformers.

$ pip install -e ".[transformers]"