Building a pipeline#

See End-to-End pipeline for further details. For more flexibility and control to build pipelines, see the skrub DataOps.

tabular_pipeline

Get a simple machine-learning pipeline for tabular data.

TableVectorizer

Transform a dataframe to a numeric (vectorized) representation.

ApplyToCols

Apply a transformer to selected columns in a dataframe.

SelectCols

Select a subset of a DataFrame's columns.

DropCols

Drop a subset of a DataFrame's columns.