DatasetAll

Usage examples at the bottom of this page.

class skrub.datasets.DatasetAll(name, description, source, target, X, y, path, read_csv_kwargs)

Represents a dataset and its metadata. In this state, the dataset is loaded in memory as a DataFrame (X and y). The path and read_csv_kwargs attributes are also provided in case the DataFrame has to be read from disk, for example:

import pandas as pd
from skrub.datasets import fetch_employee_salaries
ds = fetch_employee_salaries(load_dataframe=False)
df = pd.read_csv(ds.path, **ds.read_csv_kwargs)
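The pattern above, where a dataset object stores both a file path and the keyword arguments needed to parse it, can be tried offline with a minimal sketch. Everything below is an assumption made for illustration: `DatasetSketch` is a hypothetical stand-in that only mirrors the field names listed in the signature (it is not skrub's implementation), the CSV content is made up, and the sketch assumes that X holds the non-target columns and y the target column.

```python
import tempfile
from dataclasses import dataclass, field
from pathlib import Path

import pandas as pd


@dataclass
class DatasetSketch:
    """Hypothetical stand-in mirroring the documented DatasetAll fields."""

    name: str
    description: str
    source: str
    target: str
    X: pd.DataFrame
    y: pd.Series
    path: Path
    read_csv_kwargs: dict = field(default_factory=dict)


def build_sketch(directory):
    """Write a tiny made-up CSV and wrap it in a DatasetSketch."""
    path = Path(directory) / "toy.csv"
    # Semicolon-separated on purpose, so the file needs non-default kwargs.
    path.write_text("city;salary\nParis;10\nRome;20\n")
    kwargs = {"sep": ";"}
    df = pd.read_csv(path, **kwargs)
    return DatasetSketch(
        name="toy",
        description="Made-up data for illustration only.",
        source="local file",
        target="salary",
        X=df.drop(columns="salary"),  # assumption: X is every non-target column
        y=df["salary"],               # assumption: y is the target column
        path=path,
        read_csv_kwargs=kwargs,
    )


with tempfile.TemporaryDirectory() as tmp:
    ds = build_sketch(tmp)
    # Re-read from disk exactly as in the snippet above: path plus stored kwargs.
    reloaded = pd.read_csv(ds.path, **ds.read_csv_kwargs)
```

Storing the keyword arguments next to the path means the caller does not need to know the file's separator or other parsing details to rebuild the same DataFrame.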

Examples using skrub.datasets.DatasetAll

Getting Started

Encoding: from a dataframe to a numerical matrix for machine learning

Feature interpretation with the GapEncoder