skrub.datasets.DatasetInfoOnly#

Usage examples at the bottom of this page.

class skrub.datasets.DatasetInfoOnly(name, description, source, target, path, read_csv_kwargs)[source]#

Represents a dataset and its information. With this state, the dataset is NOT loaded in memory, but can be read with path and read_csv_kwargs, as such:

ds = fetch_employee_salaries(load_dataframe=False)
df = pd.read_csv(ds.path, **ds.read_csv_kwargs)