fetch_open_payments#

skrub.datasets.fetch_open_payments(data_home=None)[source]#

Fetches the open payments dataset (classification), available at https://github.com/skrub-data/skrub-data-files

Description of the dataset:

Payments given by healthcare manufacturing companies to medical doctors or hospitals. Size on disk: 8.7MB.

Parameters:
data_homestr or path-like, default=None

The directory where to download and unzip the files.

Returns:
bunchBunch

A dictionary-like object with the following keys:

open_paymentsDataFrame of shape (73558, 6)

The dataframe.

XDataFrame of shape (73558, 5)

Features, i.e. the dataframe without the target labels.

yDataFrame of shape (73558, 1)

Target labels.

metadatadict

A dictionary containing the name, description, source and target.

pathstr

The path to the open payments CSV file.