fetch_drug_directory

skrub.datasets.fetch_drug_directory(data_home=None)

Fetches the drug directory dataset (classification), available at skrub-data/skrub-data-files.

Description of the dataset:

Product listing data submitted to the U.S. FDA for all unfinished, unapproved drugs.

Parameters:
data_home: str or path, default=None

The directory in which to download and unzip the files.

Returns:
bunch: sklearn.utils.Bunch

A dictionary-like object with the following keys:

  • drug_directory : pd.DataFrame, the dataframe

  • X : pd.DataFrame, features, i.e. the dataframe without the target labels

  • y : pd.DataFrame, target labels

  • metadata : a dictionary containing the name, description, source and target
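
The returned keys listed above can be unpacked as in the following minimal sketch. It assumes skrub is installed and that the files can be downloaded on first use. The helper names `missing_keys` and `load_drug_directory` are hypothetical and are not part of skrub; only `fetch_drug_directory` and its `data_home` parameter come from this page.

```python
# Keys documented for the returned Bunch.
EXPECTED_KEYS = ("drug_directory", "X", "y", "metadata")


def missing_keys(bunch):
    """Return the documented keys that are absent from a dict-like bunch."""
    return [key for key in EXPECTED_KEYS if key not in bunch]


def load_drug_directory(data_home=None):
    """Fetch the dataset and return (X, y, metadata).

    Hypothetical convenience wrapper: it needs skrub and, on the first
    call, network access to download the files into data_home.
    """
    # Imported lazily so this module can be imported without skrub installed.
    from skrub.datasets import fetch_drug_directory

    bunch = fetch_drug_directory(data_home=data_home)
    absent = missing_keys(bunch)
    if absent:
        raise KeyError(f"bunch is missing documented keys: {absent}")
    # Bunch is dictionary-like, so key access works for every entry.
    return bunch["X"], bunch["y"], bunch["metadata"]
```

Calling `X, y, metadata = load_drug_directory()` would then give the feature dataframe, the target labels, and the metadata dictionary. Passing a directory as `data_home` should control where the files are stored.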