fetch_drug_directory
- skrub.datasets.fetch_drug_directory(data_home=None)
Fetches the drug directory dataset (classification), available at skrub-data/skrub-data-files.
- Description of the dataset:
Product listing data submitted to the U.S. FDA for all unfinished, unapproved drugs. Size on disk: 44MB.
- Parameters:
- data_home: str or path, default=None
The directory in which to download and unzip the files.
- Returns:
- bunch: sklearn.utils.Bunch
A dictionary-like object with the following keys:
- drug_directory: pd.DataFrame, the dataframe. Shape: (120215, 21)
- X: pd.DataFrame, features, i.e. the dataframe without the target labels. Shape: (120215, 20)
- y: pd.DataFrame, target labels. Shape: (120215, 1)
- metadata: a dictionary containing the name, description, source and target
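A minimal usage sketch of the fetcher and the returned keys, assuming skrub is installed and the files can be downloaded on first use. The helper names `describe_bunch` and `show_drug_directory` are hypothetical and introduced only for illustration; only `fetch_drug_directory`, its `data_home` parameter, and the keys listed above come from this page.

```python
def describe_bunch(bunch):
    """Return the shape of each tabular entry of the fetcher's result.

    Hypothetical helper: it only relies on dictionary-style access to the
    keys documented above and on each entry exposing a ``.shape``.
    """
    return {key: bunch[key].shape for key in ("drug_directory", "X", "y")}


def show_drug_directory(data_home=None):
    """Fetch the dataset and print a short summary (hypothetical helper)."""
    # Imported here so that describe_bunch can be used without skrub installed.
    from skrub.datasets import fetch_drug_directory

    # data_home=None leaves the choice of download directory to skrub.
    bunch = fetch_drug_directory(data_home=data_home)

    # Print the shape of the full dataframe, the features and the target.
    for key, shape in describe_bunch(bunch).items():
        print(key, shape)

    # The metadata entry holds the name, description, source and target.
    print(bunch["metadata"])
    return bunch
```

Calling `show_drug_directory()` triggers the download (about 44MB on disk according to the description above) and should print shapes matching those listed in the Returns section, provided the hosted files are unchanged.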