fetch_medical_charge#

skrub.datasets.fetch_medical_charge(data_home=None)[source]#

Fetches the medical charge dataset (regression), available at skrub-data/skrub-data-files

Description of the dataset:

The Inpatient Utilization and Payment Public Use File (Inpatient PUF) provides information on inpatient discharges for Medicare fee-for-service beneficiaries. The Inpatient PUF includes information on utilization, payment (total payment and Medicare payment), and hospital-specific charges for the more than 3,000 U.S. hospitals that receive Medicare Inpatient Prospective Payment System (IPPS) payments. The PUF is organized by hospital and Medicare Severity Diagnosis Related Group (MS-DRG) and covers Fiscal Year (FY) 2011 through FY 2016.

Parameters:
data_home: str or path, default=None

The directory where to download and unzip the files.

Returns:
bunchsklearn.utils.Bunch

A dictionary-like object with the following keys:

  • medical_charge : pd.DataFrame, the dataframe

  • X : pd.DataFrame, features, i.e. the dataframe without the target labels

  • y : pd.DataFrame, target labels

  • metadata : a dictionary containing the name, description, source and target