fetch_traffic_violations
- skrub.datasets.fetch_traffic_violations(data_home=None)
Fetches the traffic violations dataset (classification), available at https://github.com/skrub-data/skrub-data-files
- Description of the dataset:
This dataset contains information on all electronic traffic violations issued in Montgomery County, MD. Any information that could be used to uniquely identify the vehicle, the vehicle owner, or the officer issuing the violation is not published. Size on disk: 736MB.
- Parameters:
- data_home
str or path-like, default=None. The directory in which to download and unzip the files.
- Returns:
- bunch
Bunch A dictionary-like object with the following keys:
- traffic_violations: DataFrame of shape (1578154, 43)
The full dataframe (features and target labels).
- X: DataFrame of shape (1578154, 42)
Features, i.e. the dataframe without the target labels.
- y: DataFrame of shape (1578154, 1)
Target labels.
- metadata: dict
A dictionary containing the name, description, source and target.
- path: str
The path to the traffic violations CSV file.
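A minimal usage sketch, assuming skrub is installed and that the returned bunch supports key access as described above. The helper names `load_traffic_violations` and `describe_bunch` are hypothetical and not part of skrub; only `fetch_traffic_violations` and its `data_home` parameter come from this page. The fetch is wrapped in a function because calling it triggers a download of roughly 736MB.

```python
def load_traffic_violations(data_home=None):
    """Fetch the dataset bunch (hypothetical wrapper, not part of skrub)."""
    # Imported lazily so describe_bunch can be used without skrub installed.
    from skrub.datasets import fetch_traffic_violations

    # data_home=None lets skrub choose its default download directory.
    return fetch_traffic_violations(data_home=data_home)


def describe_bunch(bunch):
    """Collect the shapes of the documented dataframes and the CSV path."""
    return {
        "traffic_violations": tuple(bunch["traffic_violations"].shape),
        "X": tuple(bunch["X"].shape),
        "y": tuple(bunch["y"].shape),
        "path": str(bunch["path"]),
    }


# Example (commented out because it downloads the full dataset):
# bunch = load_traffic_violations()
# print(describe_bunch(bunch))
# print(bunch["metadata"])
```

If the dataset matches this page, the reported shapes should agree with those listed under Returns, with `X` holding one column fewer than `traffic_violations`.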