fetch_movielens#

skrub.datasets.fetch_movielens(data_home=None)[source]#

Fetch the movielens dataset (regression) available at skrub-data/skrub-data-files

This is a regression use-case, where the goal is to predict movie ratings. More details are provided in the output’s metadata['description'].

Parameters:
data_home: str or path, default=None

The directory where to download and unzip the files.

Returns:
bunchsklearn.utils.Bunch

A dictionary-like object with the following keys:

  • movies : pd.DataFrame, movie ID, title and genres

  • ratings: pd.DataFrame, user ID, movie ID, rating

  • metadata : a dictionary containing the name source and description