Encoding a column#
See encoding for further details.
Generate a lightweight string encoding of a given column using tf-idf vectorization and truncated singular value decomposition (SVD). |
|
Encode string features by applying a pretrained language model downloaded from the HuggingFace Hub. |
|
Encode string categorical features by applying the MinHash method to n-gram decompositions of strings. |
|
Encode string columns by constructing latent topics. |
|
Encode string categories to a similarity matrix, to capture fuzziness across a few categories. |
|
Convert a string column to Categorical dtype. |
|
Extract temporal features such as month, day of the week, … from a datetime column. |
|
Parse datetimes represented as strings and return |
|
Convert DataFrame or column to Datetime dtype. |