Cleaning a dataframe#

deduplicate

Deduplicate categorical data by hierarchically clustering similar strings.