Cleaning a dataframe# deduplicate Deduplicate categorical data by hierarchically clustering similar strings.