Assembling
fuzzy_join()
, Joining
tables on non-normalized categories with approximate matching. Example
Joiner
,
AggJoiner
, transformers for joining multiple tables together. Example
Encoding
TableVectorizer
:
turn a pandas dataframe into a numerical array for
machine learning. Example
GapEncoder
,
OneHotEncoder but robust to typos or non-normalized categories. Example
Cleaning
deduplicate()
: merge
categories of similar morphology (spelling). Example