string#

skrub.selectors.string()[source]#

Select columns that have a String data type.

In pandas, object columns containing (only) strings are also selected.

Examples

>>> from skrub import selectors as s
>>> import pandas as pd
>>> df = pd.DataFrame(
...     dict(
...         os=pd.Series(['A', 'B']),
...         o=pd.Series(['A', 10]),
...         s=pd.Series(['A', 'B']).convert_dtypes(),
...         c=pd.Series(['A', 'B'], dtype="category"),
...     )
... )
>>> df
  os   o  s  c
0  A   A  A  A
1  B  10  B  B
>>> df.dtypes
os            object
o             object
s     string...
c           category
dtype: object
>>> s.select(df, s.string())
  os  s
0  A  A
1  B  B
>>> s.select(df, s.string() | s.categorical())
  os  s  c
0  A  A  A
1  B  B  B