| | pclass | survived | name | sex | age | sibsp | parch | ticket | fare | cabin | embarked | boat | body | home.dest |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 1 | Allen, Miss. Elisabeth Walton | female | 29.0 | 0 | 0 | 24160 | 211. | B5 | S | 2 | | St Louis, MO |
| 1 | 1 | 1 | Allison, Master. Hudson Trevor | male | 0.917 | 1 | 2 | 113781 | 152. | C22 C26 | S | 11 | | Montreal, PQ / Chesterville, ON |
| 2 | 1 | 0 | Allison, Miss. Helen Loraine | female | 2.00 | 1 | 2 | 113781 | 152. | C22 C26 | S | | | Montreal, PQ / Chesterville, ON |
| 3 | 1 | 0 | Allison, Mr. Hudson Joshua Creighton | male | 30.0 | 1 | 2 | 113781 | 152. | C22 C26 | S | | 135. | Montreal, PQ / Chesterville, ON |
| 4 | 1 | 0 | Allison, Mrs. Hudson J C (Bessie Waldo Daniels) | female | 25.0 | 1 | 2 | 113781 | 152. | C22 C26 | S | | | Montreal, PQ / Chesterville, ON |
| 1,304 | 3 | 0 | Zabour, Miss. Hileni | female | 14.5 | 1 | 0 | 2665 | 14.5 | | C | | 328. | |
| 1,305 | 3 | 0 | Zabour, Miss. Thamine | female | | 1 | 0 | 2665 | 14.5 | | C | | | |
| 1,306 | 3 | 0 | Zakarian, Mr. Mapriededer | male | 26.5 | 0 | 0 | 2656 | 7.22 | | C | | 304. | |
| 1,307 | 3 | 0 | Zakarian, Mr. Ortin | male | 27.0 | 0 | 0 | 2670 | 7.22 | | C | | | |
| 1,308 | 3 | 0 | Zimmerman, Mr. Leo | male | 29.0 | 0 | 0 | 315082 | 7.88 | | S | | | |
pclass (Int64DType)
- Null values: 0 (0.0%)
- Unique values: 3 (0.2%)
- Mean ± Std: 2.29 ± 0.838
- Median ± IQR: 3 ± 1
- Min | Max: 1 | 3

survived (CategoricalDtype)
- Null values: 0 (0.0%)
- Unique values: 2 (0.2%)

name (ObjectDType)
- Null values: 0 (0.0%)
- Unique values: 1,307 (99.8%). This column has a high cardinality (> 40).

sex (CategoricalDtype)
- Null values: 0 (0.0%)
- Unique values: 2 (0.2%)

age (Float64DType)
- Null values: 263 (20.1%)
- Unique values: 98 (7.5%). This column has a high cardinality (> 40).
- Mean ± Std: 29.9 ± 14.4
- Median ± IQR: 28.0 ± 18.0
- Min | Max: 0.167 | 80.0

sibsp (Int64DType)
- Null values: 0 (0.0%)
- Unique values: 7 (0.5%)
- Mean ± Std: 0.499 ± 1.04
- Median ± IQR: 0 ± 1
- Min | Max: 0 | 8

parch (Int64DType)
- Null values: 0 (0.0%)
- Unique values: 8 (0.6%)
- Mean ± Std: 0.385 ± 0.866
- Median ± IQR: 0 ± 0
- Min | Max: 0 | 9

ticket (ObjectDType)
- Null values: 0 (0.0%)
- Unique values: 929 (71.0%). This column has a high cardinality (> 40).

fare (Float64DType)
- Null values: 1 (< 0.1%)
- Unique values: 281 (21.5%). This column has a high cardinality (> 40).
- Mean ± Std: 33.3 ± 51.8
- Median ± IQR: 14.5 ± 23.4
- Min | Max: 0.00 | 512.

cabin (ObjectDType)
- Null values: 1,014 (77.5%)
- Unique values: 186 (14.2%). This column has a high cardinality (> 40).

embarked (CategoricalDtype)
- Null values: 2 (0.2%)
- Unique values: 3 (0.2%)

boat (ObjectDType)
- Null values: 823 (62.9%)
- Unique values: 27 (2.1%)

body (Float64DType)
- Null values: 1,188 (90.8%)
- Unique values: 121 (9.2%). This column has a high cardinality (> 40).
- Mean ± Std: 161. ± 97.7
- Median ± IQR: 155. ± 184.
- Min | Max: 1.00 | 328.

home.dest (ObjectDType)
- Null values: 564 (43.1%)
- Unique values: 369 (28.2%). This column has a high cardinality (> 40).
| Column | Column name | dtype | Is sorted | Null values | Unique values | Mean | Std | Min | Median | Max |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | pclass | Int64DType | True | 0 (0.0%) | 3 (0.2%) | 2.29 | 0.838 | 1 | 3 | 3 |
| 1 | survived | CategoricalDtype | False | 0 (0.0%) | 2 (0.2%) | |||||
| 2 | name | ObjectDType | False | 0 (0.0%) | 1307 (99.8%) | |||||
| 3 | sex | CategoricalDtype | False | 0 (0.0%) | 2 (0.2%) | |||||
| 4 | age | Float64DType | False | 263 (20.1%) | 98 (7.5%) | 29.9 | 14.4 | 0.167 | 28.0 | 80.0 |
| 5 | sibsp | Int64DType | False | 0 (0.0%) | 7 (0.5%) | 0.499 | 1.04 | 0 | 0 | 8 |
| 6 | parch | Int64DType | False | 0 (0.0%) | 8 (0.6%) | 0.385 | 0.866 | 0 | 0 | 9 |
| 7 | ticket | ObjectDType | False | 0 (0.0%) | 929 (71.0%) | |||||
| 8 | fare | Float64DType | False | 1 (< 0.1%) | 281 (21.5%) | 33.3 | 51.8 | 0.00 | 14.5 | 512. |
| 9 | cabin | ObjectDType | False | 1014 (77.5%) | 186 (14.2%) | |||||
| 10 | embarked | CategoricalDtype | False | 2 (0.2%) | 3 (0.2%) | |||||
| 11 | boat | ObjectDType | False | 823 (62.9%) | 27 (2.1%) | |||||
| 12 | body | Float64DType | False | 1188 (90.8%) | 121 (9.2%) | 161. | 97.7 | 1.00 | 155. | 328. |
| 13 | home.dest | ObjectDType | False | 564 (43.1%) | 369 (28.2%) | | | | | |
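
The percentages in the "Null values" and "Unique values" columns appear to be each count divided by the total number of rows. The sketch below reproduces a few of them under two assumptions that the report does not state: the row count is 1,309 (inferred from the index running from 0 to 1,308 in the preview), and tiny non-zero shares are displayed as `< 0.1%` (inferred from the `fare` row). The helper name `share` is hypothetical and is not part of skrub.

```python
def share(count: int, n_rows: int) -> str:
    """Format count / n_rows as a percentage with one decimal place."""
    pct = 100 * count / n_rows
    # Assumption: non-zero shares below 0.1% are shown as "< 0.1%",
    # as in the fare row of the table above.
    if 0 < pct < 0.1:
        return "< 0.1%"
    return f"{pct:.1f}%"


# Assumption: row count inferred from the preview index (0 to 1,308).
N_ROWS = 1309

print("age nulls:", share(263, N_ROWS))
print("cabin nulls:", share(1014, N_ROWS))
print("fare nulls:", share(1, N_ROWS))
print("name uniques:", share(1307, N_ROWS))
```

Note that the "Unique values" percentage is taken over all rows, not over non-null rows, which is why a column such as `body` can have 9.2% unique values while 90.8% of its entries are null.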
| Column 1 | Column 2 | Cramér's V | Pearson's Correlation |
|---|---|---|---|
| survived | boat | 0.948 | |
| sibsp | ticket | 0.619 | |
| pclass | cabin | 0.599 | |
| pclass | home.dest | 0.537 | |
| survived | sex | 0.529 | |
| sex | boat | 0.506 | |
| pclass | fare | 0.446 | -0.559 |
| pclass | boat | 0.422 | |
| fare | cabin | 0.409 | |
| sibsp | home.dest | 0.395 | |
| parch | ticket | 0.368 | |
| pclass | age | 0.343 | -0.408 |
| ticket | home.dest | 0.341 | |
| cabin | home.dest | 0.327 | |
| pclass | survived | 0.313 | |
| survived | cabin | 0.309 | |
| ticket | fare | 0.308 | |
| ticket | cabin | 0.284 | |
| pclass | embarked | 0.284 | |
| fare | boat | 0.274 | |
The table above shows the strength of association between the most similar columns in the dataframe.
Cramér's V statistic is a number between 0 and 1.
When it is close to 1, the columns are strongly associated: they contain similar information.
In this case, one of them may be redundant, and for some models (such as linear models) it might be beneficial to remove it.
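
To make the statistic concrete, here is a minimal sketch of the textbook (uncorrected) Cramér's V for two categorical columns, computed from the chi-squared statistic of their contingency table. It is not skrub's implementation: the report does not say how numeric columns are binned or whether a bias correction is applied, so values from this sketch need not match the table. The function name `cramers_v` and the toy inputs are assumptions made for illustration.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length sequences of labels."""
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    # Keep only rows where both values are present (None marks a null).
    pairs = [(x, y) for x, y in zip(xs, ys) if x is not None and y is not None]
    n = len(pairs)
    joint = Counter(pairs)
    row_totals = Counter(x for x, _ in pairs)
    col_totals = Counter(y for _, y in pairs)
    k = min(len(row_totals), len(col_totals))
    if n == 0 or k < 2:
        return 0.0  # V is undefined here; 0.0 is a convention of this sketch
    # Chi-squared statistic: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * (k - 1)))


# Toy data: one column fully determines the other, so V is 1.
print(cramers_v(["a", "b", "a", "b"], ["x", "y", "x", "y"]))
# Toy data: every combination is equally frequent, so V is 0.
print(cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"]))
```

A value near 1, such as the 0.948 reported for `survived` and `boat`, means that knowing one column almost determines the other, which is the redundancy the text warns about.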