Titanic

active
ARFF
Visibility: public. Uploaded 09-11-2018 by Merlin Raabe.

0 likes, downloaded by 1 person, 1 total download, 0 issues, 0 downvotes

The goal is to predict the Fare.
Variable description:
pclass: A proxy for socio-economic status (SES)
1st = Upper
2nd = Middle
3rd = Lower
age: Age is fractional if less than 1. If the age is estimated, it is in the form xx.5
sibsp: The dataset defines family relations in this way...
Sibling = brother, sister, stepbrother, stepsister
Spouse = husband, wife (mistresses and fiances were ignored)
parch: The dataset defines family relations in this way...
Parent = mother, father
Child = daughter, son, stepdaughter, stepson
Some children travelled only with a nanny, therefore parch=0 for them
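
The age conventions above can be checked mechanically. The following is a minimal sketch, assuming ages are stored as floats; the function name `classify_age` and its labels are hypothetical and not part of the dataset. Note that a value such as 0.5 is ambiguous under the stated rules, and this sketch treats every value below 1 as an infant age.

```python
def classify_age(age: float) -> str:
    """Label an age value using the conventions in the variable description."""
    if age < 1:
        # Ages below 1 are fractional by definition (infants).
        return "infant"
    if age % 1 == 0.5:
        # Estimated ages are recorded in the form xx.5.
        return "estimated"
    return "recorded"


if __name__ == "__main__":
    for value in (0.42, 28.5, 30.0):
        print(value, classify_age(value))
```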

Feature | Type | Unique values | Missing values
---|---|---|---
Fare (target) | numeric | 280 | 0
Age | numeric | 98 | 0
Sex | numeric | 2 | 0
sibsp | numeric | 7 | 0
Parch | numeric | 8 | 0
Pclass | numeric | 3 | 0
Embarked | numeric | 3 | 0
X2urvived | numeric | 2 | 0

Second quartile (Median) of mutual information between the nominal attributes and the target attribute.

0.85

Second quartile (Median) of skewness among attributes of the numeric type.

0.85

Second quartile (Median) of standard deviation of attributes of the numeric type.

21.13

Third quartile of kurtosis among attributes of the numeric type.

Minimal mutual information between the nominal attributes and the target attribute.

Maximum mutual information between the nominal attributes and the target attribute.

The minimal number of distinct values among attributes of the nominal type.

Third quartile of mutual information between the nominal attributes and the target attribute.

The maximum number of distinct values among attributes of the nominal type.

-1.18

First quartile of kurtosis among attributes of the numeric type.

9.92

Third quartile of standard deviation of attributes of the numeric type.

Standard deviation of the number of distinct values among attributes of the nominal type.

First quartile of mutual information between the nominal attributes and the target attribute.

-0.32

First quartile of skewness among attributes of the numeric type.

0.56

First quartile of standard deviation of attributes of the numeric type.

Average mutual information between the nominal attributes and the target attribute.

An estimate of the amount of irrelevant information in the attributes regarding the class. Equals (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation.
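
As an illustration of the formula above, here is a minimal sketch that computes the ratio for nominal attributes on toy data. The helper names (`entropy`, `mutual_information`, `noise_to_signal_ratio`) and the toy values are assumptions made for this example; the sketch uses base-2 logarithms and is not necessarily how the page computes the value.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of nominal values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def mutual_information(xs, ys):
    """I(X;Y) = H(X) + H(Y) - H(X,Y) for two nominal sequences."""
    return entropy(xs) + entropy(ys) - entropy(list(zip(xs, ys)))


def noise_to_signal_ratio(attributes, target):
    """(MeanAttributeEntropy - MeanMutualInformation) / MeanMutualInformation."""
    mean_entropy = sum(entropy(a) for a in attributes) / len(attributes)
    mean_mi = sum(mutual_information(a, target) for a in attributes) / len(attributes)
    # Undefined when no attribute shares information with the target.
    return (mean_entropy - mean_mi) / mean_mi


if __name__ == "__main__":
    target = [0, 0, 1, 1]
    informative = [0, 0, 1, 1]   # identical to the target
    irrelevant = [0, 1, 0, 1]    # independent of the target
    print(noise_to_signal_ratio([informative, irrelevant], target))
```

In this toy case both attributes have an entropy of 1 bit, but only one of them carries information about the target, so half of the attribute entropy counts as noise.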

0.22

Second quartile (Median) of kurtosis among attributes of the numeric type.

Average number of distinct values among the attributes of the nominal type.

1

Second quartile (Median) of means among attributes of the numeric type.

Number of attributes needed to optimally describe the class (under the assumption of independence among attributes). Equals ClassEntropy divided by MeanMutualInformation.
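
The ratio described above can be sketched as follows. This is a hedged example on toy data: the function names and the per-attribute mutual information values are hypothetical, the mutual information is assumed to be precomputed in bits, and the class is assumed to be nominal.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of nominal values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def equivalent_number_of_attributes(target, mutual_informations):
    """ClassEntropy divided by MeanMutualInformation."""
    mean_mi = sum(mutual_informations) / len(mutual_informations)
    # Undefined when the mean mutual information is zero.
    return entropy(target) / mean_mi


if __name__ == "__main__":
    target = [0, 0, 1, 1]          # class entropy of 1 bit
    mi_per_attribute = [1.0, 0.0]  # assumed precomputed, in bits
    print(equivalent_number_of_attributes(target, mi_per_attribute))
```

With a class entropy of 1 bit and an average of 0.5 bits of mutual information per attribute, about two such average attributes would be needed to describe the class under the independence assumption.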