ames-housing

Status: in_preparation. Format: ARFF. Visibility: public. Uploaded 03-11-2018 by Florian Pargent.


A processed version of the 'Ames Iowa Housing' dataset as provided by the make_ames() function in the R package 'AmesHousing' [Max Kuhn (2017). AmesHousing: The Ames Iowa Housing Data. R package version 0.0.3. https://CRAN.R-project.org/package=AmesHousing]. The original data was published in [De Cock, D. (2011). 'Ames, Iowa: Alternative to the Boston Housing Data as an End of Semester Regression Project', Journal of Statistics Education, Volume 19, Number 3]. For a description of the dataset, check out either the documentation of the R package or the original publication. The variable 'Sale_Price' was chosen as the target variable. Note that all factors are unordered in this version of the dataset, as provided by the make_ames() function, in contrast to the version provided by the make_ordinal_ames() function.

81 features

Sale_Price (target): numeric, 1032 unique values, 0 missing
MS_SubClass: nominal, 16 unique values, 0 missing
MS_Zoning: nominal, 7 unique values, 0 missing
Lot_Frontage: numeric, 129 unique values, 0 missing
Lot_Area: numeric, 1960 unique values, 0 missing
Street: nominal, 2 unique values, 0 missing
Alley: nominal, 3 unique values, 0 missing
Lot_Shape: nominal, 4 unique values, 0 missing
Land_Contour: nominal, 4 unique values, 0 missing
Utilities: nominal, 3 unique values, 0 missing
Lot_Config: nominal, 5 unique values, 0 missing
Land_Slope: nominal, 3 unique values, 0 missing
Neighborhood: nominal, 28 unique values, 0 missing
Condition_1: nominal, 9 unique values, 0 missing
Condition_2: nominal, 8 unique values, 0 missing
Bldg_Type: nominal, 5 unique values, 0 missing
House_Style: nominal, 8 unique values, 0 missing
Overall_Qual: nominal, 10 unique values, 0 missing
Overall_Cond: nominal, 9 unique values, 0 missing
Year_Built: numeric, 118 unique values, 0 missing
Year_Remod_Add: numeric, 61 unique values, 0 missing
Roof_Style: nominal, 6 unique values, 0 missing
Roof_Matl: nominal, 8 unique values, 0 missing
Exterior_1st: nominal, 16 unique values, 0 missing
Exterior_2nd: nominal, 17 unique values, 0 missing
Mas_Vnr_Type: nominal, 5 unique values, 0 missing
Mas_Vnr_Area: numeric, 445 unique values, 0 missing
Exter_Qual: nominal, 4 unique values, 0 missing
Exter_Cond: nominal, 5 unique values, 0 missing
Foundation: nominal, 6 unique values, 0 missing
Bsmt_Qual: nominal, 6 unique values, 0 missing
Bsmt_Cond: nominal, 6 unique values, 0 missing
Bsmt_Exposure: nominal, 5 unique values, 0 missing
BsmtFin_Type_1: nominal, 7 unique values, 0 missing
BsmtFin_SF_1: numeric, 8 unique values, 0 missing
BsmtFin_Type_2: nominal, 7 unique values, 0 missing
BsmtFin_SF_2: numeric, 274 unique values, 0 missing
Bsmt_Unf_SF: numeric, 1137 unique values, 0 missing
Total_Bsmt_SF: numeric, 1058 unique values, 0 missing
Heating: nominal, 6 unique values, 0 missing
Heating_QC: nominal, 5 unique values, 0 missing
Central_Air: nominal, 2 unique values, 0 missing
Electrical: nominal, 6 unique values, 0 missing
First_Flr_SF: numeric, 1083 unique values, 0 missing
Second_Flr_SF: numeric, 635 unique values, 0 missing
Low_Qual_Fin_SF: numeric, 36 unique values, 0 missing
Gr_Liv_Area: numeric, 1292 unique values, 0 missing
Bsmt_Full_Bath: numeric, 4 unique values, 0 missing
Bsmt_Half_Bath: numeric, 3 unique values, 0 missing
Full_Bath: numeric, 5 unique values, 0 missing
Half_Bath: numeric, 3 unique values, 0 missing
Bedroom_AbvGr: numeric, 8 unique values, 0 missing
Kitchen_AbvGr: numeric, 4 unique values, 0 missing
Kitchen_Qual: nominal, 5 unique values, 0 missing
TotRms_AbvGrd: numeric, 14 unique values, 0 missing
Functional: nominal, 8 unique values, 0 missing
Fireplaces: numeric, 5 unique values, 0 missing
Fireplace_Qu: nominal, 6 unique values, 0 missing
Garage_Type: nominal, 7 unique values, 0 missing
Garage_Finish: nominal, 4 unique values, 0 missing
Garage_Cars: numeric, 6 unique values, 0 missing
Garage_Area: numeric, 603 unique values, 0 missing
Garage_Qual: nominal, 6 unique values, 0 missing
Garage_Cond: nominal, 6 unique values, 0 missing
Paved_Drive: nominal, 3 unique values, 0 missing
Wood_Deck_SF: numeric, 380 unique values, 0 missing
Open_Porch_SF: numeric, 252 unique values, 0 missing
Enclosed_Porch: numeric, 183 unique values, 0 missing
Three_season_porch: numeric, 31 unique values, 0 missing
Screen_Porch: numeric, 121 unique values, 0 missing
Pool_Area: numeric, 14 unique values, 0 missing
Pool_QC: nominal, 5 unique values, 0 missing
Fence: nominal, 5 unique values, 0 missing
Misc_Feature: nominal, 6 unique values, 0 missing
Misc_Val: numeric, 38 unique values, 0 missing
Mo_Sold: numeric, 12 unique values, 0 missing
Year_Sold: numeric, 5 unique values, 0 missing
Sale_Type: nominal, 10 unique values, 0 missing
Sale_Condition: nominal, 6 unique values, 0 missing
Longitude: numeric, 2776 unique values, 0 missing
Latitude: numeric, 2762 unique values, 0 missing

62 properties

Number of instances (rows) of the dataset: 2930
Number of attributes (columns) of the dataset: 81
Number of distinct values of the target attribute (if it is nominal): 0
Number of missing values in the dataset: 0
Number of instances with at least one value missing: 0
Number of numeric attributes: 35
Number of nominal attributes: 46
Third quartile of means among attributes of the numeric type: 559.07
Maximum mutual information between the nominal attributes and the target attribute: (no value listed)
The minimal number of distinct values among attributes of the nominal type: 2
Percentage of numeric attributes: 43.21
Third quartile of mutual information between the nominal attributes and the target attribute: (no value listed)
The maximum number of distinct values among attributes of the nominal type: 28
Minimum skewness among attributes of the numeric type: -0.6
Percentage of nominal attributes: 56.79
Third quartile of skewness among attributes of the numeric type: 3.96
Maximum skewness among attributes of the numeric type: 22
Minimum standard deviation of attributes of the numeric type: 0.02
First quartile of entropy among attributes: (no value listed)
Third quartile of standard deviation of attributes of the numeric type: 215.19
Maximum standard deviation of attributes of the numeric type: 79886.69
Percentage of instances belonging to the least frequent class: (no value listed)
First quartile of kurtosis among attributes of the numeric type: -0.45
Standard deviation of the number of distinct values among attributes of the nominal type: 4.55
Average entropy of the attributes: (no value listed)
Number of instances belonging to the least frequent class: (no value listed)
First quartile of means among attributes of the numeric type: 2.24
Mean kurtosis among attributes of the numeric type: 45.92
Number of binary attributes: 2
First quartile of mutual information between the nominal attributes and the target attribute: (no value listed)
Mean of means among attributes of the numeric type: 5783.08
First quartile of skewness among attributes of the numeric type: 0.17
Average class difference between consecutive instances: -43326.53
Average mutual information between the nominal attributes and the target attribute: (no value listed)
First quartile of standard deviation of attributes of the numeric type: 0.76
Entropy of the target attribute values: (no value listed)
An estimate of the amount of irrelevant information in the attributes regarding the class, equal to (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation: (no value listed)
Second quartile (Median) of entropy among attributes: (no value listed)
Number of attributes divided by the number of instances: 0.03
Average number of distinct values among the attributes of the nominal type: 6.91
Second quartile (Median) of kurtosis among attributes of the numeric type: 2.16
Number of attributes needed to optimally describe the class (under the assumption of independence among attributes), equal to ClassEntropy divided by MeanMutualInformation: (no value listed)
Mean skewness among attributes of the numeric type: 3.2
Second quartile (Median) of means among attributes of the numeric type: 42.03
Percentage of instances belonging to the most frequent class: (no value listed)
Mean standard deviation of attributes of the numeric type: 2617.72
Second quartile (Median) of mutual information between the nominal attributes and the target attribute: (no value listed)
Number of instances belonging to the most frequent class: (no value listed)
Minimal entropy among attributes: (no value listed)
Second quartile (Median) of skewness among attributes of the numeric type: 0.92
Maximum entropy among attributes: (no value listed)
Minimum kurtosis among attributes of the numeric type: -1.5
Percentage of binary attributes: 2.47
Second quartile (Median) of standard deviation of attributes of the numeric type: 33.5
Third quartile of entropy among attributes: (no value listed)
Maximum kurtosis among attributes of the numeric type: 566.2
Minimum of means among attributes of the numeric type: -93.64
Percentage of instances having missing values: 0
Third quartile of kurtosis among attributes of the numeric type: 17.86
Maximum of means among attributes of the numeric type: 180796.06
Minimal mutual information between the nominal attributes and the target attribute: (no value listed)
Percentage of missing values: 0
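
Several of the properties listed above are simple ratios of the counts given on this page (35 numeric, 46 nominal and 2 binary attributes out of 81, over 2930 instances). The short Python sketch below recomputes them as a sanity check. The variable names and the percentage() helper are illustrative only and are not part of OpenML or of the AmesHousing package; rounding to two decimals is an assumption chosen to match the figures shown here.

```python
# Counts copied from the property list on this page.
n_instances = 2930
n_attributes = 81
n_numeric = 35
n_nominal = 46
n_binary = 2


def percentage(part, whole):
    """Return part/whole as a percentage, rounded to two decimals (illustrative helper)."""
    return round(100.0 * part / whole, 2)


# Ratio-based properties, recomputed from the raw counts.
derived = {
    "Percentage of numeric attributes": percentage(n_numeric, n_attributes),
    "Percentage of nominal attributes": percentage(n_nominal, n_attributes),
    "Percentage of binary attributes": percentage(n_binary, n_attributes),
    "Number of attributes divided by the number of instances": round(
        n_attributes / n_instances, 2
    ),
}

for name, value in derived.items():
    print(f"{name}: {value}")
```

The recomputed values agree with the listed 43.21, 56.79, 2.47 and 0.03, and the numeric and nominal counts add up to the 81 attributes reported.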

1 task

0 runs - estimation_procedure: 50 times Clustering