Data
mu284

mu284

active ARFF Publicly available Visibility: public Uploaded 29-09-2014 by Joaquin Vanschoren
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Author: Source: Unknown - Date unknown Please cite: This file contains the data in "The MU284 Population" from Appendix B of the book "Model Assisted Survey Sampling" by Sarndal, Swensson and Wretman, published by Springer-Verlag, New York, 1992. The data set contains 284 observations on 11 variables, plus a line with variabel names. Please consult the mentioned appendix for more information about this data set. The data were scanned from the book and interpreted with OCR-technique. Please note that errors may occur in such a process. The result was macro-edited against "The Clustered MU284 Population" in Appendix C of the book. Please use the data at your own risk - I take no responsibility for any problems eventual remaining errors will cause you. four typos in the first printing of the book have been corrected: Label 107, ME84 should be 1100, not 1110 Label 141, RMT85 should be 396, not 369 Label 220, ME84 should be 461, not 491 Label 229, ME84 should be 1239, not 1238. The data was submitted to StatLib with the permission of Springer-Verlag (ref: John Kimmel). Esbjorn Ohlsson Stockholm University esbj@matematik.su.se Information about the dataset CLASSTYPE: numeric CLASSINDEX: none specific

10 features

CL (target)numeric50 unique values
0 missing
LABEL (ignore)numeric283 unique values
0 missing
P85numeric69 unique values
0 missing
P75numeric68 unique values
0 missing
RMT85numeric186 unique values
0 missing
CS82numeric25 unique values
0 missing
SS82numeric36 unique values
0 missing
S82numeric19 unique values
0 missing
ME84numeric264 unique values
0 missing
REV84numeric277 unique values
0 missing
REGnumeric8 unique values
0 missing

19 properties

284
Number of instances (rows) of the dataset.
10
Number of attributes (columns) of the dataset.
0
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
10
Number of numeric attributes.
0
Number of nominal attributes.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
0.83
Average class difference between consecutive instances.
0
Percentage of missing values.
0.04
Number of attributes divided by the number of instances.
100
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.

13 tasks

0 runs - estimation_procedure: 10 times 10-fold Crossvalidation - evaluation_measure: mean_absolute_error - target_feature: CL
0 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: mean_absolute_error - target_feature: CL
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
Define a new task