Data
BNG(sick,nominal,1000000)

BNG(sick,nominal,1000000)

active ARFF Publicly available Visibility: public Uploaded 08-04-2014 by Jan van Rijn
0 likes downloaded by 5 people , 5 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit

30 features

Class (target)nominal2 unique values
0 missing
agenominal3 unique values
0 missing
sexnominal2 unique values
0 missing
on_thyroxinenominal2 unique values
0 missing
query_on_thyroxinenominal2 unique values
0 missing
on_antithyroid_medicationnominal2 unique values
0 missing
sicknominal2 unique values
0 missing
pregnantnominal2 unique values
0 missing
thyroid_surgerynominal2 unique values
0 missing
I131_treatmentnominal2 unique values
0 missing
query_hypothyroidnominal2 unique values
0 missing
query_hyperthyroidnominal2 unique values
0 missing
lithiumnominal2 unique values
0 missing
goitrenominal2 unique values
0 missing
tumornominal2 unique values
0 missing
hypopituitarynominal2 unique values
0 missing
psychnominal2 unique values
0 missing
TSH_measurednominal2 unique values
0 missing
TSHnominal3 unique values
0 missing
T3_measurednominal2 unique values
0 missing
T3nominal3 unique values
0 missing
TT4_measurednominal2 unique values
0 missing
TT4nominal3 unique values
0 missing
T4U_measurednominal2 unique values
0 missing
T4Unominal3 unique values
0 missing
FTI_measurednominal2 unique values
0 missing
FTInominal3 unique values
0 missing
TBG_measurednominal1 unique values
0 missing
TBGnominal1 unique values
0 missing
referral_sourcenominal5 unique values
0 missing

19 properties

1000000
Number of instances (rows) of the dataset.
30
Number of attributes (columns) of the dataset.
2
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
0
Number of numeric attributes.
30
Number of nominal attributes.
21
Number of binary attributes.
70
Percentage of binary attributes.
0
Percentage of instances having missing values.
0.89
Average class difference between consecutive instances.
0
Percentage of missing values.
0
Number of attributes divided by the number of instances.
0
Percentage of numeric attributes.
93.88
Percentage of instances belonging to the most frequent class.
100
Percentage of nominal attributes.
938761
Number of instances belonging to the most frequent class.
6.12
Percentage of instances belonging to the least frequent class.
61239
Number of instances belonging to the least frequent class.

18 tasks

23 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: Class
0 runs - estimation_procedure: 5 times 2-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: Class
0 runs - estimation_procedure: 10 times 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: Class
0 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: Class
0 runs - estimation_procedure: 33% Holdout set - evaluation_measure: predictive_accuracy - target_feature: Class
0 runs - estimation_procedure: 10-fold Learning Curve - evaluation_measure: predictive_accuracy - target_feature: Class
50 runs - estimation_procedure: Interleaved Test then Train - target_feature: Class
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
Define a new task