Data
BNG(hepatitis,nominal,1000000)

BNG(hepatitis,nominal,1000000)

active ARFF Publicly available Visibility: public Uploaded 09-04-2014 by Jan van Rijn
0 likes downloaded by 4 people , 5 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit

20 features

Class (target)nominal2 unique values
0 missing
AGEnominal3 unique values
0 missing
SEXnominal2 unique values
0 missing
STEROIDnominal2 unique values
0 missing
ANTIVIRALSnominal2 unique values
0 missing
FATIGUEnominal2 unique values
0 missing
MALAISEnominal2 unique values
0 missing
ANOREXIAnominal2 unique values
0 missing
LIVER_BIGnominal2 unique values
0 missing
LIVER_FIRMnominal2 unique values
0 missing
SPLEEN_PALPABLEnominal2 unique values
0 missing
SPIDERSnominal2 unique values
0 missing
ASCITESnominal2 unique values
0 missing
VARICESnominal2 unique values
0 missing
BILIRUBINnominal3 unique values
0 missing
ALK_PHOSPHATEnominal3 unique values
0 missing
SGOTnominal3 unique values
0 missing
ALBUMINnominal3 unique values
0 missing
PROTIMEnominal3 unique values
0 missing
HISTOLOGYnominal2 unique values
0 missing

19 properties

1000000
Number of instances (rows) of the dataset.
20
Number of attributes (columns) of the dataset.
2
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
0
Number of numeric attributes.
20
Number of nominal attributes.
70
Percentage of binary attributes.
0
Percentage of instances having missing values.
0
Percentage of missing values.
0.67
Average class difference between consecutive instances.
0
Percentage of numeric attributes.
0
Number of attributes divided by the number of instances.
100
Percentage of nominal attributes.
79.1
Percentage of instances belonging to the most frequent class.
791048
Number of instances belonging to the most frequent class.
20.9
Percentage of instances belonging to the least frequent class.
208952
Number of instances belonging to the least frequent class.
14
Number of binary attributes.

18 tasks

23 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: Class
0 runs - estimation_procedure: 5 times 2-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: Class
0 runs - estimation_procedure: 10 times 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: Class
0 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: Class
0 runs - estimation_procedure: 33% Holdout set - evaluation_measure: predictive_accuracy - target_feature: Class
0 runs - estimation_procedure: 10-fold Learning Curve - evaluation_measure: predictive_accuracy - target_feature: Class
46 runs - estimation_procedure: Interleaved Test then Train - target_feature: Class
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
Define a new task