Data
BNG(credit-g,nominal,1000000)

BNG(credit-g,nominal,1000000)

active ARFF Publicly available Visibility: public Uploaded 08-04-2014 by Jan van Rijn
0 likes downloaded by 4 people , 5 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit

21 features

class (target)nominal2 unique values
0 missing
checking_statusnominal4 unique values
0 missing
durationnominal3 unique values
0 missing
credit_historynominal5 unique values
0 missing
purposenominal11 unique values
0 missing
credit_amountnominal3 unique values
0 missing
savings_statusnominal5 unique values
0 missing
employmentnominal5 unique values
0 missing
installment_commitmentnominal3 unique values
0 missing
personal_statusnominal5 unique values
0 missing
other_partiesnominal3 unique values
0 missing
residence_sincenominal3 unique values
0 missing
property_magnitudenominal4 unique values
0 missing
agenominal3 unique values
0 missing
other_payment_plansnominal3 unique values
0 missing
housingnominal3 unique values
0 missing
existing_creditsnominal3 unique values
0 missing
jobnominal4 unique values
0 missing
num_dependentsnominal3 unique values
0 missing
own_telephonenominal2 unique values
0 missing
foreign_workernominal2 unique values
0 missing

19 properties

1000000
Number of instances (rows) of the dataset.
21
Number of attributes (columns) of the dataset.
2
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
0
Number of numeric attributes.
21
Number of nominal attributes.
0
Percentage of instances having missing values.
0.58
Average class difference between consecutive instances.
0
Percentage of missing values.
0
Number of attributes divided by the number of instances.
0
Percentage of numeric attributes.
69.96
Percentage of instances belonging to the most frequent class.
100
Percentage of nominal attributes.
699587
Number of instances belonging to the most frequent class.
30.04
Percentage of instances belonging to the least frequent class.
300413
Number of instances belonging to the least frequent class.
3
Number of binary attributes.
14.29
Percentage of binary attributes.

26 tasks

21 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: class
0 runs - estimation_procedure: 5 times 2-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: class
0 runs - estimation_procedure: 10 times 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: class
0 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: precision - target_feature: class
0 runs - estimation_procedure: 33% Holdout set - evaluation_measure: predictive_accuracy - target_feature: class
0 runs - estimation_procedure: 10-fold Learning Curve - evaluation_measure: predictive_accuracy - target_feature: class
0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class
0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class
0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class
0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class
0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class
0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class
0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class
0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class
47 runs - estimation_procedure: Interleaved Test then Train - target_feature: class
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
Define a new task