Data
BNG(cylinder-bands,nominal,1000000)

BNG(cylinder-bands,nominal,1000000)

active ARFF Publicly available Visibility: public Uploaded 08-04-2014 by Jan van Rijn
0 likes downloaded by 4 people , 4 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit

40 features

band_type (target)nominal2 unique values
0 missing
timestampnominal297 unique values
0 missing
cylinder_numbernominal429 unique values
0 missing
customernominal72 unique values
0 missing
job_numbernominal3 unique values
0 missing
grain_screenednominal3 unique values
0 missing
ink_colornominal2 unique values
0 missing
proof_on_ctd_inknominal3 unique values
0 missing
blade_mfgnominal3 unique values
0 missing
cylinder_divisionnominal2 unique values
0 missing
paper_typenominal4 unique values
0 missing
ink_typenominal3 unique values
0 missing
direct_steamnominal3 unique values
0 missing
solvent_typenominal3 unique values
0 missing
type_on_cylindernominal2 unique values
0 missing
press_typenominal4 unique values
0 missing
pressnominal8 unique values
0 missing
unit_numbernominal3 unique values
0 missing
cylinder_sizenominal4 unique values
0 missing
paper_mill_locationnominal5 unique values
0 missing
plating_tanknominal3 unique values
0 missing
proof_cutnominal3 unique values
0 missing
viscositynominal3 unique values
0 missing
calipernominal21 unique values
0 missing
ink_temperaturenominal3 unique values
0 missing
humifitynominal3 unique values
0 missing
roughnessnominal3 unique values
0 missing
blade_pressurenominal3 unique values
0 missing
varnish_pctnominal3 unique values
0 missing
press_speednominal3 unique values
0 missing
ink_pctnominal3 unique values
0 missing
solvent_pctnominal3 unique values
0 missing
ESA_Voltagenominal3 unique values
0 missing
ESA_Amperagenominal3 unique values
0 missing
waxnominal3 unique values
0 missing
hardenernominal3 unique values
0 missing
roller_durometernominal3 unique values
0 missing
current_densitynominal7 unique values
0 missing
anode_space_rationominal3 unique values
0 missing
chrome_contentnominal3 unique values
0 missing

19 properties

1000000
Number of instances (rows) of the dataset.
40
Number of attributes (columns) of the dataset.
2
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
0
Number of numeric attributes.
40
Number of nominal attributes.
10
Percentage of binary attributes.
0
Percentage of instances having missing values.
0.51
Average class difference between consecutive instances.
0
Percentage of missing values.
0
Number of attributes divided by the number of instances.
0
Percentage of numeric attributes.
57.7
Percentage of instances belonging to the most frequent class.
100
Percentage of nominal attributes.
577023
Number of instances belonging to the most frequent class.
42.3
Percentage of instances belonging to the least frequent class.
422977
Number of instances belonging to the least frequent class.
4
Number of binary attributes.

18 tasks

19 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: band_type
0 runs - estimation_procedure: 5 times 2-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: band_type
0 runs - estimation_procedure: 10 times 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: band_type
0 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: area_under_roc_curve - target_feature: band_type
0 runs - estimation_procedure: 33% Holdout set - evaluation_measure: predictive_accuracy - target_feature: band_type
0 runs - estimation_procedure: 10-fold Learning Curve - evaluation_measure: predictive_accuracy - target_feature: band_type
46 runs - estimation_procedure: Interleaved Test then Train - target_feature: band_type
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
Define a new task