Data
Filter results by:
No data.
29 runs0 likes1 downloads1 reach13 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach13 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach13 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
143 runs0 likes4 downloads4 reach12 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
66 runs0 likes2 downloads2 reach12 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
206 runs0 likes3 downloads3 reach12 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
67 runs0 likes2 downloads2 reach12 impact
1000000 instances - 39 features - 6 classes - 0 missing values
This is a preprocessed version of the anneal dataset (version 1). All missing values are treated as a nominal value with label '?'. (Quotes for clarity). Because this is not good…
0 runs0 likes0 downloads0 reach2 impact
898 instances - 39 features - 5 classes - 0 missing values
Data from https://doi.org/10.5281/zenodo.269636
0 runs0 likes5 downloads5 reach14 impact
4758 instances - 39 features - classes - 0 missing values
Automated file upload of BNG(anneal)
100 runs0 likes3 downloads3 reach13 impact
1000000 instances - 39 features - 6 classes - 0 missing values
The original Annealing dataset from UCI. The exact meaning of the features and classes is largely unknown. Annealing, in metallurgy and materials science, is a heat treatment that alters the physical…
13779 runs0 likes16 downloads16 reach13 impact
898 instances - 39 features - 5 classes - 22175 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
712 runs0 likes8 downloads8 reach15 impact
898 instances - 39 features - 2 classes - 22175 missing values
One of the NASA Metrics Data Program defect data sets. The specific type of software is unknown. Data comes from McCabe and Halstead features extractors of source code. These features were defined in…
815 runs0 likes15 downloads15 reach18 impact
9466 instances - 39 features - 2 classes - 0 missing values
Kung chi
1 runs0 likes4 downloads4 reach12 impact
123 instances - 40 features - 2 classes - 0 missing values
knugget chase 3
0 runs0 likes2 downloads2 reach12 impact
194 instances - 40 features - 2 classes - 0 missing values
Mind Cave 2
0 runs0 likes3 downloads3 reach11 impact
125 instances - 40 features - 2 classes - 0 missing values
### Description Cylinder bands UCI dataset - Process delays known as cylinder banding in rotogravure printing were substantially mitigated using control rules discovered by decision tree induction.…
21477 runs0 likes8 downloads8 reach26 impact
540 instances - 40 features - 2 classes - 999 missing values
No data.
65 runs0 likes4 downloads4 reach10 impact
1000000 instances - 40 features - 2 classes - 0 missing values
No data.
65 runs0 likes3 downloads3 reach9 impact
1000000 instances - 40 features - 2 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
22 instances - 40 features - 0 classes - 0 missing values
ARFF Training Data
0 runs0 likes0 downloads0 reach0 impact
177640 instances - 40 features - classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. The specific type of software is unknown. Data comes from McCabe and Halstead features extractors of source code. These features were defined in…
777 runs0 likes9 downloads9 reach15 impact
458 instances - 40 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. The specific type of software is unknown. Data comes from McCabe and Halstead features extractors of source code. These features were defined in…
772 runs0 likes10 downloads10 reach15 impact
161 instances - 40 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach9 impact
1000000 instances - 41 features - 0 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-250-drift-au6-cd1-500 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and…
11011 runs0 likes9 downloads9 reach47 impact
750 instances - 41 features - 8 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-cd1-400 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity…
144 runs0 likes3 downloads3 reach13 impact
400 instances - 41 features - 8 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-1000 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of…
11010 runs0 likes16 downloads16 reach47 impact
1000 instances - 41 features - 8 classes - 0 missing values
No data.
307 runs0 likes3 downloads3 reach12 impact
1000000 instances - 41 features - 3 classes - 0 missing values
No data.
63 runs0 likes2 downloads2 reach12 impact
1000000 instances - 41 features - 3 classes - 0 missing values
This data set addresses a control problem, namely flying a F16 aircraft. The attributes describe the status of the aeroplane, while the goal is to predict the control action on the ailerons of the…
0 runs0 likes6 downloads6 reach14 impact
13750 instances - 41 features - 0 classes - 0 missing values
One of two multivariate regression data sets from paper industry, from an experiment at the paper plant Saugbruksforeningen, Norway. They have been described and analysed in: Aldrin, M. (1996),…
0 runs0 likes0 downloads0 reach13 impact
30 instances - 41 features - 0 classes - 0 missing values
####1. Summary This database was generated by the Laboratory of Image Processing and Pattern Recognition (INPG-LTIRF) in the development of the Esprit project ELENA No. 6891 and the Esprit working…
20229 runs0 likes13 downloads13 reach18 impact
5500 instances - 41 features - 11 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
602 runs1 likes12 downloads13 reach15 impact
13750 instances - 41 features - 2 classes - 0 missing values
Generator generating 3 classes of waves. Each class is generated from a combination of 2 of 3 "base" waves. For details, see Breiman,L., Friedman,J.H., Olshen,R.A., and Stone,C.J. (1984).…
19675 runs1 likes53 downloads54 reach12 impact
5000 instances - 41 features - 3 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
778 runs0 likes9 downloads9 reach15 impact
5000 instances - 41 features - 2 classes - 0 missing values
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) Data set for KDD Cup 1999 Modified by TunedIT (converted to ARFF format)…
4 runs1 likes21 downloads22 reach15 impact
4898431 instances - 42 features - 23 classes - 0 missing values
QSAR biodegradation Data Set * Abstract: Data set containing values for 41 attributes (molecular descriptors) used to classify 1055 chemicals into 2 classes (ready and not ready biodegradable). *…
267507 runs1 likes23 downloads24 reach28 impact
1055 instances - 42 features - 2 classes - 0 missing values
This is a 10% stratified subsample of the data from the 1999 ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php). Modified by TunedIT (converted to ARFF format)…
25 runs1 likes35 downloads36 reach15 impact
494020 instances - 42 features - 23 classes - 0 missing values
Source: Original Owner: U.S. Census Bureau http://www.census.gov/ United States Department of Commerce Donor: Terran Lane and Ronny Kohavi Data Mining and Visualization Silicon Graphics. terran '@'…
0 runs1 likes9 downloads10 reach15 impact
299285 instances - 42 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 42 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
6435 instances - 42 features - classes - 0 missing values
INTRUSION DETECTOR LEARNING Software to detect network intrusions protects a computer network from unauthorized users, including perhaps insiders. The intrusion detector learning task is to build a…
0 runs1 likes0 downloads1 reach1 impact
4898431 instances - 42 features - 23 classes - 0 missing values
The dataset contains all the statistics for each player from 2008 to 2016.
0 runs0 likes1 downloads1 reach8 impact
183978 instances - 42 features - classes - 47301 missing values
This is a part of collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
125 instances - 42 features - classes - 362 missing values
This is a part of collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
114 instances - 42 features - classes - 562 missing values
This is a part of collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
122 instances - 42 features - classes - 906 missing values
This is a part of collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
76 instances - 42 features - classes - 574 missing values
This is a part of collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
127 instances - 42 features - classes - 722 missing values
This is a part of collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
126 instances - 42 features - classes - 978 missing values
This is a part of collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
126 instances - 42 features - classes - 446 missing values
This is a part of collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
127 instances - 42 features - classes - 788 missing values
This version has feature names based on https://www2.1010data.com/documentationcenter/beta/Tutorials/MachineLearningExamples/CensusIncomeDataSet.html Missing data is also properly encoded in this…
0 runs0 likes1 downloads1 reach0 impact
199523 instances - 42 features - 2 classes - 415717 missing values
This database contains all legal 8-ply positions in the game of connect-4 in which neither player has won yet, and in which the next move is not forced. Attributes represent board positions on a 6x6…
9607 runs0 likes10 downloads10 reach26 impact
67557 instances - 43 features - 3 classes - 0 missing values
This dataset contains traffic violation information from all electronic traffic violations issued in the County. Any information that can be used to uniquely identify the vehicle, the vehicle owner or…
0 runs0 likes1 downloads1 reach8 impact
1578154 instances - 43 features - 4 classes - 8006541 missing values
No data.
43 runs0 likes2 downloads2 reach9 impact
1000000 instances - 45 features - 2 classes - 0 missing values
No data.
47 runs0 likes1 downloads1 reach9 impact
1000000 instances - 45 features - 2 classes - 0 missing values
This is a corrected version of the previous data file in version 1, which contained a dataset (349 instances) incorrectly merged from the original training and test sets available on UCI (there are…
0 runs0 likes3 downloads3 reach12 impact
267 instances - 45 features - 2 classes - 0 missing values
Modified version for the automl benchmark. Regroups information for about 7800 different US colleges. Including geographical information, stats about the population attending and post graduation…
0 runs0 likes0 downloads0 reach0 impact
7063 instances - 45 features - 0 classes - 104249 missing values
)), [PMLB](https://github.com/EpistasisLab/penn-ml-benchmarks/tree/master/datasets/classification/tokyo1) This is Performance co-pilot (PCP) data for the Tokyo server at Silicon Graphics International…
37 runs0 likes1 downloads1 reach22 impact
959 instances - 45 features - 2 classes - 0 missing values
The BoT-IoT dataset was created by designing a realistic network environment in the Cyber Range Lab of The center of UNSW Canberra Cyber. The environment incorporates a combination of normal and…
0 runs0 likes0 downloads0 reach9 impact
3668522 instances - 45 features - 0 classes - 0 missing values
# Achieved Frames per Second (FPS) in video games This dataset contains FPS measurement of video games executed on computers. Each row of the dataset describes the outcome of FPS measurement (outcome…
0 runs0 likes0 downloads0 reach1 impact
425833 instances - 45 features - 0 classes - 1299988 missing values
SPECTF heart data This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks. NOTE: See the…
1103 runs0 likes12 downloads12 reach15 impact
349 instances - 45 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
106 runs0 likes5 downloads5 reach14 impact
76 instances - 45 features - 2 classes - 22 missing values
These data are estimated correlations between daily 3 p.m. wind measurements during September and October 1997 for a network of 45 stations in the Sydney region. The first column below gives a list of…
0 runs0 likes0 downloads0 reach11 impact
45 instances - 47 features - classes - 0 missing values
Version with corrected feature types. 'PrivacySuppressed' are converted to None. Regroups information for about 7800 different US colleges. Including geographical information, stats about the…
0 runs0 likes0 downloads0 reach0 impact
7063 instances - 47 features - 0 classes - 104305 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
12 runs0 likes0 downloads0 reach14 impact
4704 instances - 47 features - 3 classes - 0 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
11 runs0 likes0 downloads0 reach14 impact
4704 instances - 47 features - 3 classes - 0 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
12 runs0 likes0 downloads0 reach14 impact
2351 instances - 47 features - 2 classes - 0 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
15 runs0 likes0 downloads0 reach14 impact
4704 instances - 47 features - 3 classes - 0 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
11 runs0 likes1 downloads1 reach13 impact
44819 instances - 47 features - 3 classes - 10584 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
11 runs0 likes0 downloads0 reach14 impact
5880 instances - 47 features - 3 classes - 3528 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
11 runs0 likes0 downloads0 reach14 impact
5880 instances - 47 features - 3 classes - 3528 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
11 runs0 likes0 downloads0 reach14 impact
4704 instances - 47 features - 3 classes - 0 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
10 runs0 likes0 downloads0 reach14 impact
3660 instances - 47 features - 2 classes - 0 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
10 runs0 likes0 downloads0 reach14 impact
5880 instances - 47 features - 3 classes - 3528 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
10 runs0 likes0 downloads0 reach14 impact
2352 instances - 47 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
729 runs0 likes9 downloads9 reach14 impact
45 instances - 47 features - 2 classes - 0 missing values
No data.
52 runs0 likes3 downloads3 reach12 impact
1000000 instances - 48 features - 10 classes - 0 missing values
No data.
51 runs1 likes4 downloads5 reach12 impact
1000000 instances - 48 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
34558 runs0 likes23 downloads23 reach12 impact
2000 instances - 48 features - 10 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
792 runs0 likes8 downloads8 reach15 impact
2000 instances - 48 features - 2 classes - 0 missing values
This is a commercial application described in Weiss & Indurkhya (1995). The data describes a telecommunication problem. No further information is available. Characteristics: (10000+5000) cases, 49…
2 runs0 likes4 downloads4 reach11 impact
15000 instances - 49 features - 0 classes - 0 missing values
__Major change w.r.t. version 1: updated data type of binary variables to factor type.__ Dataset from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch), which…
0 runs0 likes1 downloads1 reach10 impact
4562 instances - 49 features - classes - 0 missing values
ef f
0 runs0 likes0 downloads0 reach9 impact
4 instances - 49 features - classes - 0 missing values
rrvrf 4rr
0 runs0 likes0 downloads0 reach9 impact
4 instances - 49 features - classes - 0 missing values
efe def
0 runs0 likes0 downloads0 reach9 impact
4 instances - 49 features - classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
1 runs0 likes1 downloads1 reach17 impact
4147 instances - 49 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
624 runs0 likes10 downloads10 reach15 impact
15000 instances - 49 features - 2 classes - 0 missing values
sd vfv
0 runs0 likes0 downloads0 reach7 impact
4 instances - 50 features - 2 classes - 0 missing values
r rg
0 runs0 likes0 downloads0 reach8 impact
4 instances - 50 features - classes - 0 missing values
dd ref
0 runs0 likes0 downloads0 reach7 impact
4 instances - 50 features - classes - 0 missing values
Regroups information for about 7800 different US colleges. Including geographical information, stats about the population attending and post graduation career earnings.
0 runs0 likes1 downloads1 reach9 impact
7063 instances - 50 features - 0 classes - 125494 missing values
This data has been prepared to analyze factors related to readmission as well as other outcomes pertaining to patients with diabetes. The data are submitted on behalf of the Center for Clinical and…
0 runs2 likes16 downloads18 reach16 impact
101766 instances - 50 features - 3 classes - 0 missing values
Oil dataset Past Usage: 1. Kubat, M., Holte, R.,
204 runs3 likes19 downloads22 reach25 impact
937 instances - 50 features - 2 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
14 instances - 51 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
500 instances - 51 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
250 instances - 51 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
500 instances - 51 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes1 downloads1 reach13 impact
1000 instances - 51 features - 0 classes - 0 missing values
This dataset is taken from the MiniBooNE experiment and is used to distinguish electron neutrinos (signal) from muon neutrinos (background). This dataset is ordered. It first contains all signal…
12 runs0 likes4 downloads4 reach13 impact
130064 instances - 51 features - 2 classes - 0 missing values