OpenML
No data.
87 runs - 0 likes - 5 downloads - 5 reach - 2 impact
295245 instances - 11 features - 5 classes - 0 missing values
No data.
324 runs - 0 likes - 5 downloads - 5 reach - 2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
48 runs - 1 likes - 4 downloads - 5 reach - 2 impact
1000000 instances - 77 features - 10 classes - 0 missing values
Donor: David W. Aha (aha@ics.uci.edu). This database contains 76 attributes, but all published experiments use a subset of 14 of them. In particular, the Cleveland database is the only one…
37 runs - 0 likes - 5 downloads - 5 reach - 1 impact
303 instances - 14 features - 0 classes - 6 missing values
This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics; (b) its assigned insurance risk rating; (c) its normalized losses in use as…
7 runs - 1 likes - 4 downloads - 5 reach - 1 impact
159 instances - 16 features - 0 classes - 0 missing values
This is a family of datasets synthetically generated from a realistic simulation of the dynamics of a Unimation Puma 560 robot arm. There are eight datasets in this family. In this repository we…
2 runs - 0 likes - 5 downloads - 5 reach - 1 impact
8192 instances - 9 features - 0 classes - 0 missing values
This is an artificial data set described in Breiman et al. (1984, p. 238) (with variance 1 instead of 2). Generate the values of the 10 attributes independently using the following probabilities: P(X_1…
2 runs - 1 likes - 4 downloads - 5 reach - 2 impact
40768 instances - 11 features - 0 classes - 0 missing values
No data.
326 runs - 1 likes - 4 downloads - 5 reach - 2 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
73 runs - 0 likes - 5 downloads - 5 reach - 1 impact
1000000 instances - 30 features - 2 classes - 0 missing values
No data.
330 runs - 0 likes - 5 downloads - 5 reach - 2 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
290 runs - 0 likes - 5 downloads - 5 reach - 2 impact
1000000 instances - 77 features - 10 classes - 0 missing values
No data.
163 runs - 0 likes - 5 downloads - 5 reach - 1 impact
1000000 instances - 28 features - 2 classes - 0 missing values
Automated file upload of BNG(ionosphere)
99 runs - 1 likes - 4 downloads - 5 reach - 3 impact
1000000 instances - 35 features - 2 classes - 0 missing values
General Description of Thyroid Disease Databases and Related Files This directory contains 6 databases, corresponding test set, and corresponding documentation. They were left at the University of…
92 runs - 0 likes - 5 downloads - 5 reach - 6 impact
2800 instances - 27 features - 5 classes - 0 missing values
This database was derived from a simple hierarchical decision model originally developed for the demonstration of DEX (M. Bohanec, V. Rajkovic: Expert system for decision making. Sistemica 1(1), pp.…
5819 runs - 0 likes - 5 downloads - 5 reach - 12 impact
1728 instances - 7 features - 4 classes - 0 missing values
No data.
51 runs - 1 likes - 4 downloads - 5 reach - 2 impact
1000000 instances - 48 features - 10 classes - 0 missing values
No data.
71 runs - 0 likes - 5 downloads - 5 reach - 2 impact
1000000 instances - 17 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
117 runs - 0 likes - 5 downloads - 5 reach - 6 impact
39 instances - 4 features - 2 classes - 0 missing values
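The mean-based binarization described in the entry above can be sketched as follows. This is a minimal illustration, not the conversion script actually used for these datasets: the function name `binarize_by_mean` is hypothetical, and because the description is truncated, both the label names ('P' and 'N') and the choice of which side of the mean receives which label are assumptions.

```python
from statistics import mean


def binarize_by_mean(targets, below_label="P", other_label="N"):
    """Convert numeric targets to two nominal classes split at the mean.

    Assumption: values strictly below the mean get `below_label`;
    all other values get `other_label`. The source description is
    truncated, so the real rule may assign the labels differently.
    """
    threshold = mean(targets)
    return [below_label if t < threshold else other_label for t in targets]


# Example: the mean of these four values is 4.0.
labels = binarize_by_mean([1.0, 2.0, 3.0, 10.0])
print(labels)
```

Keeping the labels as parameters makes it easy to swap the assignment if the full dataset description specifies the opposite convention.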
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
103 runs - 0 likes - 5 downloads - 5 reach - 6 impact
107 instances - 13 features - 2 classes - 71 missing values
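The majority-class binarization described in the entry above can be sketched along these lines. It is an illustrative sketch under stated assumptions, not the original conversion code: the function name `binarize_by_majority` is hypothetical, the positive label 'P' comes from the description, and the negative label 'N' is an assumption because the description is truncated.

```python
from collections import Counter


def binarize_by_majority(targets, positive="P", negative="N"):
    """Relabel the most frequent class as `positive`, all others as `negative`.

    Assumption: on a tie, the class that appears first in `targets`
    is treated as the majority (this is how Counter.most_common orders
    equal counts).
    """
    majority, _count = Counter(targets).most_common(1)[0]
    return [positive if t == majority else negative for t in targets]


# Example: "a" occurs twice, so it becomes the positive class.
labels = binarize_by_majority(["a", "b", "a", "c"])
print(labels)
```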
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
106 runs - 0 likes - 5 downloads - 5 reach - 6 impact
76 instances - 46 features - 2 classes - 22 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
721 runs - 0 likes - 5 downloads - 5 reach - 7 impact
226 instances - 70 features - 2 classes - 317 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
732 runs - 0 likes - 5 downloads - 5 reach - 6 impact
63 instances - 32 features - 2 classes - 0 missing values
No data.
219 runs - 0 likes - 5 downloads - 5 reach - 10 impact
414 instances - 6430 features - 9 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
114 runs - 0 likes - 5 downloads - 5 reach - 6 impact
70 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
112 runs - 0 likes - 5 downloads - 5 reach - 6 impact
42 instances - 17 features - 2 classes - 0 missing values
No data.
203 runs - 0 likes - 5 downloads - 5 reach - 10 impact
878 instances - 7455 features - 10 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
50 runs - 0 likes - 5 downloads - 5 reach - 6 impact
95 instances - 10 features - 5 classes - 9 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
110 runs - 0 likes - 5 downloads - 5 reach - 6 impact
42 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
744 runs - 0 likes - 5 downloads - 5 reach - 6 impact
130 instances - 10 features - 2 classes - 97 missing values
No data.
283 runs - 0 likes - 5 downloads - 5 reach - 13 impact
96 instances - 4027 features - 11 classes - 19667 missing values
No data.
296 runs - 0 likes - 5 downloads - 5 reach - 13 impact
96 instances - 4027 features - 9 classes - 19667 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
731 runs - 0 likes - 5 downloads - 5 reach - 13 impact
151 instances - 7 features - 3 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
180 runs - 0 likes - 5 downloads - 5 reach - 14 impact
294 instances - 12 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
721 runs - 0 likes - 5 downloads - 5 reach - 7 impact
412 instances - 9 features - 2 classes - 96 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
722 runs - 0 likes - 5 downloads - 5 reach - 7 impact
285 instances - 8 features - 2 classes - 27 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
698 runs - 0 likes - 5 downloads - 5 reach - 6 impact
36 instances - 23 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
717 runs - 0 likes - 5 downloads - 5 reach - 6 impact
90 instances - 9 features - 2 classes - 3 missing values
No data.
718 runs - 0 likes - 5 downloads - 5 reach - 6 impact
63 instances - 30 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
581 runs - 0 likes - 5 downloads - 5 reach - 6 impact
400 instances - 6 features - 4 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
117 runs - 0 likes - 5 downloads - 5 reach - 6 impact
50 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
707 runs - 0 likes - 5 downloads - 5 reach - 6 impact
52 instances - 25 features - 2 classes - 7 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
727 runs - 0 likes - 5 downloads - 5 reach - 7 impact
205 instances - 26 features - 2 classes - 59 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
708 runs - 0 likes - 5 downloads - 5 reach - 7 impact
365 instances - 4 features - 2 classes - 30 missing values
No data.
697 runs - 0 likes - 5 downloads - 5 reach - 6 impact
89 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
736 runs - 0 likes - 5 downloads - 5 reach - 6 impact
92 instances - 6 features - 2 classes - 26 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
700 runs - 0 likes - 5 downloads - 5 reach - 6 impact
67 instances - 16 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
687 runs - 0 likes - 5 downloads - 5 reach - 6 impact
52 instances - 24 features - 2 classes - 39 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
717 runs - 0 likes - 5 downloads - 5 reach - 7 impact
303 instances - 14 features - 2 classes - 7 missing values
Dataset from `Pattern Recognition and Neural Networks' by B.D. Ripley. Cambridge University Press (1996) ISBN 0-521-46086-7 The background to the datasets is described in section 1.4; this file…
587 runs - 0 likes - 5 downloads - 5 reach - 6 impact
61 instances - 19 features - 4 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
706 runs - 0 likes - 5 downloads - 5 reach - 6 impact
62 instances - 6 features - 2 classes - 0 missing values
No data.
940 runs - 0 likes - 5 downloads - 5 reach - 2 impact
74 instances - 63 features - 4 classes - 0 missing values
No data.
65 runs - 0 likes - 5 downloads - 5 reach - 1 impact
1000000 instances - 30 features - 4 classes - 0 missing values
* Twonorm dataset This is an implementation of Leo Breiman's twonorm example[1]. It is a 20-dimensional, 2-class classification example. Each class is drawn from a multivariate normal distribution…
118 runs - 0 likes - 5 downloads - 5 reach - 6 impact
7400 instances - 21 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
109 runs - 0 likes - 5 downloads - 5 reach - 6 impact
52 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
721 runs - 0 likes - 5 downloads - 5 reach - 6 impact
34 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
686 runs - 0 likes - 5 downloads - 5 reach - 7 impact
782 instances - 9 features - 2 classes - 466 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
683 runs - 0 likes - 5 downloads - 5 reach - 6 impact
60 instances - 11 features - 2 classes - 14 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
735 runs - 0 likes - 5 downloads - 5 reach - 6 impact
47 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
453 runs - 0 likes - 5 downloads - 5 reach - 6 impact
108 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
738 runs - 0 likes - 5 downloads - 5 reach - 6 impact
51 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
101 runs - 0 likes - 5 downloads - 5 reach - 7 impact
1161 instances - 17 features - 2 classes - 256 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
729 runs - 0 likes - 5 downloads - 5 reach - 6 impact
93 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
118 runs - 0 likes - 5 downloads - 5 reach - 6 impact
50 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
737 runs - 0 likes - 5 downloads - 5 reach - 7 impact
303 instances - 14 features - 2 classes - 6 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
764 runs - 0 likes - 5 downloads - 5 reach - 6 impact
55 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
721 runs - 0 likes - 5 downloads - 5 reach - 6 impact
60 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
730 runs - 0 likes - 5 downloads - 5 reach - 6 impact
93 instances - 23 features - 2 classes - 14 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
511 runs - 0 likes - 5 downloads - 5 reach - 7 impact
185 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
723 runs - 0 likes - 5 downloads - 5 reach - 7 impact
418 instances - 19 features - 2 classes - 1239 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
750 runs - 0 likes - 5 downloads - 5 reach - 6 impact
48 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
726 runs - 0 likes - 5 downloads - 5 reach - 6 impact
52 instances - 10 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
680 runs - 0 likes - 5 downloads - 5 reach - 7 impact
1945 instances - 19 features - 2 classes - 1133 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
705 runs - 0 likes - 5 downloads - 5 reach - 7 impact
398 instances - 8 features - 2 classes - 6 missing values
Abstract: A chess endgame data set representing the positions on the board of the white king, the white rook, and the black king. The task is to determine the optimum number of turns required for white…
25 runs - 0 likes - 5 downloads - 5 reach - 6 impact
28056 instances - 7 features - 18 classes - 0 missing values
pie chart 1
102 runs - 0 likes - 5 downloads - 5 reach - 5 impact
705 instances - 38 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
59 runs - 0 likes - 6 downloads - 6 reach - 8 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2866 runs - 0 likes - 6 downloads - 6 reach - 16 impact
546 instances - 10937 features - 2 classes - 0 missing values
pie chart 3
103 runs - 0 likes - 6 downloads - 6 reach - 5 impact
1077 instances - 38 features - 2 classes - 0 missing values
Pizza cutter 3
188 runs - 0 likes - 6 downloads - 6 reach - 6 impact
1043 instances - 38 features - 2 classes - 0 missing values
Costa madre 1
90 runs - 0 likes - 6 downloads - 6 reach - 7 impact
296 instances - 38 features - 2 classes - 0 missing values
* Dataset Title: Robot Execution Failures Data Set * Abstract: This dataset contains force and torque measurements on a robot after failure detection. Each failure is characterized by 15 force/torque…
130 runs - 0 likes - 6 downloads - 6 reach - 5 impact
164 instances - 91 features - 5 classes - 0 missing values
* Title of Database: Spoken Arabic Digit * Abstract: This dataset contains time series of mel-frequency cepstrum coefficients (MFCCs) corresponding to spoken Arabic digits. Includes data from 44 males…
1 runs - 0 likes - 6 downloads - 6 reach - 6 impact
263256 instances - 15 features - 10 classes - 0 missing values
* Title: Thoracic Surgery Data Data Set * Abstract: The data is dedicated to the classification problem related to post-operative life expectancy in lung cancer patients: class 1 - death within…
145 runs - 0 likes - 6 downloads - 6 reach - 6 impact
470 instances - 17 features - 2 classes - 0 missing values
A family of datasets synthetically generated from a simulation of how bank-customers choose their banks. Tasks are based on predicting the fraction of bank customers who leave the bank because of full…
0 runs - 0 likes - 6 downloads - 6 reach - 5 impact
8192 instances - 9 features - 0 classes - 0 missing values
The Computer Activity databases are a collection of computer systems activity measures. The data was collected from a Sun Sparcstation 20/712 with 128 Mbytes of memory running in a multi-user…
0 runs - 0 likes - 6 downloads - 6 reach - 5 impact
8192 instances - 22 features - 0 classes - 0 missing values
Datasets of the Data And Story Library, a project illustrating the use of basic statistical methods, converted to ARFF format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
593 runs - 0 likes - 6 downloads - 6 reach - 6 impact
478 instances - 11 features - 3 classes - 0 missing values
No data.
29 runs - 0 likes - 6 downloads - 6 reach - 2 impact
1000000 instances - 26 features - 7 classes - 0 missing values
This dataset is taken from the Places Rated Almanac, by Richard Boyer and David Savageau, copyrighted and published by Rand McNally. This book order (SBN) number is 0-528-88008-X, and it retails for…
2 runs - 0 likes - 6 downloads - 6 reach - 5 impact
329 instances - 10 features - 0 classes - 0 missing values
S&P Letters Data We collected information on the variables using all the block groups in California from the 1990 Census. In this sample a block group on average includes 1425.5 individuals living in…
0 runs - 0 likes - 6 downloads - 6 reach - 5 impact
20640 instances - 9 features - 0 classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-bodies * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set Information:…
3 runs - 0 likes - 6 downloads - 6 reach - 5 impact
64 instances - 4703 features - 2 classes - 0 missing values
* Dataset: This is a reprocessed version of heart-h (Hungarian), the reprocessed Hungarian heart disease dataset from UCI.
138 runs - 0 likes - 6 downloads - 6 reach - 5 impact
294 instances - 14 features - 5 classes - 0 missing values
* Title: Nursery Database * Abstract: 4-class version of the original Nursery dataset
121 runs - 0 likes - 6 downloads - 6 reach - 6 impact
12958 instances - 9 features - 4 classes - 0 missing values
DOROTHEA is a drug discovery dataset. Chemical compounds represented by structural molecular features must be classified as active (binding to thrombin) or inactive. This is one of 5 datasets of the…
0 runs - 0 likes - 6 downloads - 6 reach - 11 impact
1150 instances - 100001 features - 2 classes - 0 missing values
Data Set Information: The data has been produced using Monte Carlo simulations. The first 21 features (columns 2-22) are kinematic properties measured by the particle detectors in the accelerator. The…
0 runs - 1 likes - 5 downloads - 6 reach - 5 impact
98050 instances - 29 features - 0 classes - 9 missing values
Source: Original Owner: U.S. Census Bureau http://www.census.gov/ United States Department of Commerce Donor: Terran Lane and Ronny Kohavi Data Mining and Visualization Silicon Graphics. terran '@'…
0 runs - 1 likes - 5 downloads - 6 reach - 5 impact
299285 instances - 42 features - classes - 0 missing values
This is a dataset obtained from the StatLib repository. Here is the included description: The data provided are daily stock prices from January 1988 through October 1991, for ten aerospace companies.…
5 runs - 1 likes - 5 downloads - 6 reach - 1 impact
950 instances - 10 features - 0 classes - 0 missing values
This data set is also obtained from the task of controlling an F16 aircraft, although the target variable and attributes are different from the ailerons domain. In this case the goal variable is…
2 runs - 0 likes - 6 downloads - 6 reach - 1 impact
16599 instances - 19 features - 0 classes - 0 missing values
No data.
304 runs - 0 likes - 6 downloads - 6 reach - 2 impact
1000000 instances - 25 features - 10 classes - 0 missing values
Internet Usage Data Data Type multivariate Abstract This data contains general demographic information on internet users in 1997. Sources Original Owner [1]Graphics, Visualization, & Usability Center…
0 runs - 1 likes - 5 downloads - 6 reach - 3 impact
10108 instances - 72 features - 46 classes - 2699 missing values
This is an artificial data set with dependencies between the attribute values. The cases are generated using the following method: X1 : uniformly distributed over [-5,5] X2 : uniformly distributed…
3 runs - 1 likes - 5 downloads - 6 reach - 5 impact
40768 instances - 11 features - 0 classes - 0 missing values