OpenML
Filter results by:
No data.
296 runs0 likes5 downloads5 reach13 impact
96 instances - 4027 features - 9 classes - 19667 missing values
No data.
283 runs0 likes5 downloads5 reach13 impact
96 instances - 4027 features - 11 classes - 19667 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
581 runs0 likes5 downloads5 reach6 impact
400 instances - 6 features - 4 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
117 runs0 likes5 downloads5 reach6 impact
50 instances - 7 features - 2 classes - 0 missing values
Dataset from `Pattern Recognition and Neural Networks' by B.D. Ripley. Cambridge University Press (1996) ISBN 0-521-46086-7 The background to the datasets is described in section 1.4; this file…
587 runs0 likes5 downloads5 reach6 impact
61 instances - 19 features - 4 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
705 runs0 likes5 downloads5 reach7 impact
398 instances - 8 features - 2 classes - 6 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
721 runs0 likes5 downloads5 reach6 impact
34 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
686 runs0 likes5 downloads5 reach7 impact
782 instances - 9 features - 2 classes - 466 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
730 runs0 likes5 downloads5 reach6 impact
93 instances - 23 features - 2 classes - 14 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
721 runs0 likes5 downloads5 reach6 impact
60 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
683 runs0 likes5 downloads5 reach6 impact
60 instances - 11 features - 2 classes - 14 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
764 runs0 likes5 downloads5 reach6 impact
55 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
737 runs0 likes5 downloads5 reach7 impact
303 instances - 14 features - 2 classes - 6 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
726 runs0 likes5 downloads5 reach6 impact
52 instances - 10 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
750 runs0 likes5 downloads5 reach6 impact
48 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
706 runs0 likes5 downloads5 reach6 impact
62 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
109 runs0 likes5 downloads5 reach6 impact
52 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
453 runs0 likes5 downloads5 reach6 impact
108 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
735 runs0 likes5 downloads5 reach6 impact
47 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
738 runs0 likes5 downloads5 reach6 impact
51 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
110 runs0 likes5 downloads5 reach6 impact
42 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
106 runs0 likes5 downloads5 reach6 impact
76 instances - 46 features - 2 classes - 22 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
103 runs0 likes5 downloads5 reach6 impact
107 instances - 13 features - 2 classes - 71 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
732 runs0 likes5 downloads5 reach6 impact
63 instances - 32 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
721 runs0 likes5 downloads5 reach7 impact
226 instances - 70 features - 2 classes - 317 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs1 likes5 downloads6 reach7 impact
452 instances - 280 features - 2 classes - 408 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
722 runs0 likes5 downloads5 reach7 impact
285 instances - 8 features - 2 classes - 27 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
698 runs0 likes5 downloads5 reach6 impact
36 instances - 23 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
708 runs0 likes5 downloads5 reach7 impact
365 instances - 4 features - 2 classes - 30 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
707 runs0 likes5 downloads5 reach6 impact
52 instances - 25 features - 2 classes - 7 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
727 runs0 likes5 downloads5 reach7 impact
205 instances - 26 features - 2 classes - 59 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
717 runs0 likes5 downloads5 reach7 impact
303 instances - 14 features - 2 classes - 7 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
687 runs0 likes5 downloads5 reach6 impact
52 instances - 24 features - 2 classes - 39 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
700 runs0 likes5 downloads5 reach6 impact
67 instances - 16 features - 2 classes - 0 missing values
No data.
311 runs0 likes5 downloads5 reach2 impact
1000000 instances - 10 features - 2 classes - 0 missing values
No data.
307 runs0 likes5 downloads5 reach2 impact
1000000 instances - 4 features - 2 classes - 0 missing values
Normalized form of codrna (351) Andrew V Uzilov, Joshua M Keegan, and David H Mathews. Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change. BMC…
309 runs0 likes5 downloads5 reach1 impact
488565 instances - 9 features - 2 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: scaled to [-1,1]
0 runs0 likes5 downloads5 reach5 impact
270 instances - 14 features - 0 classes - 0 missing values
wind daily average wind speeds for 1961-1978 at 12 synoptic meteorological stations in the Republic of Ireland (Haslett and raftery 1989). These data were analyzed in detail in the following article:…
0 runs0 likes5 downloads5 reach5 impact
6574 instances - 15 features - 0 classes - 0 missing values
The Boston house-price data of Harrison, D. and Rubinfeld, D.L. 'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, vol.5, 81-102, 1978. Used in Belsley, Kuh & Welsch,…
6 runs0 likes5 downloads5 reach9 impact
506 instances - 14 features - 0 classes - 0 missing values
Data from the RSCTC 2010 Discovery Challenge. All datasets contain between 100 and 400 samples, characterized by values of 20,000 - 65,000 attributes. Samples are assigned to several (2-10) classes.…
48 runs0 likes5 downloads5 reach7 impact
159 instances - 61360 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
50 runs0 likes5 downloads5 reach6 impact
95 instances - 10 features - 5 classes - 9 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
731 runs0 likes5 downloads5 reach13 impact
151 instances - 7 features - 3 classes - 0 missing values
No data.
697 runs0 likes5 downloads5 reach6 impact
89 instances - 9 features - 2 classes - 0 missing values
No data.
718 runs0 likes5 downloads5 reach6 impact
63 instances - 30 features - 2 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
180 runs0 likes5 downloads5 reach14 impact
294 instances - 12 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
80 runs0 likes5 downloads5 reach7 impact
405 instances - 10937 features - 2 classes - 0 missing values
No data.
27 runs0 likes5 downloads5 reach2 impact
1000000 instances - 26 features - 7 classes - 0 missing values
pie chart 1
102 runs0 likes5 downloads5 reach5 impact
705 instances - 38 features - 2 classes - 0 missing values
pie chart 2
101 runs0 likes5 downloads5 reach5 impact
745 instances - 37 features - 2 classes - 0 missing values
* Source: JP Marques de Sá, INEB-Instituto de Engenharia Biomédica, Porto, Portugal; e-mail: jpmdesa '@' gmail.com J Jossinet, inserm, Lyon, France * Data Set Information: Impedance measurements…
280 runs0 likes5 downloads5 reach5 impact
106 instances - 10 features - 6 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2862 runs0 likes5 downloads5 reach16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2849 runs0 likes5 downloads5 reach16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
Abstract: A chess endgame data set representing the positions on the board of the white king, the white rook, and the black king. The task is to determine the optimum number of turn required for white…
25 runs0 likes5 downloads5 reach6 impact
28056 instances - 7 features - 18 classes - 0 missing values
Dataset title laLSVT Voice Rehabilitation Data Set Source: The dataset was created by Athanasios Tsanas (tsanasthanasis '@' gmail.com) of the University of Oxford. Abstract: 126 samples from 14…
162 runs0 likes5 downloads5 reach5 impact
126 instances - 311 features - 2 classes - 0 missing values
* Twonorm dataset This is an implementation of Leo Breiman's twonorm example[1]. It is a 20 dimensional, 2 class classification example. Each class is drawn from a multivariate normal distribution…
118 runs0 likes5 downloads5 reach6 impact
7400 instances - 21 features - 2 classes - 0 missing values
* Title: seeds Data Set * Abstract: Measurements of geometrical properties of kernels belonging to three different varieties of wheat. A soft X-ray technique and GRAINS package were used to construct…
190 runs0 likes5 downloads5 reach5 impact
210 instances - 8 features - 3 classes - 0 missing values
* Dataset Title: Vertebra Column - 3 classes * Abstract: Data set containing values for six biomechanical features used to classify orthopaedic patients into 3 classes (normal, disk hernia or…
154 runs0 likes5 downloads5 reach5 impact
310 instances - 7 features - 3 classes - 0 missing values
* Dataset Title: Vertebra Column - 2 classes * Abstract: Data set containing values for six biomechanical features used to classify orthopaedic patients into 3 classes (normal, disk hernia or…
124 runs0 likes5 downloads5 reach6 impact
310 instances - 7 features - 2 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: A4 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
136 runs0 likes5 downloads5 reach6 impact
1515 instances - 4 features - 5 classes - 0 missing values
####1. Summary This dataset contain attributes of dresses and their recommendations according to their sales. Sales are monitor on the basis of alternate days. The attributes present analyzed are:…
16895 runs0 likes5 downloads5 reach10 impact
500 instances - 13 features - 2 classes - 835 missing values
simple engine data
52 runs0 likes5 downloads5 reach4 impact
383 instances - 6 features - 3 classes - 0 missing values
Dataset KDD98 challenge: https://kdd.ics.uci.edu/databases/kddcup98/kddcup98.html The goal is to estimate the return from a direct mailing in order to maximize donation profits. This dataset…
0 runs0 likes5 downloads5 reach4 impact
191260 instances - 479 features - 0 classes - 5587563 missing values
Data Set Information: The data has been produced using Monte Carlo simulations. The first 21 features (columns 2-22) are kinematic properties measured by the particle detectors in the accelerator. The…
0 runs1 likes5 downloads6 reach5 impact
98050 instances - 29 features - 0 classes - 9 missing values
Source: Original Owner: U.S. Census Bureau http://www.census.gov/ United States Department of Commerce Donor: Terran Lane and Ronny Kohavi Data Mining and Visualization Silicon Graphics. terran '@'…
0 runs1 likes5 downloads6 reach5 impact
299285 instances - 42 features - classes - 0 missing values
Om algos te testen
74 runs0 likes5 downloads5 reach7 impact
14240 instances - 31 features - 2 classes - 0 missing values
DEXTER is a text classification problem in a bag-of-word representation. This is a two-class classification problem with sparse continuous input variables. This dataset is one of five datasets of the…
0 runs0 likes5 downloads5 reach10 impact
600 instances - 20001 features - 2 classes - 0 missing values
This simple domain contains 7 Boolean attributes and 10 classes, the set of decimal digits. Recall that LED displays contain 7 light-emitting diodes -- hence the reason for 7 attributes. The class…
12691 runs0 likes5 downloads5 reach9 impact
500 instances - 8 features - 10 classes - 0 missing values
Citation Request: This dataset is public available for research. The details are described in [Cortez et al., 2009]. Please include this citation if you plan to use this database: P. Cortez, A.…
64 runs1 likes5 downloads6 reach7 impact
4898 instances - 12 features - 7 classes - 0 missing values
General Description of Thyroid Disease Databases and Related Files This directory contains 6 databases, corresponding test set, and corresponding documentation. They were left at the University of…
92 runs0 likes5 downloads5 reach6 impact
2800 instances - 27 features - 5 classes - 0 missing values
Multi-label dataset. Audio dataset (emotions) consists of 593 musical files with 6 clustered emotional labels and 72 predictors. Each song can be labeled with one or more of the labels…
0 runs2 likes5 downloads7 reach3 impact
593 instances - 78 features - 2 classes - 0 missing values
No data.
68 runs0 likes4 downloads4 reach2 impact
1000000 instances - 21 features - 2 classes - 0 missing values
No data.
65 runs0 likes4 downloads4 reach1 impact
1000000 instances - 40 features - 2 classes - 0 missing values
No data.
63 runs0 likes4 downloads4 reach2 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
48 runs1 likes4 downloads5 reach2 impact
1000000 instances - 77 features - 10 classes - 0 missing values
No data.
51 runs1 likes4 downloads5 reach2 impact
1000000 instances - 48 features - 10 classes - 0 missing values
No data.
326 runs1 likes4 downloads5 reach2 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
90 runs0 likes4 downloads4 reach1 impact
137781 instances - 10 features - 7 classes - 0 missing values
No data.
219 runs0 likes4 downloads4 reach2 impact
1000000 instances - 58 features - 2 classes - 0 missing values
No data.
334 runs0 likes4 downloads4 reach2 impact
1000000 instances - 33 features - 2 classes - 0 missing values
No data.
69 runs0 likes4 downloads4 reach1 impact
1000000 instances - 20 features - 2 classes - 0 missing values
No data.
230 runs0 likes4 downloads4 reach2 impact
1000000 instances - 35 features - 2 classes - 0 missing values
1. Title: Wisconsin Prognostic Breast Cancer (WPBC) 2. Source Information a) Creators: Dr. William H. Wolberg, General Surgery Dept., University of Wisconsin, Clinical Sciences Center, Madison, WI…
5 runs0 likes4 downloads4 reach1 impact
194 instances - 33 features - 0 classes - 0 missing values
This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics; (b) its assigned insurance risk rating,; (c) its normalized losses in use as…
7 runs1 likes4 downloads5 reach1 impact
159 instances - 16 features - 0 classes - 0 missing values
No data.
310 runs0 likes4 downloads4 reach2 impact
1000000 instances - 11 features - 2 classes - 0 missing values
Synthetic dataset. Almost identical to [dataset 152](https://www.openml.org/d/153/edit)
319 runs0 likes4 downloads4 reach2 impact
1000000 instances - 11 features - 2 classes - 0 missing values
No data.
291 runs0 likes4 downloads4 reach1 impact
1000000 instances - 18 features - 7 classes - 0 missing values
No data.
326 runs0 likes4 downloads4 reach2 impact
1000000 instances - 14 features - 2 classes - 0 missing values
No data.
68 runs0 likes4 downloads4 reach1 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
326 runs0 likes4 downloads4 reach2 impact
1000000 instances - 16 features - 2 classes - 0 missing values
University of Sao Paulo, School of Art, Sciences and Humanities, Sao Paulo, SP, Brazil ### LIBRAS Movement Database LIBRAS, acronym of the Portuguese name "LIngua BRAsileira de Sinais", is the…
0 runs0 likes4 downloads4 reach7 impact
360 instances - 91 features - 0 classes - 0 missing values
No data.
332 runs0 likes4 downloads4 reach2 impact
1000000 instances - 17 features - 2 classes - 0 missing values
No data.
310 runs0 likes4 downloads4 reach2 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
941 runs0 likes4 downloads4 reach2 impact
74 instances - 63 features - 4 classes - 0 missing values
This is an artificial data set described in Breiman et al. (1984,p.238) (with variance 1 instead of 2). Generate the values of the 10 attributes independently using the following probabilities: P(X_1…
2 runs1 likes4 downloads5 reach2 impact
40768 instances - 11 features - 0 classes - 0 missing values
No data.
108 runs0 likes4 downloads4 reach10 impact
927 instances - 10129 features - 7 classes - 0 missing values
No data.
211 runs0 likes4 downloads4 reach10 impact
313 instances - 5805 features - 8 classes - 0 missing values
Squash Harvest Unstored Data source: Winna Harvey Crop and Food Research, Christchurch, New Zealand The purpose of the research was to determine the changes taking place in squash fruit during the…
876 runs0 likes4 downloads4 reach7 impact
52 instances - 24 features - 3 classes - 39 missing values
White Clover Persistence Trials Data source: Ian Tarbotton AgResearch, Whatawhata Research Centre, Hamilton, New Zealand The objective was to determine the mechanisms which influence the persistence…
858 runs0 likes4 downloads4 reach7 impact
63 instances - 32 features - 4 classes - 0 missing values
Squash Harvest Stored Data source: Winna Harvey Crop and Food Research, Christchurch, New Zealand The purpose of the research was to determine the changes taking place in squash fruit during the…
867 runs0 likes4 downloads4 reach7 impact
52 instances - 25 features - 3 classes - 7 missing values