OpenML
Filter results by:
No data.
44 runs0 likes1 downloads1 reach0 impact
1000000 instances - 13 features - 11 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-300-drift-au7-cpd1-800 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and…
7130 runs0 likes10 downloads10 reach26 impact
1100 instances - 13 features - 5 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-700 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of…
4537 runs0 likes6 downloads6 reach17 impact
700 instances - 13 features - 3 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-cpd1-500 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity…
7145 runs0 likes7 downloads7 reach25 impact
500 instances - 13 features - 5 classes - 0 missing values
* Donor: David W. Aha (aha '@' ics.uci.edu) (714) 856-8779 * Data Set Information: This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In…
170 runs0 likes8 downloads8 reach4 impact
123 instances - 13 features - 5 classes - 0 missing values
* Title: Planning Relax Data Set * Abstract: The dataset concerns with the classification of two mental stages from recorded EEG signals: Planning (during imagination of motor act) and Relax state. *…
141 runs0 likes8 downloads8 reach5 impact
182 instances - 13 features - 2 classes - 0 missing values
####1. Summary This dataset contain attributes of dresses and their recommendations according to their sales. Sales are monitor on the basis of alternate days. The attributes present analyzed are:…
13194 runs0 likes5 downloads5 reach8 impact
500 instances - 13 features - 2 classes - 835 missing values
Relevant Information: -- The database contains 3 potential classes, one for the number of times a certain type of solar flare occured in a 24 hour period. -- Each instance represents captured features…
31 runs0 likes0 downloads0 reach8 impact
1066 instances - 13 features - 6 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach4 impact
66 instances - 13 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
103 runs0 likes5 downloads5 reach5 impact
107 instances - 13 features - 2 classes - 71 missing values
No data.
90 runs2 likes3 downloads5 reach0 impact
663552 instances - 13 features - 2 classes - 0 missing values
This file is a text file giving details about the time series analysed in 'The Analysis of Time Series' by Chris Chatfield. The 5th edn was published in 1996 and the 6th edn in 2003. The series are…
0 runs0 likes0 downloads0 reach4 impact
235 instances - 13 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
107 runs0 likes4 downloads4 reach5 impact
66 instances - 13 features - 2 classes - 0 missing values
The Computer Activity databases are a collection of computer systems activity measures. The data was collected from a Sun Sparcstation 20/712 with 128 Mbytes of memory running in a multi-user…
12 runs0 likes3 downloads3 reach4 impact
8192 instances - 13 features - 0 classes - 0 missing values
No data.
306 runs0 likes3 downloads3 reach0 impact
1000000 instances - 13 features - 6 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
753 runs0 likes10 downloads10 reach6 impact
8192 instances - 13 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
810 runs0 likes8 downloads8 reach6 impact
235 instances - 13 features - 2 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
153 runs0 likes8 downloads8 reach5 impact
81 instances - 13 features - 3 classes - 0 missing values
Relevant Information: -- The database contains 3 potential classes, one for the number of times a certain type of solar flare occured in a 24 hour period. -- Each instance represents captured features…
31 runs0 likes1 downloads1 reach8 impact
315 instances - 13 features - 5 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Solar Flare dataset (Lichman 2013) has 3 target variables that correspond to the number of…
0 runs0 likes0 downloads0 reach0 impact
1066 instances - 13 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Solar Flare dataset (Lichman 2013) has 3 target variables that correspond to the number of…
0 runs0 likes0 downloads0 reach0 impact
323 instances - 13 features - classes - 0 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1787 runs0 likes9 downloads9 reach0 impact
294 instances - 14 features - 2 classes - 782 missing values
This database contains 13 attributes (which have been extracted from a larger set of 75) Attribute Information: ------------------------ -- 1. age -- 2. sex -- 3. chest pain type (4 values) -- 4.…
3208 runs0 likes17 downloads17 reach0 impact
270 instances - 14 features - 2 classes - 0 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1761 runs0 likes10 downloads10 reach0 impact
303 instances - 14 features - 2 classes - 7 missing values
No data.
66 runs0 likes2 downloads2 reach0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
66 runs0 likes2 downloads2 reach0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
70 runs0 likes2 downloads2 reach0 impact
1000000 instances - 14 features - 2 classes - 0 missing values
Donor: David W. Aha (aha@ics.uci.edu) This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one…
36 runs0 likes5 downloads5 reach0 impact
303 instances - 14 features - 0 classes - 6 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Cholesterol treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
20 runs0 likes3 downloads3 reach0 impact
303 instances - 14 features - 0 classes - 6 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) This is the data set called `DETROIT' in the book `Subset selection in regression' by Alan J. Miller published in the Chapman & Hall series of monographs…
2 runs0 likes0 downloads0 reach0 impact
13 instances - 14 features - 0 classes - 0 missing values
No data.
310 runs0 likes2 downloads2 reach0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
326 runs0 likes4 downloads4 reach0 impact
1000000 instances - 14 features - 2 classes - 0 missing values
Determinants of Plasma Retinol and Beta-Carotene Levels Summary: Observational studies have suggested that low dietary intake or low plasma concentrations of retinol, beta-carotene, or other…
14 runs0 likes0 downloads0 reach4 impact
315 instances - 14 features - 0 classes - 0 missing values
No data.
0 runs0 likes1 downloads1 reach0 impact
1000000 instances - 14 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 14 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: scaled to [-1,1]
0 runs0 likes5 downloads5 reach4 impact
270 instances - 14 features - 0 classes - 0 missing values
* Dataset: This is a reprocessed version of heart-h (hungarian), the heart disease reprocessed hungarian dataset from UCI.
138 runs0 likes6 downloads6 reach4 impact
294 instances - 14 features - 5 classes - 0 missing values
* Donor: David W. Aha (aha '@' ics.uci.edu) (714) 856-8779 * Data Set Information: This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In…
159 runs1 likes4 downloads5 reach4 impact
200 instances - 14 features - 5 classes - 0 missing values
Data on the homicide rate in Detroit for the years 1961-1973. This is the data set called DETROIT in the book 'Subset selection in regression' by Alan J. Miller published in the Chapman & Hall series…
0 runs0 likes0 downloads0 reach4 impact
13 instances - 14 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
717 runs0 likes5 downloads5 reach6 impact
303 instances - 14 features - 2 classes - 7 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
10 runs0 likes0 downloads0 reach0 impact
294 instances - 14 features - 0 classes - 782 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
714 runs0 likes4 downloads4 reach6 impact
303 instances - 14 features - 2 classes - 6 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
700 runs0 likes4 downloads4 reach6 impact
294 instances - 14 features - 2 classes - 782 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
737 runs0 likes5 downloads5 reach6 impact
303 instances - 14 features - 2 classes - 6 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
688 runs0 likes4 downloads4 reach5 impact
294 instances - 14 features - 2 classes - 782 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
762 runs0 likes9 downloads9 reach6 impact
315 instances - 14 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
780 runs0 likes8 downloads8 reach6 impact
178 instances - 14 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
729 runs0 likes9 downloads9 reach6 impact
506 instances - 14 features - 2 classes - 0 missing values
The Boston house-price data of Harrison, D. and Rubinfeld, D.L. 'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, vol.5, 81-102, 1978. Used in Belsley, Kuh & Welsch,…
4 runs0 likes5 downloads5 reach8 impact
506 instances - 14 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
131 runs1 likes9 downloads10 reach6 impact
990 instances - 14 features - 2 classes - 0 missing values
No data.
312 runs0 likes4 downloads4 reach0 impact
1000000 instances - 14 features - 3 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistic methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs0 likes1 downloads1 reach4 impact
47 instances - 14 features - 0 classes - 0 missing values
The original Titanic dataset, describing the survival status of individual passengers on the Titanic. The titanic data does not contain information from the crew, but it does contain actual ages of…
0 runs0 likes6 downloads6 reach2 impact
1309 instances - 14 features - 2 classes - 3855 missing values
1. Title of Database: Wine recognition data Updated Sept 21, 1998 by C.Blake : Added attribute information 2. Sources: (a) Forina, M. et al, PARVUS - An Extendible Package for Data Exploration,…
1180 runs1 likes15 downloads16 reach0 impact
178 instances - 14 features - 3 classes - 0 missing values
cleve-pmlb
32 runs0 likes1 downloads1 reach8 impact
303 instances - 14 features - 2 classes - 0 missing values
Prediction task is to determine whether a person makes over 50K a year. Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the…
2671 runs1 likes28 downloads29 reach0 impact
48842 instances - 15 features - 2 classes - 6465 missing values
This dataset records 640 time series of 12 LPC cepstrum coefficients taken from nine male speakers. The data was collected for examining our newly developed classifier for multidimensional curves…
19127 runs0 likes10 downloads10 reach45 impact
9961 instances - 15 features - 9 classes - 0 missing values
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 1. Title: Assessing the Reliability of a Human Estimator…
0 runs0 likes0 downloads0 reach4 impact
75 instances - 15 features - 0 classes - 0 missing values
No data.
288 runs0 likes2 downloads2 reach0 impact
1000000 instances - 15 features - 9 classes - 0 missing values
Prediction task is to determine whether a person makes over 50K a year. Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the…
9100 runs0 likes13 downloads13 reach22 impact
48842 instances - 15 features - 2 classes - 6465 missing values
* Title of Database: Spoken Arabic Digit * Abstract: This dataset contains time series of mel-frequency cepstrum coefficients (MFCCs) corresponding to spoken Arabic digits. Includes data from 44 males…
1 runs0 likes6 downloads6 reach5 impact
263256 instances - 15 features - 10 classes - 0 missing values
In the early 2000s, Billy Beane and Paul DePodesta worked for the Oakland Athletics. While there, they literally changed the game of baseball. They didn't do it using a bat or glove, and they…
0 runs0 likes7 downloads7 reach2 impact
1232 instances - 15 features - 0 classes - 3600 missing values
Zurich public transport delay data 2016-10-30 03:30:00 CET - 2016-11-27 01:20:00 CET cleaned and prepared at Open Data Day 2017.
0 runs0 likes2 downloads2 reach3 impact
5465575 instances - 15 features - 0 classes - 132617 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
777 runs0 likes8 downloads8 reach7 impact
4562 instances - 15 features - 2 classes - 88 missing values
All data is from one continuous EEG measurement with the Emotiv EEG Neuroheadset. The duration of the measurement was 117 seconds. The eye state was detected via a camera during the EEG measurement…
162175 runs2 likes85 downloads87 reach17 impact
14980 instances - 15 features - 2 classes - 0 missing values
wind daily average wind speeds for 1961-1978 at 12 synoptic meteorological stations in the Republic of Ireland (Haslett and raftery 1989). These data were analyzed in detail in the following article:…
0 runs0 likes5 downloads5 reach4 impact
6574 instances - 15 features - 0 classes - 0 missing values
Short Summary: Lists estimates of the percentage of body fat determined by underwater weighing and various body circumference measurements for 252 men. Classroom use of this data set: This data set…
0 runs0 likes4 downloads4 reach6 impact
252 instances - 15 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
760 runs0 likes11 downloads11 reach6 impact
6574 instances - 15 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
857 runs0 likes12 downloads12 reach7 impact
9961 instances - 15 features - 2 classes - 0 missing values
No data.
44 runs0 likes2 downloads2 reach0 impact
1000000 instances - 15 features - 2 classes - 0 missing values
No data.
51 runs0 likes2 downloads2 reach0 impact
1000000 instances - 15 features - 2 classes - 0 missing values
This dataset was retrieved 2014-11-14 from the UCI site and converted to the ARFF format. __Major changes w.r.t. version 3: dataset from UCI that matches description and data types__ ### Feature…
4192 runs0 likes3 downloads3 reach3 impact
690 instances - 15 features - 2 classes - 0 missing values
Schizophrenic Eye-Tracking Data in Rubin and Wu (1997) Biometrics. Yingnian Wu (wu@hustat.harvard.edu) [14/Oct/97] Information about the dataset CLASSTYPE: nominal CLASSINDEX: last
748 runs0 likes7 downloads7 reach12 impact
340 instances - 15 features - 2 classes - 834 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
769 runs0 likes11 downloads11 reach6 impact
252 instances - 15 features - 2 classes - 0 missing values
This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics; (b) its assigned insurance risk rating,; (c) its normalized losses in use as…
6 runs1 likes4 downloads5 reach0 impact
159 instances - 16 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! All nominal attributes and instances with missing values are deleted. Price treated as the class attribute. As used by…
2 runs0 likes0 downloads0 reach0 impact
159 instances - 16 features - 0 classes - 0 missing values
No data.
326 runs0 likes4 downloads4 reach0 impact
1000000 instances - 16 features - 2 classes - 0 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
0 runs0 likes1 downloads1 reach2 impact
31 instances - 16 features - classes - 150 missing values
No data.
0 runs0 likes1 downloads1 reach0 impact
1000000 instances - 16 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 16 features - 0 classes - 0 missing values
Abstract: This dataset consists in a collection of shape and texture features extracted from digital images of leaf specimens originating from a total of 40 different plant species. Source: This…
112 runs0 likes8 downloads8 reach4 impact
340 instances - 16 features - 30 classes - 0 missing values
No data.
73 runs0 likes5 downloads5 reach0 impact
1000000 instances - 16 features - 2 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
68 runs0 likes7 downloads7 reach13 impact
32561 instances - 16 features - 2 classes - 4262 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach4 impact
67 instances - 16 features - 0 classes - 0 missing values
This is the pollution data so loved by writers of papers on ridge regression. Source: McDonald, G.C. and Schwing, R.C. (1973) 'Instabilities of regression estimates relating air pollution to…
0 runs0 likes1 downloads1 reach4 impact
60 instances - 16 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
718 runs0 likes6 downloads6 reach6 impact
159 instances - 16 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
790 runs0 likes10 downloads10 reach6 impact
159 instances - 16 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
102 runs0 likes4 downloads4 reach5 impact
67 instances - 16 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
754 runs0 likes10 downloads10 reach5 impact
60 instances - 16 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
700 runs0 likes5 downloads5 reach5 impact
67 instances - 16 features - 2 classes - 0 missing values
This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect the confidentiality of the data. This dataset is interesting because…
22109 runs1 likes28 downloads29 reach0 impact
690 instances - 16 features - 2 classes - 67 missing values
No data.
67 runs0 likes2 downloads2 reach0 impact
1000000 instances - 17 features - 10 classes - 0 missing values
No data.
71 runs0 likes5 downloads5 reach0 impact
1000000 instances - 17 features - 2 classes - 0 missing values
No data.
60 runs0 likes2 downloads2 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
1. Title: 1984 United States Congressional Voting Records Database 2. Source Information: (a) Source: Congressional Quarterly Almanac, 98th Congress, 2nd session 1984, Volume XL: Congressional…
2250 runs0 likes16 downloads16 reach0 impact
435 instances - 17 features - 2 classes - 392 missing values
No data.
356 runs0 likes7 downloads7 reach0 impact
131072 instances - 17 features - 2 classes - 0 missing values
No data.
293 runs0 likes2 downloads2 reach0 impact
1000000 instances - 17 features - 10 classes - 0 missing values
No data.
311 runs0 likes3 downloads3 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
28 runs0 likes2 downloads2 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
32 runs0 likes1 downloads1 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values