OpenML
Filter results by:
test
0 runs0 likes0 downloads0 reach0 impact
8553 instances - 10 features - classes - 18454 missing values
test
0 runs0 likes0 downloads0 reach0 impact
2580 instances - 7 features - classes - 2541 missing values
test
0 runs0 likes0 downloads0 reach0 impact
2580 instances - 7 features - classes - 2541 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 42138 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 42138 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 42138 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 42138 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20058 instances - 16 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
16598 instances - 11 features - classes - 329 missing values
test
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 128136 missing values
Rows with NaN and inf values removed. Data converted from CSV to ARFF.
0 runs0 likes0 downloads0 reach0 impact
18982 instances - 80 features - classes - 0 missing values
xxx
0 runs0 likes0 downloads0 reach0 impact
891 instances - 8 features - 2 classes - 689 missing values
Data for an stock long position
0 runs0 likes0 downloads0 reach0 impact
4477 instances - 20 features - 0 classes - 0 missing values
xxx
0 runs0 likes0 downloads0 reach0 impact
891 instances - 8 features - classes - 689 missing values
Rows with NaN and inf values removed. Converted file format from CSV to ARFF.
0 runs0 likes0 downloads0 reach0 impact
18982 instances - 80 features - 5 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
4324 instances - 9 features - classes - 3360 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 128136 missing values
Salary Emp
0 runs0 likes0 downloads0 reach0 impact
31 instances - 2 features - classes - 0 missing values
DESCRIPTIVE ABSTRACT: The data set contains the oral, written and combined test scores for 2003 New Haven Fire Department promotion exams. The Race and Position for each test taker are also given.…
0 runs0 likes0 downloads0 reach0 impact
118 instances - 6 features - 2 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
101 instances - 18 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
2178 instances - 4 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
336 instances - 8 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
8124 instances - 23 features - classes - 2480 missing values
URL dataset
0 runs0 likes0 downloads0 reach0 impact
121001 instances - 501 features - 0 classes - 0 missing values
URL dataset 3
0 runs0 likes0 downloads0 reach0 impact
18982 instances - 80 features - 5 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach9 impact
163 instances - 28 features - 5 classes - 9 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach11 impact
379 instances - 9 features - 4 classes - 1418 missing values
Email dataset 1a
0 runs0 likes0 downloads0 reach0 impact
4585 instances - 4 features - 0 classes - 0 missing values
Email dataset 1c
0 runs0 likes0 downloads0 reach0 impact
4585 instances - 792 features - 0 classes - 0 missing values
Email dataset 1b
0 runs0 likes0 downloads0 reach0 impact
4585 instances - 24 features - 0 classes - 161 missing values
Phishing website 1
0 runs0 likes0 downloads0 reach0 impact
11055 instances - 31 features - 0 classes - 0 missing values
URL dataset 2
0 runs0 likes0 downloads0 reach0 impact
95911 instances - 13 features - 0 classes - 0 missing values
Email dataset 1d
0 runs0 likes0 downloads0 reach0 impact
4585 instances - 11 features - 0 classes - 0 missing values
Email dataset 1e
0 runs0 likes0 downloads0 reach0 impact
4585 instances - 580 features - 0 classes - 0 missing values
Email dataset 2
0 runs0 likes0 downloads0 reach0 impact
11507 instances - 4 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
67 runs0 likes1 downloads1 reach7 impact
458 instances - 10937 features - 2 classes - 0 missing values
This file contains data from Regression Analysis By Example, 2nd Edition, by Samprit Chatterjee and Bertram Price, John Wiley, 1991. Data sets have names of the form 'rabe.xxx' where xxx is the page…
0 runs0 likes1 downloads1 reach5 impact
40 instances - 3 features - 0 classes - 0 missing values
This S dump contains 22 data sets from the book Visualizing Data published by Hobart Press (books@hobart.com). The dump was created by data.dump() and can be read back into S by data.restore(). The…
2 runs0 likes1 downloads1 reach5 impact
8641 instances - 5 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
65 runs0 likes1 downloads1 reach7 impact
321 instances - 10937 features - 2 classes - 0 missing values
No data.
0 runs0 likes1 downloads1 reach1 impact
1000000 instances - 33 features - 0 classes - 0 missing values
No data.
0 runs0 likes1 downloads1 reach1 impact
1000000 instances - 14 features - 0 classes - 0 missing values
No data.
0 runs0 likes1 downloads1 reach1 impact
1000000 instances - 16 features - 0 classes - 0 missing values
1. Title: Ozone Level Detection 2. Source: Kun Zhang zhang.kun05 '@' gmail.com Department of Computer Science, Xavier University of Lousiana Wei Fan wei.fan '@' gmail.com IBM T.J.Watson Research…
0 runs0 likes1 downloads1 reach5 impact
2536 instances - 73 features - 0 classes - 0 missing values
Pittsburgh bridges This version is derived from version 1 by removing all instances with missing values in the last (target) attribute. The bridges dataset is originally not a classification dataset,…
31 runs0 likes1 downloads1 reach7 impact
105 instances - 13 features - 6 classes - 61 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes1 downloads1 reach5 impact
1000 instances - 26 features - 0 classes - 0 missing values
Systematic determination of genetic network architecture. Nature Genetics, 1999 Jul;22(3):281-5. Data also used in Biclustering of Expression Data, by Yizong Cheng and George M. Church (web…
0 runs0 likes1 downloads1 reach5 impact
17 instances - 2884 features - 0 classes - 0 missing values
Data from the RSCTC 2010 Discovery Challenge. All datasets contain between 100 and 400 samples, characterized by values of 20,000 - 65,000 attributes. Samples are assigned to several (2-10) classes.…
11 runs0 likes1 downloads1 reach6 impact
283 instances - 54622 features - 3 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistic methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs0 likes1 downloads1 reach5 impact
39 instances - 5 features - 0 classes - 0 missing values
------------------------------------------------------------------------------- TIME SERIES USED IN LONG-MEMORY PROCESSES, THE ALLAN VARIANCE AND WAVELETS BY D. B. PERCIVAL AND P. GUTTORP, A CHAPTER…
0 runs0 likes1 downloads1 reach5 impact
6875 instances - 1 features - 0 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach2 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach2 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
30 runs0 likes1 downloads1 reach2 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach3 impact
1000000 instances - 70 features - 24 classes - 0 missing values
No data.
31 runs0 likes1 downloads1 reach3 impact
1000000 instances - 70 features - 24 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
30 runs0 likes1 downloads1 reach2 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
10 runs0 likes1 downloads1 reach3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
7 runs0 likes1 downloads1 reach3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
7 runs0 likes1 downloads1 reach3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
6 runs0 likes1 downloads1 reach3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
7 runs0 likes1 downloads1 reach3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
30 runs0 likes1 downloads1 reach3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
30 runs0 likes1 downloads1 reach3 impact
1000000 instances - 70 features - 24 classes - 0 missing values
No data.
32 runs0 likes1 downloads1 reach2 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach3 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach3 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
27 runs0 likes1 downloads1 reach3 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach3 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach3 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
27 runs0 likes1 downloads1 reach3 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach3 impact
1000000 instances - 19 features - 4 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: The original Adult data set has 14 features, among which six are continuous and eight are categorical. In this data set,…
0 runs0 likes1 downloads1 reach5 impact
32561 instances - 124 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: The original Adult data set has 14 features, among which six are continuous and eight are categorical. In this data set,…
0 runs0 likes1 downloads1 reach5 impact
32561 instances - 124 features - 0 classes - 0 missing values
No data.
47 runs0 likes1 downloads1 reach1 impact
1000000 instances - 45 features - 2 classes - 0 missing values
No data.
0 runs0 likes1 downloads1 reach1 impact
177147 instances - 11 features - 0 classes - 0 missing values
No data.
44 runs0 likes1 downloads1 reach1 impact
1000000 instances - 13 features - 11 classes - 0 missing values
No data.
0 runs0 likes1 downloads1 reach1 impact
1000000 instances - 33 features - 0 classes - 0 missing values
No data.
0 runs0 likes1 downloads1 reach1 impact
144 instances - 77 features - 0 classes - 0 missing values
No data.
0 runs0 likes1 downloads1 reach1 impact
17496 instances - 10 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes1 downloads1 reach5 impact
365 instances - 4 features - 0 classes - 30 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes1 downloads1 reach3 impact
228 instances - 8 features - classes - 20 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
0 runs0 likes1 downloads1 reach3 impact
31 instances - 16 features - classes - 150 missing values
This analysis describes and summarizes the relationships between 1987 salaries of major league baseball players and the player's performance. The salary data were taken from Sports Illustrated, April…
0 runs1 likes1 downloads2 reach5 impact
26 instances - 9 features - 0 classes - 0 missing values
Source: Creators : François Kawala (1,2) Ahlame Douzal (1) Eric Gaussier (1) Eustache Diemert (2) Institutions : (1) Université Joseph Fourier (Grenoble I) Laboratoire d'informatique de…
0 runs0 likes1 downloads1 reach3 impact
28179 instances - 97 features - classes - 0 missing values