OpenML
Filter results by:
source: http://www.cs.ubc.ca/labs/beta/Projects/SATzilla/ authors: L. Xu, F. Hutter, H. Hoos, K. Leyton-Brown translator in coseal format: M. Lindauer with the help of Alexandre Frechette the data do…
0 runs0 likes0 downloads0 reach1 impact
296 instances - 116 features - 14 classes - 1810 missing values
The midwest survey dataset contain individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs0 likes0 downloads0 reach0 impact
2778 instances - 28 features - 10 classes - 1744 missing values
The midwest survey dataset contain individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs0 likes0 downloads0 reach0 impact
2778 instances - 28 features - 10 classes - 1744 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach11 impact
379 instances - 9 features - 4 classes - 1418 missing values
Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative learning. The…
9537 runs0 likes0 downloads0 reach14 impact
1080 instances - 82 features - 8 classes - 1396 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
104 runs0 likes6 downloads6 reach13 impact
379 instances - 9 features - 2 classes - 1368 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Case number deleted. X treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric…
10 runs0 likes1 downloads1 reach1 impact
418 instances - 19 features - 0 classes - 1239 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
723 runs0 likes5 downloads5 reach13 impact
418 instances - 19 features - 2 classes - 1239 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
15930 runs0 likes19 downloads19 reach23 impact
672 instances - 10 features - 2 classes - 1200 missing values
Primary Biliary Cirrhosis This data set is a follow-up to the original PBC data set, as discussed in appendix D of Fleming and Harrington, Counting Processes and Survival Analysis, Wiley, 1991. An…
0 runs0 likes5 downloads5 reach7 impact
1945 instances - 19 features - 0 classes - 1133 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
680 runs0 likes5 downloads5 reach13 impact
1945 instances - 19 features - 2 classes - 1133 missing values
------------------------------------------------------------------------ Primary Biliary Cirrhosis The data set found in appendix D of Fleming and Harrington, Counting Processes and Survival Analysis,…
18 runs1 likes3 downloads4 reach7 impact
418 instances - 20 features - 0 classes - 1033 missing values
### Description Cylinder bands UCI dataset - Process delays known as cylinder banding in rotogravure printing were substantially mitigated using control rules discovered by decision tree induction.…
21477 runs0 likes8 downloads8 reach20 impact
540 instances - 40 features - 2 classes - 999 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
####1. Summary This dataset contain attributes of dresses and their recommendations according to their sales. Sales are monitor on the basis of alternate days. The attributes present analyzed are:…
19054 runs1 likes6 downloads7 reach12 impact
500 instances - 13 features - 2 classes - 835 missing values
Schizophrenic Eye-Tracking Data in Rubin and Wu (1997) Biometrics. Yingnian Wu (wu@hustat.harvard.edu) [14/Oct/97] Information about the dataset CLASSTYPE: nominal CLASSINDEX: last
748 runs0 likes7 downloads7 reach21 impact
340 instances - 15 features - 2 classes - 834 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
10 runs0 likes0 downloads0 reach1 impact
294 instances - 14 features - 0 classes - 782 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1789 runs0 likes12 downloads12 reach7 impact
294 instances - 14 features - 2 classes - 782 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
688 runs0 likes4 downloads4 reach12 impact
294 instances - 14 features - 2 classes - 782 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
700 runs0 likes4 downloads4 reach13 impact
294 instances - 14 features - 2 classes - 782 missing values
xxx
0 runs0 likes0 downloads0 reach0 impact
891 instances - 8 features - 2 classes - 689 missing values
xxx
0 runs0 likes0 downloads0 reach0 impact
891 instances - 8 features - classes - 689 missing values
Zurich public transport delay data 2016-10-30 03:30:00 CET - 2016-11-27 01:20:00 CET cleaned and prepared at Open Data Day 2017. For this version, the task was downsampled to 0.5 percent. Some…
0 runs0 likes0 downloads0 reach0 impact
27327 instances - 18 features - 0 classes - 657 missing values
This data represents crime reported to the Seattle Police Department (SPD). Each row contains the record of a unique event where at least one criminal offense was reported by a member of the community…
0 runs0 likes0 downloads0 reach0 impact
52358 instances - 8 features - 0 classes - 650 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
102 runs0 likes3 downloads3 reach13 impact
527 instances - 39 features - 2 classes - 560 missing values
1. Title: meta-data 2. Sources: (a) Creator: LIACC - University of Porto R.Campo Alegre 823 4150 PORTO (b) Donor: P.B.Brazdil or J.Gama Tel.: +351 600 1672 LIACC, University of Porto Fax.: +351 600…
32 runs0 likes2 downloads2 reach12 impact
528 instances - 22 features - 0 classes - 504 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
691 runs0 likes6 downloads6 reach13 impact
528 instances - 22 features - 2 classes - 504 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
686 runs0 likes5 downloads5 reach13 impact
782 instances - 9 features - 2 classes - 466 missing values
The objective was to determine which seedlots in a species are best for soil conservation in seasonally dry hill country. Determination is found by measurement of height, diameter by height, survival,…
27229 runs0 likes11 downloads11 reach7 impact
736 instances - 20 features - 5 classes - 448 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
701 runs0 likes3 downloads3 reach13 impact
736 instances - 20 features - 2 classes - 448 missing values
The aim is to determine the type of arrhythmia from the ECG recordings. This database contains 279 attributes, 206 of which are linear valued and the rest are nominal. Concerning the study of H. Altay…
4430 runs0 likes50 downloads50 reach12 impact
452 instances - 280 features - 13 classes - 408 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs1 likes5 downloads6 reach13 impact
452 instances - 280 features - 2 classes - 408 missing values
1. Title: 1984 United States Congressional Voting Records Database 2. Source Information: (a) Source: Congressional Quarterly Almanac, 98th Congress, 2nd session 1984, Volume XL: Congressional…
2262 runs0 likes17 downloads17 reach7 impact
435 instances - 17 features - 2 classes - 392 missing values
test
0 runs0 likes0 downloads0 reach0 impact
16598 instances - 11 features - classes - 329 missing values
Date: Tue, 15 Nov 88 15:44:08 EST From: stan To: aha@ICS.UCI.EDU 1. Title: Final settlements in labor negotitions in Canadian industry 2. Source Information -- Creators:…
7681 runs0 likes16 downloads16 reach9 impact
57 instances - 17 features - 2 classes - 326 missing values
No data.
7303 runs0 likes12 downloads12 reach9 impact
226 instances - 70 features - 24 classes - 317 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
721 runs0 likes5 downloads5 reach13 impact
226 instances - 70 features - 2 classes - 317 missing values
Test dataset
0 runs0 likes1 downloads1 reach3 impact
15547 instances - 61 features - 0 classes - 280 missing values
Test dataset
0 runs0 likes1 downloads1 reach3 impact
15547 instances - 61 features - 0 classes - 280 missing values
Test dataset
0 runs0 likes0 downloads0 reach3 impact
15547 instances - 61 features - 0 classes - 280 missing values
Test dataset
3 runs0 likes0 downloads0 reach7 impact
15547 instances - 61 features - 2 classes - 280 missing values
The AAUP dataset for the ASA Statistical Graphics Section's 1995 Data Analysis Exposition contains information on faculty salaries for 1161 American colleges and universities. The data may be obtained…
32 runs0 likes3 downloads3 reach12 impact
1161 instances - 17 features - 4 classes - 256 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
101 runs0 likes5 downloads5 reach13 impact
1161 instances - 17 features - 2 classes - 256 missing values
Citation Request: This primary tumor domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
1261 runs0 likes16 downloads16 reach9 impact
339 instances - 18 features - 21 classes - 225 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
752 runs0 likes7 downloads7 reach13 impact
339 instances - 18 features - 2 classes - 225 missing values
Data are collected from Kickstarter Platform You'll find most useful data for project analysis. Columns are self explanatory except: usd_pledged: conversion in US dollars of the pledged column…
0 runs0 likes0 downloads0 reach1 impact
331675 instances - 14 features - classes - 210 missing values
#modelage
87 runs0 likes0 downloads0 reach1 impact
224 instances - 20 features - 6 classes - 205 missing values
#modelage
28 runs0 likes0 downloads0 reach1 impact
202 instances - 13 features - 3 classes - 202 missing values
1. Title: Hepatitis Domain 2. Sources: (a) unknown (b) Donor: G.Gong (Carnegie-Mellon University) via Bojan Cestnik Jozef Stefan Institute Jamova 39 61000 Ljubljana Yugoslavia (tel.: (38)(+61) 214-399…
2134 runs1 likes12 downloads13 reach7 impact
155 instances - 20 features - 2 classes - 167 missing values
Email dataset 1b
0 runs0 likes0 downloads0 reach0 impact
4585 instances - 24 features - 0 classes - 161 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
0 runs0 likes1 downloads1 reach3 impact
31 instances - 16 features - classes - 150 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
100 runs0 likes3 downloads3 reach12 impact
31 instances - 17 features - 2 classes - 150 missing values
This is the hip measurement data from Table B.13 in Chatfield's Problem Solving (1995, 2nd edn, Chapman and Hall). It is given in 8 columns. First 4 columns are for Control Group. Last 4 columns are…
0 runs0 likes0 downloads0 reach3 impact
54 instances - 8 features - classes - 120 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
755 runs0 likes4 downloads4 reach12 impact
54 instances - 8 features - 2 classes - 120 missing values
1. Title: Echocardiogram Data 2. Source Information: -- Donor: Steven Salzberg (salzberg@cs.jhu.edu) -- Collector: -- Dr. Evlin Kinney -- The Reed Institute -- P.O. Box 402603 -- Maimi, FL 33140-0603…
0 runs0 likes0 downloads0 reach1 impact
132 instances - 8 features - 4 classes - 103 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
490 runs0 likes4 downloads4 reach11 impact
364 instances - 33 features - 6 classes - 101 missing values
Asteroid Dataset
0 runs0 likes1 downloads1 reach1 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes1 downloads1 reach2 impact
126131 instances - 34 features - 2 classes - 99 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Survival treated as the class attribute As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
12 runs0 likes2 downloads2 reach1 impact
130 instances - 10 features - 0 classes - 97 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
744 runs0 likes5 downloads5 reach12 impact
130 instances - 10 features - 2 classes - 97 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1187 runs1 likes10 downloads11 reach7 impact
412 instances - 9 features - 7 classes - 96 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
721 runs0 likes5 downloads5 reach13 impact
412 instances - 9 features - 2 classes - 96 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
778 runs0 likes8 downloads8 reach9 impact
4562 instances - 15 features - 2 classes - 88 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Weight treated as the class attribute. Identifier deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric…
10 runs0 likes2 downloads2 reach1 impact
158 instances - 8 features - 0 classes - 87 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
672 runs0 likes4 downloads4 reach13 impact
158 instances - 8 features - 2 classes - 87 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs0 likes6 downloads6 reach13 impact
364 instances - 33 features - 2 classes - 80 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
103 runs0 likes5 downloads5 reach12 impact
107 instances - 13 features - 2 classes - 71 missing values
This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect the confidentiality of the data. This dataset is interesting because…
25075 runs1 likes33 downloads34 reach9 impact
690 instances - 16 features - 2 classes - 67 missing values
Pittsburgh bridges This version is derived from version 1 by removing all instances with missing values in the last (target) attribute. The bridges dataset is originally not a classification dataset,…
31 runs0 likes1 downloads1 reach7 impact
105 instances - 13 features - 6 classes - 61 missing values
Pittsburgh bridges This version is derived from version 2 (the discretized version) by removing all instances with missing values in the last (target) attribute. The bridges dataset is originally not…
31 runs0 likes3 downloads3 reach12 impact
105 instances - 13 features - 6 classes - 61 missing values
Conventional and Social Media Movies (CSM) - Dataset 2014 and 2015 Data Set 12 features categorized as conventional and social media features. Both conventional features, collected from movies…
0 runs0 likes0 downloads0 reach0 impact
232 instances - 14 features - classes - 60 missing values
This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics, (b) its assigned insurance risk rating, (c) its normalized losses in use as…
3252 runs2 likes26 downloads28 reach8 impact
205 instances - 26 features - 6 classes - 59 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
727 runs0 likes5 downloads5 reach13 impact
205 instances - 26 features - 2 classes - 59 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
707 runs0 likes6 downloads6 reach13 impact
205 instances - 26 features - 2 classes - 57 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach5 impact
316 instances - 12 features - 0 classes - 56 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach7 impact
316 instances - 12 features - 0 classes - 56 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
12 runs0 likes0 downloads0 reach6 impact
316 instances - 12 features - 0 classes - 56 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach6 impact
316 instances - 12 features - 0 classes - 56 missing values