OpenML
https://www.kaggle.com/dansbecker/nba-shot-logs
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
128069 instances - 21 features - classes - 5567 missing values
hmeq_p,BAD,binary
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
5960 instances - 15 features - classes - 5271 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
68 runs - 0 likes - 7 downloads - 7 reach - 25 impact
32561 instances - 16 features - 2 classes - 4262 missing values
The original Titanic dataset, describing the survival status of individual passengers on the Titanic. The Titanic data does not contain information about the crew, but it does contain actual ages of…
0 runs - 2 likes - 32 downloads - 34 reach - 12 impact
1309 instances - 14 features - 2 classes - 3855 missing values
In the early 2000s, Billy Beane and Paul DePodesta worked for the Oakland Athletics. While there, they literally changed the game of baseball. They didn't do it using a bat or glove, and they…
0 runs - 0 likes - 8 downloads - 8 reach - 13 impact
1232 instances - 15 features - 0 classes - 3600 missing values
### Description ### This dataset is part of a collection of datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
11 runs - 0 likes - 0 downloads - 0 reach - 13 impact
5880 instances - 47 features - 3 classes - 3528 missing values
### Description ### This dataset is part of a collection of datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
11 runs - 0 likes - 0 downloads - 0 reach - 13 impact
5880 instances - 47 features - 3 classes - 3528 missing values
### Description ### This dataset is part of a collection of datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
10 runs - 0 likes - 0 downloads - 0 reach - 13 impact
5880 instances - 47 features - 3 classes - 3528 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
4324 instances - 9 features - classes - 3360 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The river flow datasets concern the prediction of river network flows for 48 h in the future at…
0 runs - 0 likes - 0 downloads - 0 reach - 9 impact
9125 instances - 72 features - classes - 3264 missing values
### Internet Usage Data #### Data Type multivariate #### Abstract This data contains general demographic information on internet users in 1997. ### Data Characteristics This data comes from a survey…
0 runs - 1 likes - 6 downloads - 7 reach - 12 impact
10108 instances - 72 features - 46 classes - 2699 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
622 runs - 0 likes - 6 downloads - 6 reach - 17 impact
10108 instances - 69 features - 2 classes - 2699 missing values
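The majority-class binarization mentioned in the entry above can be sketched roughly as follows. This is a minimal illustration, not OpenML's actual procedure: the function name `binarize_by_majority` is hypothetical, and because the description is truncated, relabeling every non-majority class as negative ('N') is an assumption.

```python
from collections import Counter


def binarize_by_majority(labels, positive="P", negative="N"):
    """Relabel the most frequent class as `positive`.

    Assumption: all other classes become `negative` ('N'); the truncated
    description only states what happens to the majority class.
    """
    if not labels:
        return []
    # most_common(1) returns [(label, count)] for the most frequent label.
    majority, _ = Counter(labels).most_common(1)[0]
    return [positive if label == majority else negative for label in labels]


if __name__ == "__main__":
    print(binarize_by_majority(["a", "b", "a", "c", "a"]))
```

If two classes tie for the highest count, `Counter.most_common` picks the one seen first, so a real pipeline would probably want an explicit tie-breaking rule.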
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
2580 instances - 7 features - classes - 2541 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
2580 instances - 7 features - classes - 2541 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
8124 instances - 23 features - classes - 2480 missing values
### Description This dataset describes mushrooms in terms of their physical characteristics. They are classified into: poisonous or edible. ### Source ``` (a) Origin: Mushroom records are drawn from…
16392 runs - 1 likes - 42 downloads - 43 reach - 13 impact
8124 instances - 23 features - 2 classes - 2480 missing values
This is the large soybean database from the UCI repository, with its training and test database combined into a single file. There are 19 classes, only the first 15 of which have been used in prior…
40719 runs - 1 likes - 53 downloads - 54 reach - 13 impact
683 instances - 36 features - 19 classes - 2337 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
722 runs - 0 likes - 6 downloads - 6 reach - 15 impact
683 instances - 36 features - 2 classes - 2337 missing values
This dataset contains traffic violation information from all electronic traffic violations issued in the County. Any information that can be used to uniquely identify the vehicle, the vehicle owner or…
0 runs - 1 likes - 1 downloads - 2 reach - 9 impact
70340 instances - 21 features - 3 classes - 2288 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) In this version (version 2), some features were removed. It is unclear why or how this was done.
1883 runs - 1 likes - 10 downloads - 11 reach - 9 impact
368 instances - 23 features - 2 classes - 1927 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) Database of surgeries on horses. Possible class attributes: 24 (whether lesion is surgical), others include: 23, 25, 26, and 27 Notes: * Hospital_Number…
236 runs - 0 likes - 9 downloads - 9 reach - 9 impact
368 instances - 27 features - 2 classes - 1927 missing values
source: http://www.cs.ubc.ca/labs/beta/Projects/SATzilla/ authors: L. Xu, F. Hutter, H. Hoos, K. Leyton-Brown translator in coseal format: M. Lindauer with the help of Alexandre Frechette the data do…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
296 instances - 116 features - 14 classes - 1810 missing values
The midwest survey dataset contains individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2778 instances - 28 features - 10 classes - 1744 missing values
The midwest survey dataset contains individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2778 instances - 28 features - 10 classes - 1744 missing values
Survey to know if people self-identify as Midwesterners.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2778 instances - 28 features - 10 classes - 1737 missing values
Survey to know if people self-identify as Midwesterners.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2778 instances - 28 features - 10 classes - 1737 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
379 instances - 8 features - 4 classes - 1418 missing values
Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative learning. The…
9537 runs - 0 likes - 0 downloads - 0 reach - 20 impact
1080 instances - 82 features - 8 classes - 1396 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
104 runs - 0 likes - 6 downloads - 6 reach - 15 impact
379 instances - 8 features - 2 classes - 1368 missing values
D. Harrington\ D. Harrington, (1991), published by John Wiley & Sons NAME: PBC Data SIZE: 418 observations, 20 variables DESCRIPTIVE ABSTRACT: Below is a description of the variables recorded from the…
10 runs - 0 likes - 1 downloads - 1 reach - 12 impact
418 instances - 19 features - 0 classes - 1239 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
723 runs - 0 likes - 5 downloads - 5 reach - 15 impact
418 instances - 19 features - 2 classes - 1239 missing values
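The mean-threshold binarization mentioned in the entry above could look something like the sketch below. The function name `binarize_by_mean` is hypothetical, and the description is cut off before it says which side of the mean becomes positive, so labeling values below the mean as 'P' and all others as 'N' is an assumption made only for illustration.

```python
from statistics import mean


def binarize_by_mean(values, positive="P", negative="N"):
    """Turn a numeric target into a two-class nominal target.

    Assumption: values strictly below the mean become `positive`, all
    others `negative`; the truncated description does not confirm the
    direction.
    """
    if not values:
        return []
    threshold = mean(values)
    return [positive if v < threshold else negative for v in values]


if __name__ == "__main__":
    # The mean of these values is 4.0.
    print(binarize_by_mean([1.0, 2.0, 3.0, 10.0]))
```

Swapping the comparison to `v > threshold` gives the opposite convention if that turns out to match the full description.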
PRO FOOTBALL SCORES How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage - if so how large is it? Are teams that play the…
15930 runs - 0 likes - 19 downloads - 19 reach - 25 impact
672 instances - 10 features - 2 classes - 1200 missing values
Primary Biliary Cirrhosis This data set is a follow-up to the original PBC data set, as discussed in appendix D of Fleming and Harrington, Counting Processes and Survival Analysis, Wiley, 1991. An…
0 runs - 0 likes - 5 downloads - 5 reach - 13 impact
1945 instances - 19 features - 0 classes - 1133 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
680 runs - 0 likes - 5 downloads - 5 reach - 15 impact
1945 instances - 19 features - 2 classes - 1133 missing values
Primary Biliary Cirrhosis The data set found in appendix D of Fleming and Harrington, Counting Processes and Survival Analysis, Wiley, 1991. The only differences are: age is in days status is coded as…
18 runs - 1 likes - 3 downloads - 4 reach - 14 impact
418 instances - 20 features - 0 classes - 1033 missing values
### Description Cylinder bands UCI dataset - Process delays known as cylinder banding in rotogravure printing were substantially mitigated using control rules discovered by decision tree induction.…
21477 runs - 0 likes - 8 downloads - 8 reach - 26 impact
540 instances - 40 features - 2 classes - 999 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 12 features - classes - 866 missing values
#### 1. Summary This dataset contains attributes of dresses and their recommendations according to their sales. Sales are monitored on alternate days. The attributes analyzed are:…
19054 runs - 1 likes - 6 downloads - 7 reach - 18 impact
500 instances - 13 features - 2 classes - 835 missing values
Schizophrenic Eye-Tracking Data in Rubin and Wu (1997) Biometrics. Information about the dataset CLASSTYPE: nominal CLASSINDEX: last
748 runs - 0 likes - 7 downloads - 7 reach - 24 impact
340 instances - 15 features - 2 classes - 834 missing values
1. Hungarian Institute of Cardiology. Budapest: Andras Janosi, M.D. 2. University Hospital, Zurich, Switzerland: William Steinbrunn, M.D. 3. University Hospital, Basel, Switzerland: Matthias…
10 runs - 0 likes - 0 downloads - 0 reach - 9 impact
294 instances - 14 features - 0 classes - 782 missing values
Publication Request: This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1789 runs - 0 likes - 12 downloads - 12 reach - 9 impact
294 instances - 14 features - 2 classes - 782 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
700 runs - 0 likes - 4 downloads - 4 reach - 15 impact
294 instances - 14 features - 2 classes - 782 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
688 runs - 0 likes - 4 downloads - 4 reach - 14 impact
294 instances - 14 features - 2 classes - 782 missing values
xxx
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 8 features - classes - 689 missing values
xxx
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 8 features - 2 classes - 689 missing values
Zurich public transport delay data 2016-10-30 03:30:00 CET - 2016-11-27 01:20:00 CET cleaned and prepared at Open Data Day 2017. For this version, the task was downsampled to 0.5 percent. Some…
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
27327 instances - 18 features - 0 classes - 657 missing values
This data represents crime reported to the Seattle Police Department (SPD). Each row contains the record of a unique event where at least one criminal offense was reported by a member of the community…
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
52358 instances - 8 features - 0 classes - 650 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
102 runs - 0 likes - 3 downloads - 3 reach - 15 impact
527 instances - 37 features - 2 classes - 542 missing values
1. Title: meta-data 2. Sources: (a) Creator: LIACC - University of Porto R.Campo Alegre 823 4150 PORTO (b) Donor: P.B.Brazdil or J.Gama Tel.: +351 600 1672 LIACC, University of Porto Fax.: +351 600…
32 runs - 0 likes - 2 downloads - 2 reach - 21 impact
528 instances - 22 features - 0 classes - 504 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
691 runs - 0 likes - 6 downloads - 6 reach - 15 impact
528 instances - 22 features - 2 classes - 504 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
686 runs - 0 likes - 5 downloads - 5 reach - 15 impact
782 instances - 9 features - 2 classes - 466 missing values
The objective was to determine which seedlots in a species are best for soil conservation in seasonally dry hill country. Determination is found by measurement of height, diameter by height, survival,…
27229 runs - 0 likes - 11 downloads - 11 reach - 10 impact
736 instances - 20 features - 5 classes - 448 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
701 runs - 0 likes - 3 downloads - 3 reach - 15 impact
736 instances - 20 features - 2 classes - 448 missing values
The aim is to determine the type of arrhythmia from the ECG recordings. This database contains 279 attributes, 206 of which are linear valued and the rest are nominal. Concerning the study of H. Altay…
4430 runs - 0 likes - 50 downloads - 50 reach - 15 impact
452 instances - 280 features - 13 classes - 408 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs - 1 likes - 5 downloads - 6 reach - 15 impact
452 instances - 280 features - 2 classes - 408 missing values
1. Title: 1984 United States Congressional Voting Records Database 2. Source Information: (a) Source: Congressional Quarterly Almanac, 98th Congress, 2nd session 1984, Volume XL: Congressional…
2262 runs - 0 likes - 17 downloads - 17 reach - 9 impact
435 instances - 17 features - 2 classes - 392 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
16598 instances - 11 features - classes - 329 missing values
Date: Tue, 15 Nov 88 15:44:08 EST From: stan To: aha@ICS.UCI.EDU 1. Title: Final settlements in labor negotiations in Canadian industry 2. Source Information -- Creators:…
7681 runs - 0 likes - 17 downloads - 17 reach - 12 impact
57 instances - 17 features - 2 classes - 326 missing values
This database is a standardized version of the original audiology database (see audiology.* in this directory). The non-standard set of attributes have been converted to a standard set of attributes…
7303 runs - 0 likes - 12 downloads - 12 reach - 12 impact
226 instances - 70 features - 24 classes - 317 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
721 runs - 0 likes - 5 downloads - 5 reach - 15 impact
226 instances - 70 features - 2 classes - 317 missing values
Test dataset
0 runs - 0 likes - 1 downloads - 1 reach - 13 impact
15547 instances - 61 features - 0 classes - 280 missing values
Test dataset
0 runs - 0 likes - 1 downloads - 1 reach - 13 impact
15547 instances - 61 features - 0 classes - 280 missing values
Test dataset
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
15547 instances - 61 features - 0 classes - 280 missing values
Test dataset
3 runs - 0 likes - 0 downloads - 0 reach - 15 impact
15547 instances - 61 features - 2 classes - 280 missing values
The AAUP dataset for the ASA Statistical Graphics Section's 1995 Data Analysis Exposition contains information on faculty salaries for 1161 American colleges and universities. The data may be obtained…
32 runs - 0 likes - 3 downloads - 3 reach - 14 impact
1161 instances - 15 features - 4 classes - 256 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
101 runs - 0 likes - 5 downloads - 5 reach - 15 impact
1161 instances - 16 features - 2 classes - 256 missing values
Primary Tumor Domain - Donors: - I. Kononenko, University E.Kardelj, Faculty for electrical engineering - B. Cestnik, Jozef Stefan Institute - Past Usage: (several) 1. Cestnik,G., Konenenko,I, &…
1261 runs - 0 likes - 16 downloads - 16 reach - 12 impact
339 instances - 18 features - 21 classes - 225 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
752 runs - 0 likes - 7 downloads - 7 reach - 15 impact
339 instances - 18 features - 2 classes - 225 missing values
Data are collected from the Kickstarter platform. You'll find the most useful data for project analysis. Columns are self-explanatory except: usd_pledged: conversion in US dollars of the pledged column…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
331675 instances - 14 features - classes - 210 missing values
#modelage
87 runs - 0 likes - 0 downloads - 0 reach - 8 impact
224 instances - 20 features - 6 classes - 205 missing values
#modelage
28 runs - 0 likes - 0 downloads - 0 reach - 8 impact
202 instances - 13 features - 3 classes - 202 missing values
1. Title: Hepatitis Domain 2. Sources: (a) unknown (b) Donor: G.Gong (Carnegie-Mellon University) via Bojan Cestnik Jozef Stefan Institute Jamova 39 61000 Ljubljana Yugoslavia (tel.: (38)(+61) 214-399…
2134 runs - 1 likes - 12 downloads - 13 reach - 9 impact
155 instances - 20 features - 2 classes - 167 missing values
Email dataset 1b
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
4585 instances - 24 features - 0 classes - 161 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
0 runs - 0 likes - 1 downloads - 1 reach - 11 impact
31 instances - 16 features - classes - 150 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
100 runs - 0 likes - 3 downloads - 3 reach - 14 impact
31 instances - 16 features - 2 classes - 150 missing values
This is the hip measurement data from Table B.13 in Chatfield's Problem Solving (1995, 2nd edn, Chapman and Hall). It is given in 8 columns. First 4 columns are for Control Group. Last 4 columns are…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
54 instances - 8 features - classes - 120 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
755 runs - 0 likes - 4 downloads - 4 reach - 14 impact
54 instances - 8 features - 2 classes - 120 missing values
1. Title: Echocardiogram Data 2. Source Information: -- Donor: Steven Salzberg (salzberg@cs.jhu.edu) -- Collector: -- Dr. Evlin Kinney -- The Reed Institute -- P.O. Box 402603 -- Miami, FL 33140-0603…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
132 instances - 8 features - 4 classes - 103 missing values