OpenML
Filter results by:
testtest
0 runs0 likes0 downloads0 reach9 impact
1994 instances - 127 features - 0 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach11 impact
150 instances - 5 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach11 impact
150 instances - 5 features - classes - 0 missing values
This data represents crime reported to the Seattle Police Department (SPD). Each row contains the record of a unique event where at least one criminal offense was reported by a member of the community…
0 runs0 likes0 downloads0 reach8 impact
523590 instances - 8 features - 144 classes - 6916 missing values
exercises
0 runs0 likes0 downloads0 reach8 impact
15000 instances - 8 features - classes - 0 missing values
exercises
0 runs0 likes0 downloads0 reach8 impact
15000 instances - 8 features - classes - 0 missing values
The ILPD dataset from the OpenCC18 with all categorical variables label encoded
0 runs0 likes0 downloads0 reach8 impact
583 instances - 11 features - 0 classes - 0 missing values
The sick dataset from the OpenCC18 with all categorical data label encoded so all data is numeric
0 runs0 likes0 downloads0 reach8 impact
3772 instances - 30 features - classes - 0 missing values
The ILPD liver dataset from the OpenCC18 with the gender binary encoded so all features are numeric
1 runs0 likes0 downloads0 reach10 impact
583 instances - 11 features - 2 classes - 0 missing values
Sick dataset from the opencc18 with all textual binary variables label encoded.
1 runs0 likes2 downloads2 reach9 impact
3772 instances - 30 features - 2 classes - 0 missing values
Elegibilidade ecommerce
0 runs0 likes1 downloads1 reach8 impact
269177 instances - 2 features - 2 classes - 0 missing values
iris dataset test upload
0 runs0 likes0 downloads0 reach11 impact
150 instances - 5 features - 3 classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach8 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach8 impact
150 instances - 5 features - 3 classes - 0 missing values
UserID
0 runs0 likes0 downloads0 reach8 impact
1974675 instances - 10 features - classes - 1974675 missing values
web services evaluations in this table
0 runs0 likes0 downloads0 reach9 impact
1974675 instances - 10 features - classes - 1974675 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach8 impact
150 instances - 5 features - classes - 0 missing values
source: An Algorithm Selection Benchmark for the Container Pre-Marshalling Problem (CPMP) authors: K. Tierney and Y. Malitsky (features) / K. Tierney and D. Pacino and S. Voss (algorithms) translator…
0 runs0 likes1 downloads1 reach8 impact
2108 instances - 24 features - 0 classes - 0 missing values
This simple domain contains 7 Boolean attributes and 10 classes, the set of decimal digits. Recall that LED displays contain 7 light-emitting diodes -- hence the reason for 7 attributes. The class…
13006 runs0 likes9 downloads9 reach18 impact
500 instances - 8 features - 10 classes - 0 missing values
####1. Summary This database was generated by the Laboratory of Image Processing and Pattern Recognition (INPG-LTIRF) in the development of the Esprit project ELENA No. 6891 and the Esprit working…
20229 runs0 likes13 downloads13 reach18 impact
5500 instances - 41 features - 11 classes - 0 missing values
UCI Thyroid allbp dataset.
97 runs0 likes10 downloads10 reach14 impact
2800 instances - 27 features - 5 classes - 0 missing values
General Description of Thyroid Disease Databases and Related Files This directory contains 6 databases, corresponding test set, and corresponding documentation. They were left at the University of…
92 runs0 likes6 downloads6 reach14 impact
2800 instances - 27 features - 5 classes - 0 missing values
General Description of Thyroid Disease Databases and Related Files This directory contains 6 databases, corresponding test set, and corresponding documentation. They were left at the University of…
32 runs0 likes8 downloads8 reach13 impact
2800 instances - 27 features - 5 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Airline Ticket Price dataset concerns the prediction of airline ticket prices. The rows are a…
0 runs0 likes0 downloads0 reach9 impact
296 instances - 417 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Electrical Discharge Machining dataset (Karalic and Bratko 1997) represents a two-target…
0 runs0 likes0 downloads0 reach9 impact
154 instances - 18 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Energy Building dataset (Tsanas and Xifara 2012) concerns the prediction of the heating load…
0 runs0 likes0 downloads0 reach9 impact
768 instances - 10 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Jura (Goovaerts 1997) dataset consists of measurements of concentrations of seven heavy…
0 runs0 likes0 downloads0 reach9 impact
359 instances - 18 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : This is a pre-processed version of the dataset used in Kaggles See Click Predict Fix competition…
0 runs0 likes0 downloads0 reach9 impact
1137 instances - 26 features - classes - 9255 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Solar Flare dataset (Lichman 2013) has 3 target variables that correspond to the number of…
0 runs0 likes0 downloads0 reach9 impact
323 instances - 13 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Solar Flare dataset (Lichman 2013) has 3 target variables that correspond to the number of…
0 runs0 likes0 downloads0 reach9 impact
1066 instances - 13 features - classes - 0 missing values
The YouTube personality dataset consists of a collection of behavorial features, speech transcriptions, and personality impression scores for a set of 404 YouTube vloggers that explicitly show…
0 runs0 likes1 downloads1 reach9 impact
404 instances - 31 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach9 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach9 impact
150 instances - 5 features - 3 classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach9 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach9 impact
150 instances - 5 features - 3 classes - 0 missing values
sde c
5 runs0 likes0 downloads0 reach7 impact
1556 instances - 5629 features - classes - 0 missing values
as cscs
3 runs0 likes0 downloads0 reach7 impact
1557 instances - 5629 features - classes - 0 missing values
sd vfv
0 runs0 likes0 downloads0 reach7 impact
4 instances - 50 features - 2 classes - 0 missing values
r rg
0 runs0 likes0 downloads0 reach8 impact
4 instances - 50 features - classes - 0 missing values
as dwd
1 runs0 likes0 downloads0 reach7 impact
1557 instances - 5629 features - classes - 0 missing values
ef r
2 runs0 likes0 downloads0 reach7 impact
1557 instances - 5629 features - classes - 0 missing values
dd ref
0 runs0 likes0 downloads0 reach7 impact
4 instances - 50 features - classes - 0 missing values
The Inpatient Utilization and Payment Public Use File (Inpatient PUF) provides information on inpatient discharges for Medicare fee-for-service beneficiaries. The Inpatient PUF includes…
0 runs0 likes0 downloads0 reach7 impact
163065 instances - 12 features - 0 classes - 0 missing values
sqs efrf
0 runs0 likes0 downloads0 reach7 impact
4 instances - 5 features - classes - 0 missing values
b gtrg
0 runs0 likes0 downloads0 reach7 impact
4 instances - 7 features - classes - 0 missing values
Airlines Dataset Inspired in the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given the information of the scheduled departure. For this…
0 runs0 likes2 downloads2 reach6 impact
26969 instances - 8 features - 2 classes - 0 missing values
Zurich public transport delay data 2016-10-30 03:30:00 CET - 2016-11-27 01:20:00 CET cleaned and prepared at Open Data Day 2017. For this version, the task was downsampled to 0.5 percent. Some…
0 runs0 likes0 downloads0 reach7 impact
27327 instances - 18 features - 0 classes - 657 missing values
This data represents crime reported to the Seattle Police Department (SPD). Each row contains the record of a unique event where at least one criminal offense was reported by a member of the community…
0 runs0 likes0 downloads0 reach7 impact
52358 instances - 8 features - 0 classes - 650 missing values
ede wey
0 runs0 likes1 downloads1 reach6 impact
589 instances - 2909 features - classes - 0 missing values
eevrr der
0 runs0 likes0 downloads0 reach9 impact
1557 instances - 5629 features - classes - 0 missing values
Touch Signals
0 runs0 likes0 downloads0 reach6 impact
265 instances - 11 features - classes - 0 missing values
Touch samples 2
0 runs0 likes0 downloads0 reach8 impact
265 instances - 11 features - 8 classes - 0 missing values
valores de saida de fardamento com temperaturas, admissões e demissões
0 runs0 likes0 downloads0 reach9 impact
6277 instances - 7 features - 0 classes - 0 missing values
ef f
0 runs0 likes0 downloads0 reach9 impact
4 instances - 49 features - classes - 0 missing values
dsd efe
1 runs0 likes0 downloads0 reach9 impact
601 instances - 7 features - classes - 0 missing values
rrvrf 4rr
0 runs0 likes0 downloads0 reach9 impact
4 instances - 49 features - classes - 0 missing values
de d
3 runs0 likes0 downloads0 reach9 impact
1556 instances - 5628 features - classes - 0 missing values
fr frf
2 runs0 likes0 downloads0 reach7 impact
1556 instances - 5629 features - classes - 0 missing values
The midwest survey dataset contain individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs0 likes0 downloads0 reach6 impact
2778 instances - 28 features - 10 classes - 1744 missing values
dd efrg
15 runs0 likes1 downloads1 reach15 impact
1556 instances - 5629 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach8 impact
150 instances - 5 features - classes - 0 missing values
efe def
0 runs0 likes0 downloads0 reach9 impact
4 instances - 49 features - classes - 0 missing values
swd dced
0 runs0 likes0 downloads0 reach6 impact
589 instances - 3 features - classes - 0 missing values
sdsw frfr
0 runs0 likes0 downloads0 reach6 impact
1556 instances - 3 features - classes - 0 missing values
efe rgrg
0 runs0 likes0 downloads0 reach6 impact
Data Set Information: This research aimed at the case of customers’ default payments in Taiwan and compares the predictive accuracy of probability of default among six data mining methods. From…
0 runs0 likes1 downloads1 reach7 impact
30000 instances - 24 features - 2 classes - 0 missing values
e fvr
0 runs0 likes0 downloads0 reach6 impact
2 instances - 11 features - classes - 0 missing values
dd fgrfg
0 runs0 likes0 downloads0 reach7 impact
2 instances - 3 features - classes - 0 missing values
efef ffrf
0 runs0 likes0 downloads0 reach6 impact
9 instances - 3 features - classes - 0 missing values
Water stress dataset for Indian variety of wheat crop: The data consist of a file system-based data of Raj 3765 variety of wheat. There are twenty-four chlorophyll fluorescence images captured every…
0 runs0 likes2 downloads2 reach8 impact
1188 instances - 23 features - 0 classes - 0 missing values
Identify jets of particles from the LHC, created for the study of ultra low latency inference with hls4ml. Use 16 high level features to identify the 5 jet classes: quark (q), gluon (g), W boson (w),…
0 runs0 likes0 downloads0 reach8 impact
830000 instances - 17 features - 5 classes - 0 missing values
ssc vdv
0 runs0 likes0 downloads0 reach6 impact
1556 instances - 2 features - classes - 0 missing values
Experiment data obtained by running random configurations of the hnsw kNN through mlr on 116 different classification tasks from openml.
0 runs0 likes0 downloads0 reach7 impact
111753 instances - 13 features - classes - 0 missing values
Experiment data obtained by running random configurations of glmnet through mlr on 114 different classification tasks from openml.
0 runs0 likes0 downloads0 reach7 impact
104820 instances - 10 features - classes - 0 missing values
Experiment data obtained by running random configurations of an SVM through mlr on 106 different classification tasks from openml.
0 runs0 likes0 downloads0 reach7 impact
540576 instances - 15 features - classes - 658962 missing values
Experiment data obtained by running random configurations of rpart through mlr on 115 different classification tasks from openml.
0 runs0 likes0 downloads0 reach7 impact
92067 instances - 12 features - classes - 0 missing values
Experiment data obtained by running random configurations of ranger through mlr on 119 different classification tasks from openml.
0 runs0 likes0 downloads0 reach7 impact
278863 instances - 16 features - classes - 138965 missing values
Experiment data obtained by running random configurations of xgboost through mlr on 118 different classification tasks from openml. Parameter descriptions:…
0 runs0 likes0 downloads0 reach7 impact
2955210 instances - 21 features - classes - 7051006 missing values
dataset for bme
0 runs0 likes0 downloads0 reach7 impact
63 instances - 12 features - classes - 52 missing values
The midwest survey dataset contain individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs0 likes0 downloads0 reach6 impact
2778 instances - 28 features - 10 classes - 1744 missing values
frf r
0 runs0 likes0 downloads0 reach6 impact
2 instances - 3 features - classes - 0 missing values
e eded
0 runs0 likes0 downloads0 reach6 impact
2 instances - 4 features - classes - 0 missing values
e3r4vr t4r
0 runs0 likes0 downloads0 reach6 impact
2 instances - 5 features - classes - 0 missing values
f fr
0 runs0 likes0 downloads0 reach6 impact
2 instances - 5 features - classes - 0 missing values
Data from https://doi.org/10.5281/zenodo.269636
0 runs0 likes5 downloads5 reach14 impact
4758 instances - 39 features - classes - 0 missing values
#study_1
0 runs0 likes0 downloads0 reach10 impact
944 instances - 17 features - classes - 0 missing values
This dataset is gather to detect whether a person is running or walking based on deep neural networks and sensor data collected from iOS devices. The dataset represents 88588 sensor data samples…
1 runs0 likes4 downloads4 reach14 impact
88588 instances - 7 features - 2 classes - 0 missing values
This is a 20,000 instance sample of the original CIFAR-10 dataset. Sampled randomly and stratified, with 2000 examples per class. Training and test set are merged. Find the corresponding task for the…
380 runs0 likes5 downloads5 reach22 impact
20000 instances - 3073 features - 10 classes - 0 missing values
0. airplane 1. automobile 2. bird 3. cat 4. deer 5. dog 6. frog 7. horse 8. ship 9. truck CIFAR-10 contains 6000 images per class. The original train-test split randomly divided these into 5000 train…
159 runs0 likes6 downloads6 reach21 impact
60000 instances - 3073 features - 10 classes - 0 missing values
Data used in an analysis of the Brown and Frown corpora for my doctoral dissertation titled ``Variations in Written English: Characterizing Authors' Rhetorical Language Choices Across Corpora of…
2048 runs0 likes1 downloads1 reach12 impact
1000 instances - 24 features - 30 classes - 0 missing values
Author: Gregory Gay, Tim Menzies, Misty Davies, Karen Gundy-Burlet Source: [Zenodo](https://zenodo.org/record/322475) Please cite: Misty Davies. (2009). bike [Data set]. Zenodo. DOI:…
0 runs0 likes4 downloads4 reach10 impact
4435 instances - 11 features - classes - 0 missing values
### Description __Changes to version 1:__ all categorical features transformed as such. This dataset represents a set of possible advertisements on Internet pages. ### Sources (a) Creator and donor:…
1430 runs0 likes5 downloads5 reach23 impact
3279 instances - 1559 features - 2 classes - 0 missing values
"The speech dataset was also provided by (see citation request) and contains real world data from recorded English language. The normal class contains data from persons having an American accent…
1599 runs0 likes6 downloads6 reach17 impact
3686 instances - 401 features - 2 classes - 0 missing values
Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative learning. The…
9537 runs0 likes0 downloads0 reach21 impact
1080 instances - 82 features - 8 classes - 1396 missing values
The happiness scores and rankings use data from the Gallup World Poll. The scores are based on answers to the main life evaluation question asked in the poll. This question, known as the Cantril…
2 runs0 likes3 downloads3 reach12 impact
158 instances - 12 features - 0 classes - 0 missing values
This file holds global land temperatures by country
0 runs0 likes1 downloads1 reach10 impact
577462 instances - 4 features - classes - 64563 missing values
holds information on average temperature per country
0 runs0 likes0 downloads0 reach10 impact
577462 instances - 4 features - classes - 64563 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
10 runs0 likes1 downloads1 reach20 impact
20000 instances - 4297 features - 2 classes - 0 missing values
One of the biggest challenges of an auto dealership purchasing a used car at an auto auction is the risk of that the vehicle might have serious issues that prevent it from being sold to customers. The…
3 runs0 likes3 downloads3 reach13 impact
72983 instances - 33 features - 2 classes - 149271 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
3 runs0 likes2 downloads2 reach18 impact
10000 instances - 2001 features - 5 classes - 0 missing values