OpenML
Filter results by:
efef fdfef
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
zaxa xcdc
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
wdwd cd
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
ddfef fvdf
0 runs0 likes0 downloads0 reach0 impact
8 instances - 1 features - classes - 0 missing values
Much of machine learning research focuses on producing models which perform well on benchmark tasks, in turn improving our understanding of the challenges associated with those tasks. From the…
0 runs0 likes1 downloads1 reach3 impact
70000 instances - 785 features - 10 classes - 0 missing values
### Description One-hundred plant species leaves dataset (Class = Margin). ### Sources ``` (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The…
143050 runs1 likes16 downloads17 reach411 impact
1600 instances - 65 features - 100 classes - 0 missing values
The Inpatient Utilization and Payment Public Use File (Inpatient PUF) provides information on inpatient discharges for Medicare fee-for-service beneficiaries. The Inpatient PUF includes information on…
0 runs0 likes2 downloads2 reach1 impact
163065 instances - 12 features - 0 classes - 0 missing values
swd cdef
0 runs0 likes0 downloads0 reach0 impact
3 instances - 2 features - classes - 0 missing values
werr
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
ddef
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
swd
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
sds dcdcc
0 runs0 likes0 downloads0 reach0 impact
3 instances - 2 features - classes - 0 missing values
wded def
0 runs0 likes0 downloads0 reach0 impact
3 instances - 2 features - classes - 0 missing values
sxd cde
0 runs0 likes0 downloads0 reach0 impact
3 instances - 2 features - classes - 0 missing values
xscdc frfgrg
0 runs0 likes0 downloads0 reach0 impact
3 instances - 1 features - classes - 0 missing values
scs
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
wdede
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
swdw
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
qsqs
0 runs0 likes0 downloads0 reach0 impact
2 instances - 1 features - classes - 0 missing values
wdwd
0 runs0 likes0 downloads0 reach0 impact
2 instances - 1 features - classes - 0 missing values
dedfef
0 runs0 likes0 downloads0 reach0 impact
2 instances - 1 features - classes - 0 missing values
The satellite dataset comprises of features extracted from satellite observations. In particular, each image was taken under four different light wavelength, two in visible light (green and red) and…
2074 runs3 likes69 downloads72 reach22 impact
5100 instances - 37 features - 2 classes - 0 missing values
Download test
0 runs0 likes0 downloads0 reach0 impact
3 instances - 2 features - classes - 0 missing values
sdwd dede
0 runs0 likes0 downloads0 reach0 impact
44 instances - 2 features - classes - 0 missing values
xsxs cdf
0 runs0 likes0 downloads0 reach0 impact
6 instances - 2 features - classes - 0 missing values
University of Sao Paulo, School of Art, Sciences and Humanities, Sao Paulo, SP, Brazil ### LIBRAS Movement Database LIBRAS, acronym of the Portuguese name "LIngua BRAsileira de Sinais", is the…
0 runs0 likes4 downloads4 reach11 impact
360 instances - 91 features - 0 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistic methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
593 runs0 likes7 downloads7 reach7 impact
478 instances - 11 features - 3 classes - 0 missing values
cast metal 1
111 runs0 likes9 downloads9 reach6 impact
327 instances - 38 features - 2 classes - 0 missing values
pie chart 1
102 runs0 likes5 downloads5 reach6 impact
705 instances - 38 features - 2 classes - 0 missing values
Pizza cutter 3
188 runs0 likes6 downloads6 reach7 impact
1043 instances - 38 features - 2 classes - 0 missing values
Costa madre 1
90 runs0 likes6 downloads6 reach8 impact
296 instances - 38 features - 2 classes - 0 missing values
pie chart 2
101 runs0 likes5 downloads5 reach6 impact
745 instances - 37 features - 2 classes - 0 missing values
pie chart 3
103 runs0 likes6 downloads6 reach6 impact
1077 instances - 38 features - 2 classes - 0 missing values
Mega watt
183 runs0 likes8 downloads8 reach8 impact
253 instances - 38 features - 2 classes - 0 missing values
Pizza cutter
197 runs0 likes8 downloads8 reach7 impact
661 instances - 38 features - 2 classes - 0 missing values
* Abstract: The data was created by a medical expert as a data set to test the expert system, which will perform the presumptive diagnosis of two diseases of the urinary system. * Source: Jacek…
391 runs0 likes11 downloads11 reach6 impact
120 instances - 7 features - 2 classes - 0 missing values
No data.
697 runs0 likes7 downloads7 reach8 impact
320 instances - 9 features - 2 classes - 0 missing values
This database has been artificially generated. It describes the structure of the capital letters A, C, D, E, F, G, H, L, P, R, indicated by a number 1-10, in that order (A=1,C=2,...). Each letter's…
24309 runs0 likes10 downloads10 reach50 impact
10218 instances - 8 features - 10 classes - 0 missing values
This simple domain contains 7 Boolean attributes and 10 classes, the set of decimal digits. Recall that LED displays contain 7 light-emitting diodes -- hence the reason for 7 attributes. The class…
13006 runs0 likes9 downloads9 reach11 impact
500 instances - 8 features - 10 classes - 0 missing values
This is a corrected version of the previous data file in version 1, which contained a dataset (349 instances) incorrectly merged from the original training and test sets available on UCI (there are…
0 runs0 likes3 downloads3 reach5 impact
267 instances - 45 features - 2 classes - 0 missing values
UCI Thyroid allbp dataset.
97 runs0 likes9 downloads9 reach7 impact
2800 instances - 27 features - 5 classes - 0 missing values
General Description of Thyroid Disease Databases and Related Files This directory contains 6 databases, corresponding test set, and corresponding documentation. They were left at the University of…
92 runs0 likes5 downloads5 reach7 impact
2800 instances - 27 features - 5 classes - 0 missing values
* Dataset Title: Robot Execution Failures Data Set * Abstract: This dataset contains force and torque measurements on a robot after failure detection. Each failure is characterized by 15 force/torque…
129 runs0 likes3 downloads3 reach6 impact
117 instances - 91 features - 3 classes - 0 missing values
* Dataset Title: Robot Execution Failures Data Set * Abstract: This dataset contains force and torque measurements on a robot after failure detection. Each failure is characterized by 15 force/torque…
130 runs0 likes6 downloads6 reach6 impact
164 instances - 91 features - 5 classes - 0 missing values
* Dataset Title: Vertebra Column - 3 classes * Abstract: Data set containing values for six biomechanical features used to classify orthopaedic patients into 3 classes (normal, disk hernia or…
154 runs0 likes5 downloads5 reach6 impact
310 instances - 7 features - 3 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: A2 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
119 runs0 likes4 downloads4 reach7 impact
1623 instances - 4 features - 5 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: A3 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
133 runs0 likes7 downloads7 reach7 impact
1521 instances - 4 features - 5 classes - 0 missing values
* Title: Wholesale customers Data Set * Abstract: The data set refers to clients of a wholesale distributor. It includes the annual spending in monetary units (m.u.) on diverse product categories *…
161 runs0 likes10 downloads10 reach7 impact
440 instances - 9 features - 2 classes - 0 missing values
* Donor: David W. Aha (aha '@' ics.uci.edu) (714) 856-8779 * Data Set Information: This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In…
159 runs1 likes4 downloads5 reach6 impact
200 instances - 14 features - 5 classes - 0 missing values
* Twonorm dataset This is an implementation of Leo Breiman's twonorm example[1]. It is a 20 dimensional, 2 class classification example. Each class is drawn from a multivariate normal distribution…
118 runs0 likes5 downloads5 reach7 impact
7400 instances - 21 features - 2 classes - 0 missing values
* Abstract: Predict the Bankruptcy from Qualitative parameters from experts. * Source: Source Information -- Creator : Mr.A.Martin(jayamartin '@' yahoo.com) Mr.J.Uthayakumar (uthayakumar17691 '@'…
147 runs0 likes11 downloads11 reach7 impact
250 instances - 7 features - 2 classes - 0 missing values
1: Abstract: This is a 20 dimensional, 2 class classification problem. Each class is drawn from a multivariate normal distribution. Class 1 has mean zero and covariance 4 times the identity. Class 2…
120 runs0 likes8 downloads8 reach7 impact
7400 instances - 21 features - 2 classes - 0 missing values
Dataset title laLSVT Voice Rehabilitation Data Set Source: The dataset was created by Athanasios Tsanas (tsanasthanasis '@' gmail.com) of the University of Oxford. Abstract: 126 samples from 14…
162 runs0 likes5 downloads5 reach6 impact
126 instances - 311 features - 2 classes - 0 missing values
* Source: JP Marques de Sá, INEB-Instituto de Engenharia Biomédica, Porto, Portugal; e-mail: jpmdesa '@' gmail.com J Jossinet, inserm, Lyon, France * Data Set Information: Impedance measurements…
280 runs0 likes5 downloads5 reach6 impact
106 instances - 10 features - 6 classes - 0 missing values
2126 fetal cardiotocograms (CTGs) were automatically processed and the respective diagnostic features measured. The CTGs were also classified by three expert obstetricians and a consensus…
24176 runs5 likes29 downloads34 reach49 impact
2126 instances - 36 features - 10 classes - 0 missing values
This collection includes 21 data sets of one-dimensional ultrasound raw RF data (A-Scans) acquired from the calf muscles of 8 healthy volunteers. The subjects were asked to manually annotate the data…
0 runs0 likes1 downloads1 reach1 impact
212872 instances - 4 features - classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-250-drift-au6-cd1-500 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and…
11011 runs0 likes9 downloads9 reach40 impact
750 instances - 41 features - 8 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-cd1-400 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity…
144 runs0 likes3 downloads3 reach6 impact
400 instances - 41 features - 8 classes - 0 missing values
A 4-class version of breast-tissue dataset.
299 runs0 likes4 downloads4 reach6 impact
106 instances - 10 features - 4 classes - 0 missing values
* Dataset: Hill valley dataset. A noiseless version of the data set.
117 runs0 likes8 downloads8 reach8 impact
1212 instances - 101 features - 2 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-300-drift-au7-cpd1-800 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and…
7130 runs0 likes11 downloads11 reach28 impact
1100 instances - 13 features - 5 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-cpd1-500 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity…
7145 runs0 likes7 downloads7 reach28 impact
500 instances - 13 features - 5 classes - 0 missing values
* Abstract: The data was created by a medical expert as a data set to test the expert system, which will perform the presumptive diagnosis of two diseases of the urinary system. This is a…
423 runs0 likes14 downloads14 reach6 impact
120 instances - 7 features - 2 classes - 0 missing values
* Dataset: Reduced version (10 % of the examples) of bank-marketing dataset.
104 runs1 likes16 downloads17 reach8 impact
4521 instances - 17 features - 2 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: D3 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
126 runs0 likes3 downloads3 reach7 impact
9285 instances - 4 features - 5 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au1-1000 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of…
3255 runs1 likes8 downloads9 reach16 impact
1000 instances - 21 features - 2 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: B5 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
107 runs0 likes2 downloads2 reach7 impact
9989 instances - 4 features - 5 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: B6 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
111 runs0 likes2 downloads2 reach7 impact
10130 instances - 4 features - 5 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: B3 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
119 runs0 likes4 downloads4 reach7 impact
10386 instances - 4 features - 5 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: D2 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
118 runs0 likes3 downloads3 reach7 impact
9172 instances - 4 features - 5 classes - 0 missing values
At Santander our mission is to help people and businesses prosper. We are always looking for ways to help our customers understand their financial health and identify which products and services might…
0 runs0 likes0 downloads0 reach0 impact
200000 instances - 202 features - 2 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
313 runs0 likes36 downloads36 reach7 impact
399482 instances - 12 features - 2 classes - 0 missing values
Normalized version of vehicle dataset (http://www.openml.org/d/54) NAME vehicle silhouettes PURPOSE to classify a given silhouette as one of four types of vehicle, using a set of features extracted…
372 runs0 likes10 downloads10 reach3 impact
98528 instances - 101 features - 2 classes - 0 missing values
Multiclass from binary: Expanding one-vs-all, one-vs-one and ECOC-based approaches. Dataset taken from LIBSVM: https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/multiclass.html In this dataset…
0 runs0 likes0 downloads0 reach1 impact
108000 instances - 129 features - 1000 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
3 runs0 likes1 downloads1 reach9 impact
5418 instances - 1637 features - 2 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
3 runs0 likes2 downloads2 reach9 impact
10000 instances - 2001 features - 5 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
10 runs0 likes1 downloads1 reach10 impact
20000 instances - 4297 features - 2 classes - 0 missing values
Much of machine learning research focuses on producing models which perform well on benchmark tasks, in turn improving our understanding of the challenges associated with those tasks. From the…
1 runs0 likes0 downloads0 reach4 impact
270912 instances - 785 features - 49 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
1 runs0 likes0 downloads0 reach4 impact
51839 instances - 2917 features - 43 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
0 runs0 likes0 downloads0 reach8 impact
416188 instances - 61 features - 355 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
11 runs0 likes0 downloads0 reach10 impact
10000 instances - 7201 features - 10 classes - 0 missing values
This dataset is taken from the MiniBooNE experiment and is used to distinguish electron neutrinos (signal) from muon neutrinos (background). This dataset is ordered. It first contains all signal…
12 runs0 likes4 downloads4 reach5 impact
130064 instances - 51 features - 2 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
3 runs0 likes0 downloads0 reach9 impact
8237 instances - 801 features - 7 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
4 runs0 likes2 downloads2 reach9 impact
2984 instances - 145 features - 2 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
4 runs0 likes1 downloads1 reach9 impact
5124 instances - 21 features - 2 classes - 0 missing values
Airlines Dataset Inspired in the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given the information of the scheduled departure.
291 runs0 likes29 downloads29 reach8 impact
539383 instances - 8 features - 2 classes - 0 missing values
Klaverjas is an example of the Jack-Nine card games, which are characterized as trick-taking games where the the Jack and nine of the trump suit are the highest-ranking trumps, and the tens and aces…
0 runs0 likes1 downloads1 reach2 impact
981541 instances - 33 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX USD/CHF from Dukascopy. One instance (row) is one candlestick of one minute. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes0 downloads0 reach2 impact
375840 instances - 12 features - 2 classes - 0 missing values
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) Data set for KDD Cup 1999 Modified by TunedIT (converted to ARFF format)…
4 runs1 likes20 downloads21 reach7 impact
4898431 instances - 42 features - 23 classes - 0 missing values
No data.
353 runs0 likes17 downloads17 reach4 impact
120919 instances - 1002 features - 2 classes - 0 missing values
This is the original version of the famous covertype dataset in ARFF format. Predicting forest cover type from cartographic variables only (no remotely sensed data). The actual forest cover type for a…
9 runs1 likes14 downloads15 reach15 impact
581012 instances - 55 features - 7 classes - 0 missing values
1. Data set title: Nomao Data Set 2. Abstract: Nomao collects data about places (name, phone, localization...) from many sources. Deduplication consists in detecting what data refer to the same place.…
67106 runs0 likes16 downloads16 reach19 impact
34465 instances - 119 features - 2 classes - 0 missing values
* Title: Skin Segmentation Data Set * Abstract: The Skin Segmentation dataset is constructed over B, G, R color space. Skin and Nonskin dataset is generated using skin textures from face images of…
15 runs1 likes10 downloads11 reach7 impact
245057 instances - 4 features - 2 classes - 0 missing values
* Title of Database: Spoken Arabic Digit * Abstract: This dataset contains time series of mel-frequency cepstrum coefficients (MFCCs) corresponding to spoken Arabic digits. Includes data from 44 males…
1 runs0 likes8 downloads8 reach7 impact
263256 instances - 15 features - 10 classes - 0 missing values
* Abstract: Purpose is to predict poker hands * Source - Creators: Robert Cattral (cattral '@' gmail.com) Franz Oppacher (oppacher '@' scs.carleton.ca) Carleton University, Department of Computer…
1 runs0 likes5 downloads5 reach7 impact
1025009 instances - 11 features - 10 classes - 0 missing values
The dataset collects data from an Android smartphone positioned in the chest pocket. Accelerometer Data are collected from 22 participants walking in the wild over a predefined path. The dataset is…
80 runs0 likes8 downloads8 reach7 impact
149332 instances - 5 features - 22 classes - 0 missing values
Over 92 thousand images (32x32 pixels) of 46 characters from Devanagari script. Includes the alphabet as well as the numbers. Devanagari is an Indic script and forms a basis for over 100 languages…
42 runs2 likes7 downloads9 reach6 impact
92000 instances - 1025 features - 46 classes - 0 missing values
Fashion-MNIST is a dataset of Zalando's article images, consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a…
450 runs0 likes11 downloads11 reach16 impact
70000 instances - 785 features - 10 classes - 0 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
5083 runs0 likes2 downloads2 reach8 impact
44819 instances - 7 features - 3 classes - 0 missing values
shuttle-pmlb
10 runs0 likes3 downloads3 reach15 impact
58000 instances - 10 features - 7 classes - 0 missing values