Data
Filter results by:
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
778 runs0 likes7 downloads7 reach6 impact
66 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1164 runs0 likes7 downloads7 reach7 impact
222 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
813 runs0 likes7 downloads7 reach7 impact
500 instances - 26 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
818 runs0 likes7 downloads7 reach7 impact
284 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
554 runs0 likes9 downloads9 reach7 impact
40768 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
726 runs0 likes6 downloads6 reach6 impact
61 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
801 runs0 likes8 downloads8 reach7 impact
500 instances - 51 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
808 runs1 likes9 downloads10 reach6 impact
100 instances - 26 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
729 runs0 likes5 downloads5 reach6 impact
93 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
118 runs0 likes5 downloads5 reach6 impact
50 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
787 runs0 likes7 downloads7 reach6 impact
73 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
754 runs0 likes10 downloads10 reach6 impact
60 instances - 16 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
786 runs0 likes7 downloads7 reach7 impact
500 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
985 runs0 likes8 downloads8 reach6 impact
100 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
816 runs0 likes7 downloads7 reach7 impact
500 instances - 26 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1266 runs0 likes11 downloads11 reach6 impact
131 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
819 runs0 likes10 downloads10 reach7 impact
500 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs0 likes6 downloads6 reach7 impact
1473 instances - 10 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
141 runs0 likes7 downloads7 reach6 impact
500 instances - 24 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
639 runs0 likes12 downloads12 reach7 impact
20000 instances - 17 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
766 runs0 likes11 downloads11 reach7 impact
2000 instances - 217 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
758 runs0 likes10 downloads10 reach7 impact
2000 instances - 77 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
857 runs0 likes12 downloads12 reach9 impact
9961 instances - 15 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
778 runs0 likes9 downloads9 reach7 impact
5000 instances - 41 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
765 runs0 likes12 downloads12 reach7 impact
5620 instances - 65 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
1133 runs0 likes15 downloads15 reach10 impact
150 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
801 runs0 likes8 downloads8 reach7 impact
841 instances - 71 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
780 runs0 likes8 downloads8 reach7 impact
178 instances - 14 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
1137 runs0 likes7 downloads7 reach6 impact
132 instances - 5 features - 2 classes - 0 missing values
Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring. Science, VOL 286, pp. 531-537, 15 October 1999. Web supplement to the article T.R. Golub, D. K.…
451 runs0 likes12 downloads12 reach6 impact
72 instances - 7130 features - 2 classes - 0 missing values
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
765 runs0 likes7 downloads7 reach6 impact
145 instances - 95 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from software for storage management for receiving and processing ground data. Data comes from McCabe and Halstead features extractors of…
159209 runs2 likes22 downloads24 reach19 impact
2109 instances - 22 features - 2 classes - 0 missing values
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% %% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
789 runs0 likes8 downloads8 reach6 impact
101 instances - 30 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. The specific type of software is unknown. Data comes from McCabe and Halstead features extractors of source code. These features were defined in…
777 runs0 likes9 downloads9 reach7 impact
458 instances - 40 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from flight software for earth orbiting satellite. Data comes from McCabe and Halstead features extractors of source code. These features…
875 runs0 likes12 downloads12 reach8 impact
5589 instances - 37 features - 2 classes - 0 missing values
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
908 runs0 likes9 downloads9 reach6 impact
130 instances - 9 features - 2 classes - 0 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable,…
765 runs0 likes9 downloads9 reach7 impact
403 instances - 38 features - 2 classes - 0 missing values
No data.
748 runs0 likes6 downloads6 reach7 impact
274 instances - 9 features - 2 classes - 0 missing values
No data.
794 runs1 likes13 downloads14 reach6 impact
107 instances - 30 features - 2 classes - 0 missing values
No data.
726 runs0 likes9 downloads9 reach6 impact
36 instances - 30 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from software for science data processing. Data comes from McCabe and Halstead features extractors of source code. These features were…
174308 runs0 likes21 downloads21 reach18 impact
522 instances - 22 features - 2 classes - 0 missing values
This database contains 13 attributes (which have been extracted from a larger set of 75) Attribute Information: ------------------------ -- 1. age -- 2. sex -- 3. chest pain type (4 values) -- 4.…
3214 runs0 likes17 downloads17 reach2 impact
270 instances - 14 features - 2 classes - 0 missing values
NAME vehicle silhouettes PURPOSE to classify a given silhouette as one of four types of vehicle, using a set of features extracted from the silhouette. The vehicle may be viewed from one of many…
28647 runs2 likes26 downloads28 reach2 impact
846 instances - 19 features - 4 classes - 0 missing values
No data.
1777 runs0 likes15 downloads15 reach1 impact
28056 instances - 7 features - 18 classes - 0 missing values
A simple database containing 17 Boolean-valued attributes describing animals. The "type" attribute appears to be the class attribute. Notes: * I find it unusual that there are 2 instances of "frog"…
168 runs2 likes16 downloads18 reach1 impact
101 instances - 17 features - 7 classes - 0 missing values
No data.
1038 runs0 likes8 downloads8 reach1 impact
55296 instances - 10 features - 3 classes - 0 missing values
Compilation of promoters with known transcriptional start points for E. coli genes. The task is to recognize promoters in strings that represent nucleotides (one of A, G, T, or C). A promoter is a…
138 runs1 likes9 downloads10 reach2 impact
106 instances - 59 features - 2 classes - 0 missing values
1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and Fisheries, Tasmania…
34894 runs0 likes18 downloads18 reach1 impact
4177 instances - 9 features - 28 classes - 0 missing values
No data.
1457 runs0 likes12 downloads12 reach1 impact
39366 instances - 10 features - 2 classes - 0 missing values
The dataset (originally named ELEC2) contains 45,312 instances dated from 7 May 1996 to 5 December 1998. Each example of the dataset refers to a period of 30 minutes, i.e. there are 48 instances for…
106103 runs3 likes30 downloads33 reach2 impact
45312 instances - 9 features - 2 classes - 0 missing values
No data.
2193 runs0 likes15 downloads15 reach1 impact
1484 instances - 9 features - 10 classes - 0 missing values
The database consists of the multi-spectral values of pixels in 3x3 neighbourhoods in a satellite image, and the classification associated with the central pixel in each neighbourhood. The aim is to…
27180 runs2 likes23 downloads25 reach2 impact
6430 instances - 37 features - 6 classes - 0 missing values
1. Title: Glass Identification Database 2. Sources: (a) Creator: B. German -- Central Research Establishment Home Office Forensic Science Service Aldermaston, Reading, Berkshire RG7 4PN (b) Donor:…
1776 runs0 likes50 downloads50 reach1 impact
214 instances - 10 features - 6 classes - 0 missing values
Primate splice-junction gene sequences (DNA) with associated imperfect domain theory. Splice junctions are points on a DNA sequence at which 'superfluous' DNA is removed during the process of protein…
22842 runs1 likes14 downloads15 reach1 impact
3190 instances - 61 features - 3 classes - 0 missing values
1. Title: Teaching Assistant Evaluation 2. Sources: (a) Collector: Wei-Yin Loh (Department of Statistics, UW-Madison) (b) Donor: Tjen-Sien Lim (limt@stat.wisc.edu) (b) Date: June 7, 1997 3. Past…
2028 runs0 likes12 downloads12 reach1 impact
151 instances - 6 features - 3 classes - 0 missing values
This database encodes the complete set of possible board configurations at the end of tic-tac-toe games, where "x" is assumed to have played first. The target concept is "win for x" (i.e., true when…
385258 runs1 likes59 downloads60 reach1 impact
958 instances - 10 features - 2 classes - 0 missing values
This radar data was collected by a system in Goose Bay, Labrador. This system consists of a phased array of 16 high-frequency antennas with a total transmitted power on the order of 6.4 kilowatts. See…
2484 runs3 likes27 downloads30 reach2 impact
351 instances - 35 features - 2 classes - 0 missing values
1. Title: Haberman's Survival Data 2. Sources: (a) Donor: Tjen-Sien Lim (limt@stat.wisc.edu) (b) Date: March 4, 1999 3. Past Usage: 1. Haberman, S. J. (1976). Generalized Residuals for Log-Linear…
3241 runs1 likes19 downloads20 reach1 impact
306 instances - 4 features - 2 classes - 0 missing values
SPAM E-mail Database The "spam" concept is diverse: advertisements for products/websites, make money fast schemes, chain letters, pornography... Our collection of spam e-mails came from our postmaster…
158566 runs3 likes82 downloads85 reach2 impact
4601 instances - 58 features - 2 classes - 0 missing values
Generator generating 3 classes of waves. Each class is generated from a combination of 2 of 3 "base" waves. For details, see Breiman,L., Friedman,J.H., Olshen,R.A., and Stone,C.J. (1984).…
19670 runs1 likes53 downloads54 reach2 impact
5000 instances - 41 features - 3 classes - 0 missing values
1. Title of Database: Blocks Classification 2. Sources: (a) Donato Malerba Dipartimento di Informatica University of Bari via Orabona 4 70126 Bari - Italy phone: +39 - 80 - 5443269 fax: +39 - 80 -…
2719 runs0 likes17 downloads17 reach2 impact
5473 instances - 11 features - 5 classes - 0 missing values
The instances were drawn randomly from a database of 7 outdoor images. The images were hand-segmented to create a classification for every pixel. Each instance is a 3x3 region. ### Attribute…
23138 runs0 likes22 downloads22 reach2 impact
2310 instances - 20 features - 7 classes - 0 missing values
### Description ISOLET (Isolated Letter Speech Recognition) dataset was generated as follows: 150 subjects spoke the name of each letter of the alphabet twice. Hence, there are 52 training examples…
48150 runs0 likes68 downloads68 reach122 impact
7797 instances - 618 features - 26 classes - 0 missing values
Speaker independent recognition of the eleven steady state vowels of British English using a specified training set of lpc derived log area ratios. Collected by David Deterding (data and…
25626 runs0 likes14 downloads14 reach35 impact
990 instances - 13 features - 11 classes - 0 missing values
### Description Scene recognition dataset - It contains characteristics about images and their classes. The original dataset is a multi-label classification problem with 6 different labels: {Beach,…
89930 runs0 likes22 downloads22 reach17 impact
2407 instances - 300 features - 2 classes - 0 missing values
1. Title: Part of the IRAS Low Resolution Spectrometer Database 2. Sources: (a) Originator: Infra-Red Astronomy Satellite Project Database (b) Donor: John Stutz (c) Date:…
1243 runs0 likes44 downloads44 reach7 impact
531 instances - 103 features - 48 classes - 0 missing values
No data.
400 runs0 likes6 downloads6 reach2 impact
45164 instances - 75 features - 11 classes - 0 missing values
ARFF version of UCI dataset 'flags'. Creators: Collected primarily from the "Collins Gem Guide to Flags": Collins Publishers (1986). Donor: Richard S. Forsyth. Date 5/15/1990 This data file contains…
103 runs0 likes8 downloads8 reach9 impact
194 instances - 30 features - 8 classes - 0 missing values
No data.
863 runs0 likes11 downloads11 reach1 impact
39366 instances - 10 features - 2 classes - 0 missing values
No data.
960 runs0 likes8 downloads8 reach1 impact
55296 instances - 10 features - 3 classes - 0 missing values
No data.
874 runs0 likes6 downloads6 reach2 impact
71 instances - 63 features - 6 classes - 0 missing values
No data.
940 runs0 likes5 downloads5 reach2 impact
74 instances - 63 features - 4 classes - 0 missing values
No data.
941 runs0 likes4 downloads4 reach2 impact
74 instances - 63 features - 4 classes - 0 missing values
No data.
988 runs0 likes3 downloads3 reach2 impact
74 instances - 63 features - 4 classes - 0 missing values
Mammography dataset Past Usage: 1. Woods, K., Doss, C., Bowyer, K., Solka, J., Priebe, C.,
215 runs4 likes46 downloads50 reach15 impact
11183 instances - 7 features - 2 classes - 0 missing values
No data.
414 runs0 likes8 downloads8 reach51 impact
690 instances - 8262 features - 10 classes - 0 missing values
No data.
215 runs0 likes7 downloads7 reach10 impact
204 instances - 5833 features - 6 classes - 0 missing values
No data.
426 runs0 likes15 downloads15 reach76 impact
2463 instances - 2001 features - 17 classes - 0 missing values
No data.
220 runs0 likes7 downloads7 reach10 impact
336 instances - 7903 features - 6 classes - 0 missing values
No data.
219 runs0 likes5 downloads5 reach10 impact
414 instances - 6430 features - 9 classes - 0 missing values
No data.
203 runs0 likes5 downloads5 reach10 impact
878 instances - 7455 features - 10 classes - 0 missing values
No data.
416 runs1 likes13 downloads14 reach52 impact
1050 instances - 3239 features - 10 classes - 0 missing values
No data.
428 runs0 likes12 downloads12 reach52 impact
1003 instances - 3183 features - 10 classes - 0 missing values
No data.
211 runs0 likes4 downloads4 reach10 impact
313 instances - 5805 features - 8 classes - 0 missing values
No data.
222 runs0 likes10 downloads10 reach7 impact
1504 instances - 2887 features - 13 classes - 0 missing values
This dataset records 640 time series of 12 LPC cepstrum coefficients taken from nine male speakers. The data was collected for examining our newly developed classifier for multidimensional curves…
23156 runs0 likes11 downloads11 reach46 impact
9961 instances - 15 features - 9 classes - 0 missing values
### Description Synthetic Control Chart Time Series. This is actually time series classification. ### Sources ``` * Original Owner and Donor Dr Robert Alcock rob@skyblue.csd.auth.gr ``` ### Dataset…
20354 runs0 likes10 downloads10 reach40 impact
600 instances - 62 features - 6 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1028 runs0 likes8 downloads8 reach6 impact
132 instances - 4 features - 2 classes - 0 missing values
Yeast dataset Past Usage: André Elisseeff and Jason Weston. A kernel method for multi-labelled classification. In Thomas G. Dietterich, Susan Becker, and Zoubin Ghahramani, editors, Advances in…
139 runs0 likes8 downloads8 reach6 impact
2417 instances - 117 features - 2 classes - 0 missing values
Hayes-Roth Database This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks. Source…
380 runs0 likes3 downloads3 reach16 impact
160 instances - 5 features - 3 classes - 0 missing values
SPECT heart data This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks. Sources: --…
1296 runs1 likes12 downloads13 reach8 impact
267 instances - 23 features - 2 classes - 0 missing values
SPECTF heart data This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks. NOTE: See the…
1103 runs0 likes12 downloads12 reach7 impact
349 instances - 45 features - 2 classes - 0 missing values
Grass Grubs and Damage Ranking Data source: R. J. Townsend AgResearch, Lincoln, New Zealand Grass grubs are one of the major insect pests of pasture in Canterbury and can cause severe pasture damage…
988 runs0 likes8 downloads8 reach7 impact
155 instances - 9 features - 4 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
394289 runs0 likes20 downloads20 reach26 impact
601 instances - 7 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
108666 runs0 likes14 downloads14 reach25 impact
554 instances - 7 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
358449 runs1 likes17 downloads18 reach29 impact
556 instances - 7 features - 2 classes - 0 missing values
Dataset sales
0 runs0 likes0 downloads0 reach0 impact
10738 instances - 15 features - 0 classes - 0 missing values
Test file for ML training
0 runs0 likes0 downloads0 reach0 impact
1599 instances - 12 features - classes - 0 missing values
Pasture Production Data source: Dave Barker AgResearch Grasslands, Palmerston North, New Zealand The objective was to predict pasture production from a variety of biophysical factors. Vegetation and…
878 runs0 likes6 downloads6 reach7 impact
36 instances - 23 features - 3 classes - 0 missing values
"The speech dataset was also provided by (see citation request) and contains real world data from recorded English language. The normal class contains data from persons having an American accent…
1599 runs0 likes5 downloads5 reach9 impact
3686 instances - 401 features - 2 classes - 0 missing values