Data
No data.
73 runs - 0 likes - 4 downloads - 4 reach - 0 impact
1000000 instances - 16 features - 2 classes - 0 missing values
No data.
87 runs - 0 likes - 4 downloads - 4 reach - 0 impact
295245 instances - 11 features - 5 classes - 0 missing values
No data.
68 runs - 0 likes - 3 downloads - 3 reach - 0 impact
1000000 instances - 21 features - 2 classes - 0 missing values
No data.
67 runs - 0 likes - 2 downloads - 2 reach - 0 impact
1000000 instances - 17 features - 10 classes - 0 missing values
No data.
65 runs - 0 likes - 4 downloads - 4 reach - 0 impact
1000000 instances - 40 features - 2 classes - 0 missing values
No data.
66 runs - 0 likes - 3 downloads - 3 reach - 0 impact
1000000 instances - 35 features - 6 classes - 0 missing values
No data.
211 runs - 0 likes - 3 downloads - 3 reach - 0 impact
1000000 instances - 20 features - 7 classes - 0 missing values
This data set was generated to model psychological experimental results. Each example is classified as having the balance scale tip to the right, tip to the left, or be balanced. The attributes are…
19798 runs - 0 likes - 14 downloads - 14 reach - 0 impact
625 instances - 5 features - 3 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26854 runs - 0 likes - 17 downloads - 17 reach - 0 impact
2000 instances - 217 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26672 runs - 0 likes - 10 downloads - 10 reach - 0 impact
2000 instances - 77 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26665 runs - 0 likes - 19 downloads - 19 reach - 0 impact
2000 instances - 65 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
27245 runs - 0 likes - 16 downloads - 16 reach - 0 impact
2000 instances - 7 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. The maps were scanned in 8 bit grey value at density of 400dpi,…
22666 runs - 0 likes - 17 downloads - 17 reach - 0 impact
2000 instances - 241 features - 10 classes - 0 missing values
No data.
66 runs - 0 likes - 2 downloads - 2 reach - 0 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
324 runs - 0 likes - 5 downloads - 5 reach - 0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
71 runs - 0 likes - 5 downloads - 5 reach - 0 impact
1000000 instances - 17 features - 2 classes - 0 missing values
No data.
60 runs - 0 likes - 2 downloads - 2 reach - 0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
63 runs - 0 likes - 3 downloads - 3 reach - 0 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
63 runs - 0 likes - 4 downloads - 4 reach - 0 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
68 runs - 0 likes - 10 downloads - 10 reach - 0 impact
1000000 instances - 10 features - 2 classes - 0 missing values
No data.
48 runs - 1 likes - 4 downloads - 5 reach - 0 impact
1000000 instances - 77 features - 10 classes - 0 missing values
This database contains 13 attributes (which have been extracted from a larger set of 75). Attribute Information: 1. age; 2. sex; 3. chest pain type (4 values); 4.…
3208 runs - 0 likes - 17 downloads - 17 reach - 0 impact
270 instances - 14 features - 2 classes - 0 missing values
NAME: vehicle silhouettes. PURPOSE: to classify a given silhouette as one of four types of vehicle, using a set of features extracted from the silhouette. The vehicle may be viewed from one of many…
20914 runs - 1 likes - 23 downloads - 24 reach - 0 impact
846 instances - 19 features - 4 classes - 0 missing values
This radar data was collected by a system in Goose Bay, Labrador. This system consists of a phased array of 16 high-frequency antennas with a total transmitted power on the order of 6.4 kilowatts. See…
2484 runs - 3 likes - 25 downloads - 28 reach - 0 impact
351 instances - 35 features - 2 classes - 0 missing values
Generator producing 3 classes of waves. Each class is generated from a combination of 2 of 3 "base" waves. For details, see Breiman, L., Friedman, J.H., Olshen, R.A., and Stone, C.J. (1984).…
15586 runs - 1 likes - 53 downloads - 54 reach - 0 impact
5000 instances - 41 features - 3 classes - 0 missing values
A simple database containing 17 Boolean-valued attributes describing animals. The "type" attribute appears to be the class attribute. Notes: * I find it unusual that there are 2 instances of "frog"…
168 runs - 2 likes - 14 downloads - 16 reach - 0 impact
101 instances - 18 features - 7 classes - 0 missing values
No data.
143 runs - 0 likes - 3 downloads - 3 reach - 0 impact
1000000 instances - 39 features - 6 classes - 0 missing values
1. Title: Chess End-Game -- King+Rook versus King+Pawn on a7 (usually abbreviated KRKPA7). The pawn on a7 means it is one square away from queening. It is the King+Rook's side (white) to move. 2.…
250986 runs - 0 likes - 30 downloads - 30 reach - 0 impact
3196 instances - 37 features - 2 classes - 0 missing values
1. TITLE: Letter Image Recognition Data The objective is to identify each of a large number of black-and-white rectangular pixel displays as one of the 26 capital letters in the English alphabet. The…
60471 runs - 1 likes - 66 downloads - 67 reach - 0 impact
20000 instances - 17 features - 26 classes - 0 missing values
The first 5 variables are all blood tests which are thought to be sensitive to liver disorders that might arise from excessive alcohol consumption. Each line in the dataset constitutes the record of a…
148 runs - 2 likes - 29 downloads - 31 reach - 0 impact
345 instances - 7 features - 0 classes - 0 missing values
No data.
50 runs - 0 likes - 1 downloads - 1 reach - 0 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
67 runs - 0 likes - 3 downloads - 3 reach - 0 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
66 runs - 0 likes - 3 downloads - 3 reach - 0 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
51 runs - 1 likes - 4 downloads - 5 reach - 0 impact
1000000 instances - 48 features - 10 classes - 0 missing values
No data.
1038 runs - 0 likes - 8 downloads - 8 reach - 0 impact
55296 instances - 10 features - 3 classes - 0 missing values
No data.
326 runs - 1 likes - 4 downloads - 5 reach - 0 impact
1000000 instances - 23 features - 2 classes - 0 missing values
We create a digit database by collecting 250 samples from 44 writers. The samples written by 30 writers are used for training, cross-validation and writer dependent testing, and the digits written by…
26338 runs - 0 likes - 18 downloads - 18 reach - 0 impact
10992 instances - 17 features - 10 classes - 0 missing values
The instances were drawn randomly from a database of 7 outdoor images. The images were hand-segmented to create a classification for every pixel. Each instance is a 3x3 region. ### Attribute…
19234 runs - 0 likes - 22 downloads - 22 reach - 0 impact
2310 instances - 20 features - 7 classes - 0 missing values
1. Title: Pima Indians Diabetes Database 2. Sources: (a) Original owners: National Institute of Diabetes and Digestive and Kidney Diseases (b) Donor of database: Vincent Sigillito…
193417 runs - 3 likes - 68 downloads - 71 reach - 0 impact
768 instances - 9 features - 2 classes - 0 missing values
1. Title: Protein Localization Sites 2. Creator and Maintainer: Kenta Nakai Institute of Molecular and Cellular Biology Osaka University 1-3 Yamada-oka, Suita 565 Japan nakai@imcb.osaka-u.ac.jp…
1799 runs - 0 likes - 12 downloads - 12 reach - 0 impact
336 instances - 8 features - 8 classes - 0 missing values
NAME: Sonar, Mines vs. Rocks SUMMARY: This is the data set used by Gorman and Sejnowski in their study of the classification of sonar signals using a neural network [1]. The task is to train a network…
2366 runs - 1 likes - 22 downloads - 23 reach - 0 impact
208 instances - 61 features - 2 classes - 0 missing values
1. Title: Glass Identification Database 2. Sources: (a) Creator: B. German -- Central Research Establishment Home Office Forensic Science Service Aldermaston, Reading, Berkshire RG7 4PN (b) Donor:…
1772 runs - 0 likes - 49 downloads - 49 reach - 0 impact
214 instances - 10 features - 6 classes - 0 missing values
1. Title: Haberman's Survival Data 2. Sources: (a) Donor: Tjen-Sien Lim (limt@stat.wisc.edu) (b) Date: March 4, 1999 3. Past Usage: 1. Haberman, S. J. (1976). Generalized Residuals for Log-Linear…
3241 runs - 1 likes - 18 downloads - 19 reach - 0 impact
306 instances - 4 features - 2 classes - 0 missing values
SPAM E-mail Database The "spam" concept is diverse: advertisements for products/websites, make money fast schemes, chain letters, pornography... Our collection of spam e-mails came from our postmaster…
152524 runs - 3 likes - 72 downloads - 75 reach - 0 impact
4601 instances - 58 features - 2 classes - 0 missing values
Primate splice-junction gene sequences (DNA) with associated imperfect domain theory. Splice junctions are points on a DNA sequence at which 'superfluous' DNA is removed during the process of protein…
15852 runs - 0 likes - 14 downloads - 14 reach - 0 impact
3190 instances - 62 features - 3 classes - 0 missing values
1. Title: Teaching Assistant Evaluation 2. Sources: (a) Collector: Wei-Yin Loh (Department of Statistics, UW-Madison) (b) Donor: Tjen-Sien Lim (limt@stat.wisc.edu) (c) Date: June 7, 1997 3. Past…
2028 runs - 0 likes - 12 downloads - 12 reach - 0 impact
151 instances - 6 features - 3 classes - 0 missing values
This database encodes the complete set of possible board configurations at the end of tic-tac-toe games, where "x" is assumed to have played first. The target concept is "win for x" (i.e., true when…
375870 runs - 1 likes - 42 downloads - 43 reach - 0 impact
958 instances - 10 features - 2 classes - 0 missing values
1. Title: Nursery Database 2. Sources: (a) Creator: Vladislav Rajkovic et al. (13 experts) (b) Donors: Marko Bohanec (marko.bohanec@ijs.si) Blaz Zupan (blaz.zupan@ijs.si) (c) Date: June, 1997 3. Past…
2210 runs - 0 likes - 15 downloads - 15 reach - 0 impact
12960 instances - 9 features - 5 classes - 0 missing values
1. Title of Database: Optical Recognition of Handwritten Digits 2. Source: E. Alpaydin, C. Kaynak Department of Computer Engineering Bogazici University, 80815 Istanbul Turkey alpaydin@boun.edu.tr…
26796 runs - 1 likes - 18 downloads - 19 reach - 0 impact
5620 instances - 65 features - 10 classes - 0 missing values
1. Title of Database: Blocks Classification 2. Sources: (a) Donato Malerba Dipartimento di Informatica University of Bari via Orabona 4 70126 Bari - Italy phone: +39 - 80 - 5443269 fax: +39 - 80 -…
2719 runs - 0 likes - 17 downloads - 17 reach - 0 impact
5473 instances - 11 features - 5 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26776 runs - 0 likes - 21 downloads - 21 reach - 0 impact
2000 instances - 48 features - 10 classes - 0 missing values
1. Title: Contraceptive Method Choice 2. Sources: (a) Origin: This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey (b) Creator: Tjen-Sien Lim (limt@stat.wisc.edu)…
18003 runs - 0 likes - 17 downloads - 17 reach - 0 impact
1473 instances - 10 features - 3 classes - 0 missing values
No data.
68 runs - 0 likes - 2 downloads - 2 reach - 0 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
69 runs - 0 likes - 4 downloads - 4 reach - 0 impact
1000000 instances - 20 features - 2 classes - 0 missing values
No data.
356 runs - 0 likes - 7 downloads - 7 reach - 0 impact
131072 instances - 17 features - 2 classes - 0 missing values
No data.
65 runs - 0 likes - 5 downloads - 5 reach - 0 impact
1000000 instances - 30 features - 4 classes - 0 missing values
No data.
230 runs - 0 likes - 4 downloads - 4 reach - 0 impact
1000000 instances - 35 features - 2 classes - 0 missing values
No data.
63 runs - 0 likes - 2 downloads - 2 reach - 0 impact
1000000 instances - 41 features - 3 classes - 0 missing values
No data.
65 runs - 1 likes - 2 downloads - 3 reach - 0 impact
1000000 instances - 18 features - 7 classes - 0 missing values
Dataset created to study concept drift in stream mining. It is constructed by combining the Covertype, Poker-Hand, and Electricity datasets. More details can be found in: Albert Bifet, Geoff Holmes,…
332 runs - 0 likes - 26 downloads - 26 reach - 0 impact
1455525 instances - 73 features - 10 classes - 0 missing values
Normalized version of the Forest Covertype dataset (see version 1), so that the numerical values are between 0 and 1. Contains the forest cover type for 30 x 30 meter cells obtained from US Forest…
319 runs - 1 likes - 39 downloads - 40 reach - 0 impact
581012 instances - 55 features - 7 classes - 0 missing values
No data.
73 runs - 0 likes - 5 downloads - 5 reach - 0 impact
1000000 instances - 30 features - 2 classes - 0 missing values
No data.
50 runs - 0 likes - 3 downloads - 3 reach - 0 impact
1000000 instances - 61 features - 2 classes - 0 missing values
No data.
90 runs - 0 likes - 3 downloads - 3 reach - 0 impact
137781 instances - 10 features - 7 classes - 0 missing values
No data.
314 runs - 1 likes - 8 downloads - 9 reach - 0 impact
1000000 instances - 36 features - 19 classes - 0 missing values
No data.
219 runs - 0 likes - 4 downloads - 4 reach - 0 impact
1000000 instances - 58 features - 2 classes - 0 missing values
No data.
66 runs - 0 likes - 2 downloads - 2 reach - 0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
1457 runs - 0 likes - 12 downloads - 12 reach - 0 impact
39366 instances - 10 features - 2 classes - 0 missing values
No data.
66 runs - 0 likes - 2 downloads - 2 reach - 0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
334 runs - 0 likes - 4 downloads - 4 reach - 0 impact
1000000 instances - 33 features - 2 classes - 0 missing values
No data.
70 runs - 0 likes - 2 downloads - 2 reach - 0 impact
1000000 instances - 14 features - 2 classes - 0 missing values
No data.
2193 runs - 0 likes - 15 downloads - 15 reach - 0 impact
1484 instances - 9 features - 10 classes - 0 missing values
The database consists of the multi-spectral values of pixels in 3x3 neighbourhoods in a satellite image, and the classification associated with the central pixel in each neighbourhood. The aim is to…
21038 runs - 2 likes - 23 downloads - 25 reach - 0 impact
6430 instances - 37 features - 6 classes - 0 missing values
1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and Fisheries, Tasmania…
34900 runs - 0 likes - 17 downloads - 17 reach - 0 impact
4177 instances - 9 features - 28 classes - 0 missing values
No data.
1777 runs - 0 likes - 15 downloads - 15 reach - 0 impact
28056 instances - 7 features - 18 classes - 0 missing values
1. Title of Database: Wine recognition data Updated Sept 21, 1998 by C.Blake : Added attribute information 2. Sources: (a) Forina, M. et al, PARVUS - An Extendible Package for Data Exploration,…
1180 runs - 1 likes - 14 downloads - 15 reach - 0 impact
178 instances - 14 features - 3 classes - 0 missing values
This data set is concerned with the forward kinematics of an 8 link robot arm. Among the existing variants of this data set we have used the variant 8nm, which is known to be highly non-linear and…
19 runs - 0 likes - 7 downloads - 7 reach - 0 impact
8192 instances - 9 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
4 runs - 0 likes - 1 downloads - 1 reach - 0 impact
61 instances - 3 features - 0 classes - 0 missing values
1. Title: Wisconsin Prognostic Breast Cancer (WPBC) 2. Source Information a) Creators: Dr. William H. Wolberg, General Surgery Dept., University of Wisconsin, Clinical Sciences Center, Madison, WI…
5 runs - 0 likes - 4 downloads - 4 reach - 0 impact
194 instances - 33 features - 0 classes - 0 missing values
This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics; (b) its assigned insurance risk rating; (c) its normalized losses in use as…
6 runs - 1 likes - 4 downloads - 5 reach - 0 impact
159 instances - 16 features - 0 classes - 0 missing values
The Computer Activity databases are a collection of computer systems activity measures. The data was collected from a Sun Sparcstation 20/712 with 128 Mbytes of memory running in a multi-user…
2 runs - 1 likes - 1 downloads - 2 reach - 0 impact
8192 instances - 22 features - 0 classes - 0 missing values
This data set is also obtained from the task of controlling the ailerons of an F16 aircraft, although the target variable and attributes are different from the ailerons domain. The target variable here…
2 runs - 0 likes - 3 downloads - 3 reach - 0 impact
9517 instances - 7 features - 0 classes - 0 missing values
Identifier attribute deleted. NAME: Sexual activity and the lifespan of male fruitflies TYPE: Designed (almost factorial)…
4 runs - 0 likes - 1 downloads - 1 reach - 0 impact
125 instances - 5 features - 0 classes - 0 missing values
No data.
328 runs - 0 likes - 3 downloads - 3 reach - 0 impact
1000000 instances - 4 features - 2 classes - 0 missing values
Compilation of promoters with known transcriptional start points for E. coli genes. The task is to recognize promoters in strings that represent nucleotides (one of A, G, T, or C). A promoter is a…
138 runs - 1 likes - 9 downloads - 10 reach - 0 impact
106 instances - 59 features - 2 classes - 0 missing values
The dataset (originally named ELEC2) contains 45,312 instances dated from 7 May 1996 to 5 December 1998. Each example of the dataset refers to a period of 30 minutes, i.e. there are 48 instances for…
101625 runs - 3 likes - 24 downloads - 27 reach - 0 impact
45312 instances - 9 features - 2 classes - 0 missing values
No data.
310 runs - 0 likes - 4 downloads - 4 reach - 0 impact
1000000 instances - 11 features - 2 classes - 0 missing values
Synthetic dataset. Almost identical to [dataset 152](https://www.openml.org/d/153/edit)
319 runs - 0 likes - 4 downloads - 4 reach - 0 impact
1000000 instances - 11 features - 2 classes - 0 missing values
No data.
304 runs - 0 likes - 6 downloads - 6 reach - 0 impact
1000000 instances - 25 features - 10 classes - 0 missing values
Normalized version of the pokerhand data set. Automated file upload of pokerhand-normalized.arff
314 runs - 0 likes - 10 downloads - 10 reach - 0 impact
829201 instances - 11 features - 10 classes - 0 missing values
No data.
298 runs - 0 likes - 3 downloads - 3 reach - 0 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
305 runs - 0 likes - 2 downloads - 2 reach - 0 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
308 runs - 0 likes - 2 downloads - 2 reach - 0 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
307 runs - 0 likes - 2 downloads - 2 reach - 0 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
309 runs - 0 likes - 3 downloads - 3 reach - 0 impact
1000000 instances - 11 features - 5 classes - 0 missing values
This is a commercial application described in Weiss & Indurkhya (1995). The data describes a telecommunication problem. No further information is available. Characteristics: (10000+5000) cases, 49…
2 runs - 0 likes - 3 downloads - 3 reach - 0 impact
15000 instances - 49 features - 0 classes - 0 missing values
Identification code deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
4 runs - 0 likes - 0 downloads - 0 reach - 0 impact
189 instances - 10 features - 0 classes - 0 missing values
The problem is to learn a regression equation/rule/tree to predict the activity from the descriptive structural attributes. The data and methodology are described in detail in: - King, Ross D., Hurst,…
2 runs - 0 likes - 1 downloads - 1 reach - 0 impact
186 instances - 61 features - 0 classes - 0 missing values
All nominal attributes and instances with missing values are deleted. Price treated as the class attribute. As used by…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
159 instances - 16 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) This is the data set called 'DETROIT' in the book 'Subset selection in regression' by Alan J. Miller published in the Chapman & Hall series of monographs…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13 instances - 14 features - 0 classes - 0 missing values