Data
This dataset classifies people described by a set of attributes as good or bad credit risks. This dataset comes with a cost matrix: ``` Good Bad (predicted) Good 0 1 (actual) Bad 5 0 ``` It is worse…
481255 runs2 likes50 downloads52 reach17 impact
1000 instances - 21 features - 2 classes - 0 missing values
Data taken from the Blood Transfusion Service Center in Hsin-Chu City in Taiwan -- this is a classification problem. To demonstrate the RFMTC marketing model (a modified version of RFM), this study…
442907 runs1 likes23 downloads24 reach14 impact
748 instances - 5 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
378699 runs0 likes13 downloads13 reach18 impact
601 instances - 7 features - 2 classes - 0 missing values
This database encodes the complete set of possible board configurations at the end of tic-tac-toe games, where "x" is assumed to have played first. The target concept is "win for x" (i.e., true when…
356197 runs0 likes27 downloads27 reach10 impact
958 instances - 10 features - 2 classes - 0 missing values
### Dataset: Wilt Data Set ### Abstract: High-resolution Remote Sensing data set (Quickbird). Small number of training samples of diseased trees, large number for other land cover. Testing data set…
345136 runs1 likes33 downloads34 reach15 impact
4839 instances - 6 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
326332 runs0 likes13 downloads13 reach18 impact
556 instances - 7 features - 2 classes - 0 missing values
QSAR biodegradation Data Set * Abstract: Data set containing values for 41 attributes (molecular descriptors) used to classify 1055 chemicals into 2 classes (ready and not ready biodegradable). *…
243629 runs1 likes13 downloads14 reach14 impact
1055 instances - 42 features - 2 classes - 0 missing values
1. Title: Chess End-Game -- King+Rook versus King+Pawn on a7 (usually abbreviated KRKPA7). The pawn on a7 means it is one square away from queening. It is the King+Rook's side (white) to move. 2.…
237344 runs0 likes26 downloads26 reach12 impact
3196 instances - 37 features - 2 classes - 0 missing values
A dataset of steel plates' faults, classified into 7 different types. The goal was to train machine learning for automatic pattern recognition. The dataset consists of 27 features describing each…
234332 runs1 likes24 downloads25 reach13 impact
1941 instances - 34 features - 2 classes - 0 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code ID's were removed. ! Note that there is also a related Breast Cancer Wisconsin (Original) Data Set with a different set of…
204569 runs1 likes30 downloads31 reach14 impact
569 instances - 31 features - 2 classes - 0 missing values
This dataset was retrieved 2014-11-14 from the libSVM site. It was normalized to [-1,1] and converted to the ARFF format. ### Feature information There are 6 numerical and 8 categorical attributes,…
188743 runs0 likes12 downloads12 reach4 impact
690 instances - 15 features - 2 classes - 0 missing values
The aim of this dataset is to distinguish between nasal (class 0) and oral sounds (class 1). Five different attributes were chosen to characterize each vowel: they are the amplitudes of the five first…
185207 runs3 likes18 downloads21 reach13 impact
5404 instances - 6 features - 2 classes - 0 missing values
1. Title: Pima Indians Diabetes Database 2. Sources: (a) Original owners: National Institute of Diabetes and Digestive and Kidney Diseases (b) Donor of database: Vincent Sigillito…
179735 runs3 likes63 downloads66 reach16 impact
768 instances - 9 features - 2 classes - 0 missing values
Each record represents 100 points on a two-dimensional graph. When plotted in order (from 1 through 100) as the Y coordinate, the points will create either a Hill (a “bump” in the terrain) or a…
166728 runs0 likes15 downloads15 reach14 impact
1212 instances - 101 features - 2 classes - 0 missing values
Forecasting skewed biased stochastic ozone days: analyses, solutions and beyond, Knowledge and Information Systems, Vol. 14, No. 3, 2008. 1 . Abstract: Two ground ozone level data sets are included in…
154774 runs0 likes11 downloads11 reach14 impact
2534 instances - 73 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from software for science data processing. Data comes from McCabe and Halstead features extractors of source code. These features were…
150389 runs0 likes16 downloads16 reach14 impact
522 instances - 22 features - 2 classes - 0 missing values
All data is from one continuous EEG measurement with the Emotiv EEG Neuroheadset. The duration of the measurement was 117 seconds. The eye state was detected via a camera during the EEG measurement…
142305 runs2 likes79 downloads81 reach15 impact
14980 instances - 15 features - 2 classes - 0 missing values
Lucas, D. D., Klein, R., Tannahill, J., Ivanova, D., Brandon, S., Domyancic, D., and Zhang, Y.: Failure analysis of parameter-induced simulation crashes in climate models, Geosci. Model Dev. Discuss.,…
142050 runs0 likes14 downloads14 reach14 impact
540 instances - 21 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from software for storage management for receiving and processing ground data. Data comes from McCabe and Halstead features extractors of…
139267 runs0 likes20 downloads20 reach16 impact
2109 instances - 22 features - 2 classes - 0 missing values
### Description One-hundred plant species leaves dataset (Class = Texture). ### Sources ``` (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The…
132847 runs2 likes55 downloads57 reach406 impact
1599 instances - 65 features - 100 classes - 0 missing values
This data set contains 416 liver patient records and 167 non liver patient records.The data set was collected from north east of Andhra Pradesh, India. The class label divides the patients into 2…
132839 runs0 likes14 downloads14 reach14 impact
583 instances - 11 features - 2 classes - 0 missing values
### Description One-hundred plant species leaves dataset (Class = Shape). ### Sources ``` (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The…
132687 runs1 likes31 downloads32 reach405 impact
1600 instances - 65 features - 100 classes - 0 missing values
### Description One-hundred plant species leaves dataset (Class = Margin). ### Sources ``` (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The…
132626 runs1 likes13 downloads14 reach405 impact
1600 instances - 65 features - 100 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from flight software for earth orbiting satellite. Data comes from McCabe and Halstead features extractors of source code. These features…
123323 runs0 likes23 downloads23 reach15 impact
1109 instances - 22 features - 2 classes - 0 missing values
SPAM E-mail Database The "spam" concept is diverse: advertisements for products/websites, make money fast schemes, chain letters, pornography... Our collection of spam e-mails came from our postmaster…
121463 runs3 likes65 downloads68 reach10 impact
4601 instances - 58 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from flight software for earth orbiting satellite. Data comes from McCabe and Halstead features extractors of source code. These features…
117882 runs0 likes15 downloads15 reach14 impact
1563 instances - 38 features - 2 classes - 0 missing values
Author: Volker Lohweg (University of Applied Sciences, Ostwestfalen-Lippe) Source: [UCI](https://archive.ics.uci.edu/ml/datasets/banknote+authentication) - 2012 Please cite:…
101876 runs1 likes14 downloads15 reach13 impact
1372 instances - 5 features - 2 classes - 0 missing values
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
92437 runs0 likes16 downloads16 reach16 impact
15545 instances - 6 features - 2 classes - 0 missing values
Dataset from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch), which consisted of 5 different datasets (SYLVA, GINA, NOVA, HIVA, ADA). The purpose of the challenge…
91363 runs0 likes14 downloads14 reach16 impact
4562 instances - 49 features - 2 classes - 0 missing values
The dataset (originally named ELEC2) contains 45,312 instances dated from 7 May 1996 to 5 December 1998. Each example of the dataset refers to a period of 30 minutes, i.e. there are 48 instances for…
91189 runs1 likes22 downloads23 reach9 impact
45312 instances - 9 features - 2 classes - 0 missing values
### Description This dataset represents a set of possible advertisements on Internet pages ### Sources (a) Creator and donor: Nicholas Kushmerick - nick@ucd.ie ### Dataset Information The features…
83164 runs2 likes27 downloads29 reach17 impact
3279 instances - 1559 features - 2 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
82512 runs1 likes15 downloads16 reach16 impact
6598 instances - 170 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
79012 runs0 likes13 downloads13 reach17 impact
554 instances - 7 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from flight software for earth orbiting satellite. Data comes from McCabe and Halstead features extractors of source code. These features…
77105 runs0 likes14 downloads14 reach14 impact
1458 instances - 38 features - 2 classes - 0 missing values
#### Abstract: MADELON is an artificial dataset, which was part of the NIPS 2003 feature selection challenge. This is a two-class classification problem with continuous input variables. The difficulty…
76334 runs0 likes15 downloads15 reach14 impact
2600 instances - 501 features - 2 classes - 0 missing values
### Description Scene recognition dataset - It contains characteristics about images and their classes. The original dataset is a multi-label classification problem with 6 different labels: {Beach,…
68095 runs0 likes18 downloads18 reach14 impact
2407 instances - 300 features - 2 classes - 0 missing values
Dataset creator and donator: Zhi Liu, e-mail: liuzhi8673 '@' gmail.com, institution: National Engineering Research Center for E-Learning, Hubei Wuhan, China Data Set Information: dataset are derived…
65163 runs2 likes36 downloads38 reach204 impact
1500 instances - 10001 features - 50 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 0.1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
56563 runs0 likes11 downloads11 reach14 impact
39948 instances - 12 features - 2 classes - 0 missing values
Dataset from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch), which consisted of 5 different datasets (SYLVA, GINA, NOVA, HIVA, ADA). The purpose of the challenge…
52904 runs0 likes18 downloads18 reach16 impact
3468 instances - 971 features - 2 classes - 0 missing values
1. TITLE: Letter Image Recognition Data The objective is to identify each of a large number of black-and-white rectangular pixel displays as one of the 26 capital letters in the English alphabet. The…
51959 runs1 likes61 downloads62 reach10 impact
20000 instances - 17 features - 26 classes - 0 missing values
Dataset from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch), which consisted of 5 different datasets (SYLVA, GINA, NOVA, HIVA, ADA). The purpose of the challenge…
51936 runs0 likes16 downloads16 reach15 impact
14395 instances - 217 features - 2 classes - 0 missing values
Donated by P. Savicky, Institute of Computer Science, AS of CR, Czech Republic The data are MC generated (see below) to simulate registration of high energy gamma particles in a ground-based…
49188 runs1 likes23 downloads24 reach15 impact
19020 instances - 12 features - 2 classes - 0 missing values
Available at: [pdf] http://hdl.handle.net/1822/14838 [bib] http://www3.dsi.uminho.pt/pcortez/bib/2011-esm-1.txt 1. Title: Bank Marketing 2. Sources Created by: Paulo Cortez (Univ. Minho) and Sérgio…
44329 runs0 likes19 downloads19 reach15 impact
45211 instances - 17 features - 2 classes - 0 missing values
1. Data set title: Nomao Data Set 2. Abstract: Nomao collects data about places (name, phone, localization...) from many sources. Deduplication consists in detecting what data refer to the same place.…
37552 runs0 likes11 downloads11 reach15 impact
34465 instances - 119 features - 2 classes - 0 missing values
### Description ISOLET (Isolated Letter Speech Recognition) dataset was generated as follows: 150 subjects spoke the name of each letter of the alphabet twice. Hence, there are 52 training examples…
36685 runs0 likes67 downloads67 reach114 impact
7797 instances - 618 features - 26 classes - 0 missing values
1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and Fisheries, Tasmania…
34835 runs0 likes16 downloads16 reach9 impact
4177 instances - 9 features - 29 classes - 0 missing values
Source: Rami Mustafa A Mohammad ( University of Huddersfield, rami.mohammad '@' hud.ac.uk, rami.mustafa.a '@' gmail.com) Lee McCluskey (University of Huddersfield,t.l.mccluskey '@' hud.ac.uk ) Fadi…
30824 runs0 likes10 downloads10 reach14 impact
11055 instances - 31 features - 2 classes - 0 missing values
### Description Large Soybean Database - This is the large soybean database from the UCI repository, with its training and test database combined into a single file. ### Sources (a) Origin R.S.…
30600 runs0 likes50 downloads50 reach11 impact
683 instances - 36 features - 19 classes - 2337 missing values
### Description MicroMass (pure spectra version) is a dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data. ### Source ``` Pierre Mahé,…
30038 runs1 likes11 downloads12 reach85 impact
571 instances - 1301 features - 20 classes - 0 missing values
### Description Tamilnadu Electricity Board Hourly Readings dataset. ### Source ``` K.Kalyani ,kkalyanims '@' gmail.com, T.U.K Arts College,Karanthai,Thanjavur. ``` ### Data Set Information Real-time…
28533 runs0 likes23 downloads23 reach86 impact
45781 instances - 4 features - 20 classes - 0 missing values
Predict a biological response of molecules from their chemical properties. Each row in this data set represents a molecule. The first column contains experimental data describing an actual biological…
25176 runs0 likes29 downloads29 reach16 impact
3751 instances - 1777 features - 2 classes - 0 missing values
This data was gathered from participants in experimental speed dating events from 2002-2004. During the events, the attendees would have a four-minute "first date" with every other participant of the…
18827 runs13 likes131 downloads144 reach18 impact
8378 instances - 123 features - 2 classes - 18372 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
18644 runs0 likes16 downloads16 reach11 impact
2000 instances - 7 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
18374 runs0 likes21 downloads21 reach12 impact
2000 instances - 48 features - 10 classes - 0 missing values
1. Title of Database: Optical Recognition of Handwritten Digits 2. Source: E. Alpaydin, C. Kaynak Department of Computer Engineering Bogazici University, 80815 Istanbul Turkey alpaydin@boun.edu.tr…
18320 runs1 likes18 downloads19 reach10 impact
5620 instances - 65 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. The maps were scanned in 8 bit grey value at density of 400dpi,…
18179 runs0 likes17 downloads17 reach11 impact
2000 instances - 241 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
18158 runs0 likes17 downloads17 reach11 impact
2000 instances - 217 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
18083 runs0 likes18 downloads18 reach11 impact
2000 instances - 65 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
18067 runs0 likes10 downloads10 reach11 impact
2000 instances - 77 features - 10 classes - 0 missing values
### Description The data consists of real historical data collected from 2010 & 2011. Employees are manually allowed or denied access to resources over time. The data is used to create an algorithm…
17804 runs0 likes11 downloads11 reach14 impact
32769 instances - 10 features - 2 classes - 0 missing values
1. Title of Database: Pen-Based Recognition of Handwritten Digits 2. Source: E. Alpaydin, F. Alimoglu Department of Computer Engineering Bogazici University, 80815 Istanbul Turkey alpaydin@boun.edu.tr…
17679 runs0 likes17 downloads17 reach10 impact
10992 instances - 17 features - 10 classes - 0 missing values
This database has been artificially generated. It describes the structure of the capital letters A, C, D, E, F, G, H, L, P, R, indicated by a number 1-10, in that order (A=1,C=2,...). Each letter's…
15893 runs0 likes9 downloads9 reach46 impact
10218 instances - 8 features - 10 classes - 0 missing values
2126 fetal cardiotocograms (CTGs) were automatically processed and the respective diagnostic features measured. The CTGs were also classified by three expert obstetricians and a consensus…
15886 runs1 likes22 downloads23 reach45 impact
2126 instances - 36 features - 10 classes - 0 missing values
Tattile Via Gaetano Donizetti, 1-3-5,25030 Mairano (Brescia), Italy. ### Dataset Description Semeion Handwritten Digit Data Set, where 1593 handwritten digits from around 80 persons were scanned and…
15696 runs0 likes19 downloads19 reach46 impact
1593 instances - 257 features - 10 classes - 0 missing values
The instances were drawn randomly from a database of 7 outdoor images. The images were hand-segmented to create a classification for every pixel. Each instance is a 3x3 region. ### Attribute…
14741 runs0 likes22 downloads22 reach11 impact
2310 instances - 20 features - 7 classes - 0 missing values
This dataset records 640 time series of 12 LPC cepstrum coefficients taken from nine male speakers. The data was collected for examining our newly developed classifier for multidimensional curves…
14729 runs0 likes9 downloads9 reach43 impact
9961 instances - 15 features - 9 classes - 0 missing values
### Description This is a data set containing 1080 documents of free text business descriptions of Brazilian companies categorized into a subset of 9 categories. ### Source ``` Patrick Marques…
14293 runs0 likes13 downloads13 reach41 impact
1080 instances - 857 features - 9 classes - 0 missing values
The original Annealing dataset from UCI. The exact meaning of the features and classes is largely unknown. Annealing, in metallurgy and materials science, is a heat treatment that alters the physical…
13687 runs0 likes15 downloads15 reach12 impact
898 instances - 39 features - 5 classes - 22175 missing values
One of the datasets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff. It contains data on the DMFT Index (Decayed, Missing, and Filled Teeth) before and after different prevention…
13135 runs0 likes11 downloads11 reach31 impact
797 instances - 5 features - 6 classes - 0 missing values
The objective was to determine which seedlots in a species are best for soil conservation in seasonally dry hill country. Determination is found by measurement of height, diameter by height, survival,…
12913 runs0 likes9 downloads9 reach6 impact
736 instances - 20 features - 5 classes - 448 missing values
The database consists of the multi-spectral values of pixels in 3x3 neighbourhoods in a satellite image, and the classification associated with the central pixel in each neighbourhood. The aim is to…
12451 runs1 likes23 downloads24 reach8 impact
6430 instances - 37 features - 6 classes - 0 missing values
### NAME Vowel Recognition (Deterding data) ### SUMMARY Speaker independent recognition of the eleven steady state vowels of British English using a specified training set of lpc derived log area…
12389 runs0 likes12 downloads12 reach32 impact
990 instances - 13 features - 11 classes - 0 missing values
NAME vehicle silhouettes PURPOSE to classify a given silhouette as one of four types of vehicle, using a set of features extracted from the silhouette. The vehicle may be viewed from one of many…
12326 runs1 likes22 downloads23 reach10 impact
846 instances - 19 features - 4 classes - 0 missing values
### Description Synthetic Control Chart Time Series ### Sources ``` * Original Owner and Donor Dr Robert Alcock rob@skyblue.csd.auth.gr ``` ### Dataset Information This data consists of synthetically…
12302 runs0 likes9 downloads9 reach32 impact
600 instances - 62 features - 6 classes - 0 missing values
This data set was generated to model psychological experimental results. Each example is classified as having the balance scale tip to the right, tip to the left, or be balanced. The attributes are…
10999 runs0 likes13 downloads13 reach11 impact
625 instances - 5 features - 3 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-250-drift-au6-cd1-500 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and…
10949 runs0 likes9 downloads9 reach36 impact
750 instances - 41 features - 8 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-1000 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of…
10948 runs0 likes15 downloads15 reach37 impact
1000 instances - 41 features - 8 classes - 0 missing values
This database was derived from a simple hierarchical decision model originally developed for the demonstration of DEX (M. Bohanec, V. Rajkovic: Expert system for decision making. Sistemica 1(1), pp.…
10919 runs1 likes25 downloads26 reach11 impact
1728 instances - 7 features - 4 classes - 0 missing values
This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect the confidentiality of the data. This dataset is interesting because…
10472 runs0 likes23 downloads23 reach12 impact
690 instances - 16 features - 2 classes - 67 missing values
Data on tree growth used in the Case Study published in the September, 1995 issue of the Canadian Journal of Statistics. This data set was been provided by Dr. Fernando Camacho, Ontario Hydro…
10407 runs1 likes14 downloads15 reach29 impact
2796 instances - 35 features - 6 classes - 68100 missing values
Source: James P Bridge, Sean B Holden and Lawrence C Paulson University of Cambridge Computer Laboratory William Gates Building 15 JJ Thomson Avenue Cambridge CB3 0FD UK +44 (0)1223 763500…
10391 runs0 likes20 downloads20 reach30 impact
6118 instances - 52 features - 6 classes - 0 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code ID's were removed. ! Note that there is also a related Breast Cancer Wisconsin (Diagnosis) Data Set with a different set of…
10283 runs1 likes12 downloads13 reach10 impact
699 instances - 10 features - 2 classes - 16 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
10250 runs0 likes7 downloads7 reach23 impact
841 instances - 71 features - 4 classes - 0 missing values
### Description Gas Sensor Array Drift Dataset Data Set ### Sources ``` (a) Creators: Alexander Vergara (vergara '@' ucsd.edu) BioCircutis Institute University of California San Diego San Diego,…
10086 runs0 likes17 downloads17 reach31 impact
13910 instances - 129 features - 6 classes - 0 missing values
### Description Human Activity Recognition (HAR) database built from the recordings of 30 subjects performing activities of daily living (ADL) while carrying a waist-mounted smartphone with embedded…
9970 runs0 likes22 downloads22 reach30 impact
10299 instances - 562 features - 6 classes - 0 missing values
1. Title: Contraceptive Method Choice 2. Sources: (a) Origin: This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey (b) Creator: Tjen-Sien Lim (limt@stat.wisc.edu)…
9448 runs0 likes17 downloads17 reach9 impact
1473 instances - 10 features - 3 classes - 0 missing values
eating
9409 runs0 likes14 downloads14 reach34 impact
945 instances - 6374 features - 7 classes - 0 missing values
Generator generating 3 classes of waves. Each class is generated from a combination of 2 of 3 "base" waves. For details, see Breiman,L., Friedman,J.H., Olshen,R.A., and Stone,C.J. (1984).…
9035 runs1 likes53 downloads54 reach10 impact
5000 instances - 41 features - 3 classes - 0 missing values
Creators: Renata Cristina Barros Madeo (Madeo, R. C. B.) Priscilla Koch Wagner (Wagner, P. K.) Sarajane Marques Peres (Peres, S. M.) {renata.si, priscilla.wagner, sarajane} at usp.br…
8940 runs1 likes14 downloads15 reach25 impact
9873 instances - 33 features - 5 classes - 0 missing values
### Description This dataset describes mushrooms in terms of their physical characteristics. They are classified into: poisonous or edible. ### Source ``` (a) Origin: Mushroom records are drawn from…
8348 runs0 likes31 downloads31 reach12 impact
8124 instances - 23 features - 2 classes - 2480 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
7850 runs0 likes18 downloads18 reach15 impact
672 instances - 10 features - 2 classes - 1200 missing values
Data on educational transitions for a sample of 500 Irish schoolchildren aged 11 in 1967. The data were collected by Greaney and Kelleghan (1984), and reanalyzed by Raftery and Hout (1985, 1993). ###…
7848 runs0 likes13 downloads13 reach15 impact
500 instances - 6 features - 2 classes - 32 missing values
The data were collected as the SCITOS G5 robot navigates through the room following the wall in a clockwise direction, for 4 rounds, using 24 ultrasound sensors arranged circularly around its 'waist'.…
7801 runs0 likes19 downloads19 reach21 impact
5456 instances - 25 features - 4 classes - 0 missing values
Primate splice-junction gene sequences (DNA) with associated imperfect domain theory. Splice junctions are points on a DNA sequence at which 'superfluous' DNA is removed during the process of protein…
7189 runs0 likes14 downloads14 reach9 impact
3190 instances - 62 features - 3 classes - 0 missing values
### Attribute Information * The first column is the class label (1 for signal, 0 for background) * 21 low-level features (kinematic properties): lepton pT, lepton eta, lepton phi, missing energy…
7089 runs0 likes5 downloads5 reach7 impact
98050 instances - 29 features - 2 classes - 9 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-cpd1-500 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity…
7083 runs0 likes7 downloads7 reach24 impact
500 instances - 13 features - 5 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-300-drift-au7-cpd1-800 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and…
7067 runs0 likes10 downloads10 reach25 impact
1100 instances - 13 features - 5 classes - 0 missing values
This is perhaps the best known database to be found in the pattern recognition literature. Fisher's paper is a classic in the field and is referenced frequently to this day. (See Duda & Hart, for…
7022 runs5 likes71 downloads76 reach22 impact
150 instances - 5 features - 3 classes - 0 missing values
### Description Cylinder bands UCI dataset - Process delays known as cylinder banding in rotogravure printing were substantially mitigated using control rules discovered by decision tree induction.…
6960 runs0 likes7 downloads7 reach14 impact
540 instances - 40 features - 2 classes - 999 missing values
The MNIST database of handwritten digits with 784 features, raw data available at: http://yann.lecun.com/exdb/mnist/. It can be split in a training set of the first 60,000 examples, and a test set of…
6847 runs2 likes47 downloads49 reach9 impact
70000 instances - 785 features - 10 classes - 0 missing values