Data
This data was gathered from participants in experimental speed dating events from 2002-2004. During the events, the attendees would have a four-minute "first date" with every other participant of the…
24488 runs - 16 likes - 140 downloads - 156 reach - 20 impact
8378 instances - 123 features - 2 classes - 18372 missing values
This dataset classifies people described by a set of attributes as good or bad credit risks. This dataset comes with a cost matrix:
```
      Good  Bad  (predicted)
Good     0    1  (actual)
Bad      5    0
```
It is worse…
497857 runs - 8 likes - 112 downloads - 120 reach - 1 impact
1000 instances - 21 features - 2 classes - 0 missing values
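The cost matrix in the credit-risk description above can be applied to a set of predictions as sketched here. The function name `total_cost` and the sample labels are illustrative assumptions and are not part of the dataset.

```python
# Cost matrix from the description: keys are (actual, predicted) class pairs.
COST = {
    ("good", "good"): 0,
    ("good", "bad"): 1,   # actual good, predicted bad
    ("bad", "good"): 5,   # actual bad, predicted good
    ("bad", "bad"): 0,
}


def total_cost(actual, predicted):
    """Sum the misclassification cost over paired actual/predicted labels."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    return sum(COST[(a, p)] for a, p in zip(actual, predicted))


# Hypothetical labels, for illustration only.
actual = ["good", "good", "bad", "bad"]
predicted = ["good", "bad", "good", "bad"]
print(total_cost(actual, predicted))
```

Because the two kinds of error are weighted differently, a classifier with higher accuracy can still incur a higher total cost under this matrix.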
This is perhaps the best known database to be found in the pattern recognition literature. Fisher's paper is a classic in the field and is referenced frequently to this day. (See Duda & Hart, for…
8028 runs - 5 likes - 88 downloads - 93 reach - 0 impact
150 instances - 5 features - 3 classes - 0 missing values
All data is from one continuous EEG measurement with the Emotiv EEG Neuroheadset. The duration of the measurement was 117 seconds. The eye state was detected via a camera during the EEG measurement…
161218 runs - 2 likes - 84 downloads - 86 reach - 17 impact
14980 instances - 15 features - 2 classes - 0 missing values
SPAM E-mail Database The "spam" concept is diverse: advertisements for products/websites, make money fast schemes, chain letters, pornography... Our collection of spam e-mails came from our postmaster…
152524 runs - 3 likes - 72 downloads - 75 reach - 0 impact
4601 instances - 58 features - 2 classes - 0 missing values
1. Title: Pima Indians Diabetes Database 2. Sources: (a) Original owners: National Institute of Diabetes and Digestive and Kidney Diseases (b) Donor of database: Vincent Sigillito…
193417 runs - 3 likes - 68 downloads - 71 reach - 0 impact
768 instances - 9 features - 2 classes - 0 missing values
### Description ISOLET (Isolated Letter Speech Recognition) dataset was generated as follows: 150 subjects spoke the name of each letter of the alphabet twice. Hence, there are 52 training examples…
43283 runs - 0 likes - 67 downloads - 67 reach - 120 impact
7797 instances - 618 features - 26 classes - 0 missing values
1. TITLE: Letter Image Recognition Data The objective is to identify each of a large number of black-and-white rectangular pixel displays as one of the 26 capital letters in the English alphabet. The…
60471 runs - 1 likes - 66 downloads - 67 reach - 0 impact
20000 instances - 17 features - 26 classes - 0 missing values
The satellite dataset comprises features extracted from satellite observations. In particular, each image was taken under four different light wavelengths, two in visible light (green and red) and…
2074 runs - 2 likes - 62 downloads - 64 reach - 16 impact
5100 instances - 37 features - 2 classes - 0 missing values
### Description One-hundred plant species leaves dataset (Class = Texture). ### Sources ``` (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The…
139260 runs - 2 likes - 58 downloads - 60 reach - 408 impact
1599 instances - 65 features - 100 classes - 0 missing values
Generator generating 3 classes of waves. Each class is generated from a combination of 2 of 3 "base" waves. For details, see Breiman,L., Friedman,J.H., Olshen,R.A., and Stone,C.J. (1984).…
15586 runs - 1 likes - 53 downloads - 54 reach - 0 impact
5000 instances - 41 features - 3 classes - 0 missing values
The MNIST database of handwritten digits with 784 features, raw data available at: http://yann.lecun.com/exdb/mnist/. It can be split in a training set of the first 60,000 examples, and a test set of…
10369 runs - 2 likes - 53 downloads - 55 reach - 16 impact
70000 instances - 785 features - 10 classes - 0 missing values
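The positional split mentioned in the MNIST description can be sketched as below. The description is truncated, so treating every row after the first 60,000 as the test set is an assumption, and `split_first_n` is a hypothetical helper name.

```python
def split_first_n(rows, n_train=60_000):
    """Return (train, test): the first n_train rows and everything after them."""
    return rows[:n_train], rows[n_train:]


# Stand-in for the 70,000 rows listed above; real data would be loaded separately.
rows = list(range(70_000))
train, test = split_first_n(rows)
print(len(train), len(test))
```

A split by position only reproduces the intended partition if the rows are kept in their original order, so the data should not be shuffled before it is applied.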
The dataset contains transactions made by credit cards in September 2013 by European cardholders. This dataset presents transactions that occurred in two days, where we have 492 frauds out of 284,807…
355 runs - 0 likes - 52 downloads - 52 reach - 10 impact
284807 instances - 31 features - 2 classes - 0 missing values
GISETTE is a handwritten digit recognition problem. The problem is to separate the highly confusable digits '4' and '9'. This dataset is one of five datasets of the NIPS 2003 feature selection…
466 runs - 0 likes - 51 downloads - 51 reach - 13 impact
7000 instances - 5001 features - 2 classes - 0 missing values
This is the large soybean database from the UCI repository, with its training and test database combined into a single file. There are 19 classes, only the first 15 of which have been used in prior…
37282 runs - 0 likes - 50 downloads - 50 reach - 0 impact
683 instances - 36 features - 19 classes - 2337 missing values
1. Title: Glass Identification Database 2. Sources: (a) Creator: B. German -- Central Research Establishment Home Office Forensic Science Service Aldermaston, Reading, Berkshire RG7 4PN (b) Donor:…
1772 runs - 0 likes - 49 downloads - 49 reach - 0 impact
214 instances - 10 features - 6 classes - 0 missing values
The aim is to determine the type of arrhythmia from the ECG recordings. This database contains 279 attributes, 206 of which are linear valued and the rest are nominal. Concerning the study of H. Altay…
4427 runs - 0 likes - 46 downloads - 46 reach - 0 impact
452 instances - 280 features - 13 classes - 408 missing values
Mammography dataset Past Usage: 1. Woods, K., Doss, C., Bowyer, K., Solka, J., Priebe, C.,
215 runs - 4 likes - 45 downloads - 49 reach - 13 impact
11183 instances - 7 features - 2 classes - 0 missing values
1. Title: Part of the IRAS Low Resolution Spectrometer Database 2. Sources: (a) Originator: Infra-Red Astronomy Satellite Project Database (b) Donor: John Stutz (c) Date:…
1243 runs - 0 likes - 43 downloads - 43 reach - 6 impact
531 instances - 103 features - 48 classes - 0 missing values
Data taken from the Blood Transfusion Service Center in Hsin-Chu City in Taiwan -- this is a classification problem. To demonstrate the RFMTC marketing model (a modified version of RFM), this study…
457628 runs - 2 likes - 43 downloads - 45 reach - 21 impact
748 instances - 5 features - 2 classes - 0 missing values
This database encodes the complete set of possible board configurations at the end of tic-tac-toe games, where "x" is assumed to have played first. The target concept is "win for x" (i.e., true when…
375870 runs - 1 likes - 42 downloads - 43 reach - 0 impact
958 instances - 10 features - 2 classes - 0 missing values
Normalized version of the Forest Covertype dataset (see version 1), so that the numerical values are between 0 and 1. Contains the forest cover type for 30 x 30 meter cells obtained from US Forest…
319 runs - 1 likes - 39 downloads - 40 reach - 0 impact
581012 instances - 55 features - 7 classes - 0 missing values
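Rescaling a numerical column into the range 0 to 1, as the Covertype description above mentions, is commonly done with min-max scaling. The sketch below assumes that method; the listing does not say which normalization was actually used, and the sample values are hypothetical.

```python
def min_max_scale(values):
    """Linearly rescale a list of numbers to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column has no spread; map it to 0.0 to avoid dividing by zero.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical elevation values in meters, for illustration only.
elevation = [2000.0, 2500.0, 3000.0]
print(min_max_scale(elevation))
```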
Dataset creator and donator: Zhi Liu, e-mail: liuzhi8673 '@' gmail.com, institution: National Engineering Research Center for E-Learning, Hubei Wuhan, China Data Set Information: dataset are derived…
65168 runs - 2 likes - 37 downloads - 39 reach - 205 impact
1500 instances - 10001 features - 50 classes - 0 missing values
* Title: seismic-bumps Data Set * Abstract: The data describe the problem of high energy (higher than 10^4 J) seismic bumps forecasting in a coal mine. Data come from two longwalls located in a…
152 runs - 0 likes - 37 downloads - 37 reach - 4 impact
210 instances - 8 features - 3 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
313 runs - 0 likes - 34 downloads - 34 reach - 3 impact
399482 instances - 12 features - 2 classes - 0 missing values
### Description This dataset describes mushrooms in terms of their physical characteristics. They are classified into: poisonous or edible. ### Source ``` (a) Origin: Mushroom records are drawn from…
12820 runs - 1 likes - 34 downloads - 35 reach - 0 impact
8124 instances - 23 features - 2 classes - 2480 missing values
This is a 10% stratified subsample of the data from the 1999 ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php). Modified by TunedIT (converted to ARFF format)…
25 runs - 1 likes - 33 downloads - 34 reach - 5 impact
494020 instances - 42 features - 23 classes - 0 missing values
### Description One-hundred plant species leaves dataset (Class = Shape). ### Sources ``` (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The…
139377 runs - 1 likes - 32 downloads - 33 reach - 407 impact
1600 instances - 65 features - 100 classes - 0 missing values
Predict a biological response of molecules from their chemical properties. Each row in this data set represents a molecule. The first column contains experimental data describing an actual biological…
40845 runs - 1 likes - 32 downloads - 33 reach - 19 impact
3751 instances - 1777 features - 2 classes - 0 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code IDs were removed. Note that there is also a related Breast Cancer Wisconsin (Original) Data Set with a different set of…
218540 runs - 1 likes - 31 downloads - 32 reach - 16 impact
569 instances - 31 features - 2 classes - 0 missing values
1. Title: Chess End-Game -- King+Rook versus King+Pawn on a7 (usually abbreviated KRKPA7). The pawn on a7 means it is one square away from queening. It is the King+Rook's side (white) to move. 2.…
250986 runs - 0 likes - 30 downloads - 30 reach - 0 impact
3196 instances - 37 features - 2 classes - 0 missing values
A dataset of steel plates' faults, classified into 7 different types. The goal was to train machine learning for automatic pattern recognition. The dataset consists of 27 features describing each…
273355 runs - 1 likes - 30 downloads - 31 reach - 15 impact
1941 instances - 34 features - 2 classes - 0 missing values
Citation Request: This lymphography domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
1963 runs - 0 likes - 30 downloads - 30 reach - 0 impact
148 instances - 19 features - 4 classes - 0 missing values
Citation Request: This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
2006 runs - 1 likes - 29 downloads - 30 reach - 0 impact
286 instances - 10 features - 2 classes - 9 missing values
The first 5 variables are all blood tests which are thought to be sensitive to liver disorders that might arise from excessive alcohol consumption. Each line in the dataset constitutes the record of a…
148 runs - 2 likes - 29 downloads - 31 reach - 0 impact
345 instances - 7 features - 0 classes - 0 missing values
Prediction task is to determine whether a person makes over 50K a year. Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the…
2671 runs - 1 likes - 28 downloads - 29 reach - 0 impact
48842 instances - 15 features - 2 classes - 6465 missing values
This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect the confidentiality of the data. This dataset is interesting because…
19178 runs - 1 likes - 27 downloads - 28 reach - 0 impact
690 instances - 16 features - 2 classes - 67 missing values
Dataset created to study concept drift in stream mining. It is constructed by combining the Covertype, Poker-Hand, and Electricity datasets. More details can be found in: Albert Bifet, Geoff Holmes,…
332 runs - 0 likes - 26 downloads - 26 reach - 0 impact
1455525 instances - 73 features - 10 classes - 0 missing values
This radar data was collected by a system in Goose Bay, Labrador. This system consists of a phased array of 16 high-frequency antennas with a total transmitted power on the order of 6.4 kilowatts. See…
2484 runs - 3 likes - 25 downloads - 28 reach - 0 impact
351 instances - 35 features - 2 classes - 0 missing values
Attribute information: ``` sick, negative. | classes age: continuous. sex: M, F. on thyroxine: f, t. query on thyroxine: f, t. on antithyroid medication: f, t. sick: f, t. pregnant: f, t. thyroid…
14909 runs - 0 likes - 25 downloads - 25 reach - 0 impact
3772 instances - 30 features - 2 classes - 6064 missing values
Donated by P. Savicky, Institute of Computer Science, AS of CR, Czech Republic Methods for multidimensional event classification: a case study using images from a Cherenkov gamma-ray telescope.…
60646 runs - 1 likes - 25 downloads - 26 reach - 16 impact
19020 instances - 12 features - 2 classes - 0 missing values
The dataset (originally named ELEC2) contains 45,312 instances dated from 7 May 1996 to 5 December 1998. Each example of the dataset refers to a period of 30 minutes, i.e. there are 48 instances for…
102055 runs - 3 likes - 25 downloads - 28 reach - 0 impact
45312 instances - 9 features - 2 classes - 0 missing values
This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics, (b) its assigned insurance risk rating, (c) its normalized losses in use as…
2493 runs - 1 likes - 24 downloads - 25 reach - 0 impact
205 instances - 26 features - 6 classes - 59 missing values
Airlines dataset, inspired by the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given the information of the scheduled departure.
287 runs - 0 likes - 24 downloads - 24 reach - 4 impact
539383 instances - 8 features - 2 classes - 0 missing values
2126 fetal cardiotocograms (CTGs) were automatically processed and the respective diagnostic features measured. The CTGs were also classified by three expert obstetricians and a consensus…
20176 runs - 2 likes - 24 downloads - 26 reach - 47 impact
2126 instances - 36 features - 10 classes - 0 missing values
NAME vehicle silhouettes PURPOSE to classify a given silhouette as one of four types of vehicle, using a set of features extracted from the silhouette. The vehicle may be viewed from one of many…
20914 runs - 1 likes - 23 downloads - 24 reach - 0 impact
846 instances - 19 features - 4 classes - 0 missing values
The database consists of the multi-spectral values of pixels in 3x3 neighbourhoods in a satellite image, and the classification associated with the central pixel in each neighbourhood. The aim is to…
21038 runs - 2 likes - 23 downloads - 25 reach - 0 impact
6430 instances - 37 features - 6 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from flight software for an earth-orbiting satellite. Data comes from McCabe and Halstead feature extractors of source code. These features…
140568 runs - 0 likes - 23 downloads - 23 reach - 18 impact
1109 instances - 22 features - 2 classes - 0 missing values
Human Activity Recognition (HAR) database built from the recordings of 30 subjects performing activities of daily living (ADL) while carrying a waist-mounted smartphone with embedded inertial sensors.…
16720 runs - 0 likes - 23 downloads - 23 reach - 32 impact
10299 instances - 562 features - 6 classes - 0 missing values
The instances were drawn randomly from a database of 7 outdoor images. The images were hand-segmented to create a classification for every pixel. Each instance is a 3x3 region. ### Attribute…
19234 runs - 0 likes - 22 downloads - 22 reach - 0 impact
2310 instances - 20 features - 7 classes - 0 missing values
NAME: Sonar, Mines vs. Rocks SUMMARY: This is the data set used by Gorman and Sejnowski in their study of the classification of sonar signals using a neural network [1]. The task is to train a network…
2366 runs - 1 likes - 22 downloads - 23 reach - 0 impact
208 instances - 61 features - 2 classes - 0 missing values
The data is related to direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was…
60259 runs - 1 likes - 22 downloads - 23 reach - 16 impact
45211 instances - 17 features - 2 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26776 runs - 0 likes - 21 downloads - 21 reach - 0 impact
2000 instances - 48 features - 10 classes - 0 missing values
### Description Scene recognition dataset - It contains characteristics about images and their classes. The original dataset is a multi-label classification problem with 6 different labels: {Beach,…
86252 runs - 0 likes - 21 downloads - 21 reach - 16 impact
2407 instances - 300 features - 2 classes - 0 missing values
Vehicle classification in distributed sensor networks. Journal of Parallel and Distributed Computing, 64(7):826-838, July 2004. This is the SensIT Vehicle (combined) dataset, retrieved 2013-11-14 from…
403 runs - 0 likes - 21 downloads - 21 reach - 7 impact
98528 instances - 101 features - 2 classes - 0 missing values
The aim of this dataset is to distinguish between nasal (class 0) and oral sounds (class 1). Five different attributes were chosen to characterize each vowel: they are the amplitudes of the five first…
209851 runs - 4 likes - 21 downloads - 25 reach - 18 impact
5404 instances - 6 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from software for storage management for receiving and processing ground data. Data comes from McCabe and Halstead feature extractors of…
150513 runs - 1 likes - 20 downloads - 21 reach - 18 impact
2109 instances - 22 features - 2 classes - 0 missing values
Source: James P Bridge, Sean B Holden and Lawrence C Paulson University of Cambridge Computer Laboratory William Gates Building 15 JJ Thomson Avenue Cambridge CB3 0FD UK +44 (0)1223 763500…
19701 runs - 0 likes - 20 downloads - 20 reach - 32 impact
6118 instances - 52 features - 6 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26665 runs - 0 likes - 19 downloads - 19 reach - 0 impact
2000 instances - 65 features - 10 classes - 0 missing values
Dataset from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch), which consisted of 5 different datasets (SYLVA, GINA, NOVA, HIVA, ADA). The purpose of the challenge…
65060 runs - 0 likes - 19 downloads - 19 reach - 17 impact
3468 instances - 971 features - 2 classes - 0 missing values
### Description Gas Sensor Array Drift Dataset Data Set ### Sources ``` (a) Creators: Alexander Vergara (vergara '@' ucsd.edu) BioCircuits Institute University of California San Diego San Diego,…
14493 runs - 0 likes - 19 downloads - 19 reach - 32 impact
13910 instances - 129 features - 6 classes - 0 missing values
Tattile Via Gaetano Donizetti, 1-3-5,25030 Mairano (Brescia), Italy. ### Dataset Description Semeion Handwritten Digit Data Set, where 1593 handwritten digits from around 80 persons were scanned and…
24246 runs - 0 likes - 19 downloads - 19 reach - 48 impact
1593 instances - 257 features - 10 classes - 0 missing values
The data were collected as the SCITOS G5 robot navigates through the room following the wall in a clockwise direction, for 4 rounds, using 24 ultrasound sensors arranged circularly around its 'waist'.…
16428 runs - 0 likes - 19 downloads - 19 reach - 24 impact
5456 instances - 25 features - 4 classes - 0 missing values
We create a digit database by collecting 250 samples from 44 writers. The samples written by 30 writers are used for training, cross-validation and writer dependent testing, and the digits written by…
26338 runs - 0 likes - 18 downloads - 18 reach - 0 impact
10992 instances - 17 features - 10 classes - 0 missing values
1. Title: Haberman's Survival Data 2. Sources: (a) Donor: Tjen-Sien Lim (limt@stat.wisc.edu) (b) Date: March 4, 1999 3. Past Usage: 1. Haberman, S. J. (1976). Generalized Residuals for Log-Linear…
3241 runs - 1 likes - 18 downloads - 19 reach - 0 impact
306 instances - 4 features - 2 classes - 0 missing values
1. Title of Database: Optical Recognition of Handwritten Digits 2. Source: E. Alpaydin, C. Kaynak Department of Computer Engineering Bogazici University, 80815 Istanbul Turkey alpaydin@boun.edu.tr…
26796 runs - 1 likes - 18 downloads - 19 reach - 0 impact
5620 instances - 65 features - 10 classes - 0 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
12290 runs - 0 likes - 18 downloads - 18 reach - 16 impact
672 instances - 10 features - 2 classes - 1200 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
106014 runs - 0 likes - 18 downloads - 18 reach - 18 impact
15545 instances - 6 features - 2 classes - 0 missing values
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) Data set for KDD Cup 1999 Modified by TunedIT (converted to ARFF format)…
4 runs - 0 likes - 18 downloads - 18 reach - 5 impact
4898431 instances - 42 features - 23 classes - 0 missing values
Author: Volker Lohweg (University of Applied Sciences, Ostwestfalen-Lippe) Source: [UCI](https://archive.ics.uci.edu/ml/datasets/banknote+authentication) - 2012 Please cite:…
127247 runs - 1 likes - 18 downloads - 19 reach - 19 impact
1372 instances - 5 features - 2 classes - 0 missing values
Each record represents 100 points on a two-dimensional graph. When plotted in order (from 1 through 100) as the Y coordinate, the points will create either a Hill (a “bump” in the terrain) or a…
179150 runs - 0 likes - 18 downloads - 18 reach - 15 impact
1212 instances - 101 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from software for science data processing. Data comes from McCabe and Halstead feature extractors of source code. These features were…
167473 runs - 0 likes - 18 downloads - 18 reach - 17 impact
522 instances - 22 features - 2 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26854 runs - 0 likes - 17 downloads - 17 reach - 0 impact
2000 instances - 217 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. The maps were scanned in 8 bit grey value at density of 400dpi,…
22666 runs - 0 likes - 17 downloads - 17 reach - 0 impact
2000 instances - 241 features - 10 classes - 0 missing values
This database contains 13 attributes (which have been extracted from a larger set of 75). Attribute Information: 1. age 2. sex 3. chest pain type (4 values) 4.…
3208 runs - 0 likes - 17 downloads - 17 reach - 0 impact
270 instances - 14 features - 2 classes - 0 missing values
1. Title of Database: Blocks Classification 2. Sources: (a) Donato Malerba Dipartimento di Informatica University of Bari via Orabona 4 70126 Bari - Italy phone: +39 - 80 - 5443269 fax: +39 - 80 -…
2719 runs - 0 likes - 17 downloads - 17 reach - 0 impact
5473 instances - 11 features - 5 classes - 0 missing values
1. Title: Contraceptive Method Choice 2. Sources: (a) Origin: This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey (b) Creator: Tjen-Sien Lim (limt@stat.wisc.edu)…
18003 runs - 0 likes - 17 downloads - 17 reach - 0 impact
1473 instances - 10 features - 3 classes - 0 missing values
1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and Fisheries, Tasmania…
34900 runs - 0 likes - 17 downloads - 17 reach - 0 impact
4177 instances - 9 features - 28 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/ More information: https://archive.ics.uci.edu/ml/datasets/Musk+(Version+2)
82514 runs - 1 likes - 17 downloads - 18 reach - 21 impact
6598 instances - 170 features - 2 classes - 0 missing values
This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable, and/or improvable predictive models of software engineering. If you publish material based…
14129 runs - 0 likes - 17 downloads - 17 reach - 17 impact
10885 instances - 22 features - 2 classes - 25 missing values
Lucas, D. D., Klein, R., Tannahill, J., Ivanova, D., Brandon, S., Domyancic, D., and Zhang, Y.: Failure analysis of parameter-induced simulation crashes in climate models, Geosci. Model Dev. Discuss.,…
158613 runs - 0 likes - 17 downloads - 17 reach - 16 impact
540 instances - 21 features - 2 classes - 0 missing values
This is the poker dataset, retrieved 2013-11-14 from the LibSVM site. In addition to the preprocessing done there (see LibSVM site for details), this dataset was created as follows: -join test and…
23 runs - 0 likes - 17 downloads - 17 reach - 6 impact
1025010 instances - 11 features - 2 classes - 0 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code IDs were removed. Note that there is also a related Breast Cancer Wisconsin (Diagnosis) Data Set with a different set of…
18908 runs - 1 likes - 16 downloads - 17 reach - 0 impact
699 instances - 10 features - 2 classes - 16 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
27245 runs - 0 likes - 16 downloads - 16 reach - 0 impact
2000 instances - 7 features - 10 classes - 0 missing values
1. Title: 1984 United States Congressional Voting Records Database 2. Source Information: (a) Source: Congressional Quarterly Almanac, 98th Congress, 2nd session 1984, Volume XL: Congressional…
2250 runs - 0 likes - 16 downloads - 16 reach - 0 impact
435 instances - 17 features - 2 classes - 392 missing values
1. Title: Lung Cancer Data 2. Source Information: - Data was published in : Hong, Z.Q. and Yang, J.Y. "Optimal Discriminant Plane for a Small Number of Samples and Design Method of Classifier on the…
1238 runs - 0 likes - 16 downloads - 16 reach - 0 impact
32 instances - 57 features - 3 classes - 5 missing values
Oil dataset Past Usage: 1. Kubat, M., Holte, R.,
200 runs - 2 likes - 16 downloads - 18 reach - 13 impact
937 instances - 50 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from flight software for an earth-orbiting satellite. Data comes from McCabe and Halstead feature extractors of source code. These features…
138440 runs - 0 likes - 16 downloads - 16 reach - 17 impact
1563 instances - 38 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2863 runs - 1 likes - 16 downloads - 17 reach - 15 impact
1545 instances - 10937 features - 2 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-1000 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of…
11010 runs - 0 likes - 16 downloads - 16 reach - 38 impact
1000 instances - 41 features - 8 classes - 0 missing values
#### Abstract: MADELON is an artificial dataset, which was part of the NIPS 2003 feature selection challenge. This is a two-class classification problem with continuous input variables. The difficulty…
93305 runs - 0 likes - 16 downloads - 16 reach - 16 impact
2600 instances - 501 features - 2 classes - 0 missing values
This data set contains 416 liver patient records and 167 non-liver patient records. The data set was collected from the north east of Andhra Pradesh, India. The class label divides the patients into 2…
149162 runs - 0 likes - 16 downloads - 16 reach - 16 impact
583 instances - 11 features - 2 classes - 0 missing values
No data.
353 runs - 0 likes - 16 downloads - 16 reach - 0 impact
120919 instances - 1002 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
390638 runs - 0 likes - 16 downloads - 16 reach - 23 impact
601 instances - 7 features - 2 classes - 0 missing values
The original Annealing dataset from UCI. The exact meaning of the features and classes is largely unknown. Annealing, in metallurgy and materials science, is a heat treatment that alters the physical…
13771 runs - 0 likes - 15 downloads - 15 reach - 0 impact
898 instances - 39 features - 5 classes - 22175 missing values
Date: Tue, 15 Nov 88 15:44:08 EST From: stan To: aha@ICS.UCI.EDU 1. Title: Final settlements in labor negotiations in Canadian industry 2. Source Information -- Creators:…
7630 runs - 0 likes - 15 downloads - 15 reach - 0 impact
57 instances - 17 features - 2 classes - 326 missing values
1. Title: Nursery Database 2. Sources: (a) Creator: Vladislav Rajkovic et al. (13 experts) (b) Donors: Marko Bohanec (marko.bohanec@ijs.si) Blaz Zupan (blaz.zupan@ijs.si) (c) Date: June, 1997 3. Past…
2210 runs - 0 likes - 15 downloads - 15 reach - 0 impact
12960 instances - 9 features - 5 classes - 0 missing values
No data.
2193 runs - 0 likes - 15 downloads - 15 reach - 0 impact
1484 instances - 9 features - 10 classes - 0 missing values
No data.
1777 runs - 0 likes - 15 downloads - 15 reach - 0 impact
28056 instances - 7 features - 18 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
354794 runs - 0 likes - 15 downloads - 15 reach - 26 impact
556 instances - 7 features - 2 classes - 0 missing values