Data
Filter results by:
The original Annealing dataset from UCI. The exact meaning of the features and classes is largely unknown. Annealing, in metallurgy and materials science, is a heat treatment that alters the physical…
13779 runs0 likes16 downloads16 reach13 impact
898 instances - 39 features - 5 classes - 22175 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
712 runs0 likes8 downloads8 reach15 impact
898 instances - 39 features - 2 classes - 22175 missing values
One of the NASA Metrics Data Program defect data sets. The specific type of software is unknown. Data comes from McCabe and Halstead features extractors of source code. These features were defined in…
815 runs0 likes15 downloads15 reach18 impact
9466 instances - 39 features - 2 classes - 0 missing values
pie chart 3
103 runs0 likes6 downloads6 reach13 impact
1077 instances - 38 features - 2 classes - 0 missing values
Mega watt
183 runs0 likes8 downloads8 reach15 impact
253 instances - 38 features - 2 classes - 0 missing values
Pizza cutter
197 runs0 likes8 downloads8 reach14 impact
661 instances - 38 features - 2 classes - 0 missing values
Pizza cutter 3
188 runs0 likes7 downloads7 reach14 impact
1043 instances - 38 features - 2 classes - 0 missing values
Costa madre 1
90 runs0 likes6 downloads6 reach15 impact
296 instances - 38 features - 2 classes - 0 missing values
cast metal 1
111 runs0 likes9 downloads9 reach13 impact
327 instances - 38 features - 2 classes - 0 missing values
pie chart 1
102 runs0 likes5 downloads5 reach13 impact
705 instances - 38 features - 2 classes - 0 missing values
Mean While 1
0 runs0 likes3 downloads3 reach11 impact
253 instances - 38 features - 2 classes - 0 missing values
Training dataset of the 'Porto Seguros Safe Driver Prediction' Kaggle challenge [https://www.kaggle.com/c/porto-seguro-safe-driver-prediction]. The goal was to predict whether a driver will file an…
2 runs0 likes0 downloads0 reach12 impact
595212 instances - 38 features - 2 classes - 846458 missing values
The dataset contains the premier league matches for the season 2014-2015.
0 runs0 likes2 downloads2 reach8 impact
380 instances - 38 features - classes - 9 missing values
The dataset contains the serie a matches for season 2015-2016
0 runs0 likes0 downloads0 reach8 impact
379 instances - 38 features - classes - 44 missing values
A Vicon motion capture camera system was used to record 12 users performing 5 hand postures with markers attached to a left-handed glove. A rigid pattern of markers on the back of the glove was used…
0 runs0 likes0 downloads0 reach0 impact
78096 instances - 38 features - classes - 974700 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable,…
765 runs0 likes10 downloads10 reach15 impact
403 instances - 38 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from flight software for earth orbiting satellite. Data comes from McCabe and Halstead features extractors of source code. These features…
146025 runs1 likes18 downloads19 reach26 impact
1563 instances - 38 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from flight software for earth orbiting satellite. Data comes from McCabe and Halstead features extractors of source code. These features…
115699 runs0 likes17 downloads17 reach27 impact
1458 instances - 38 features - 2 classes - 0 missing values
No data.
292 runs0 likes4 downloads4 reach12 impact
1000000 instances - 37 features - 6 classes - 0 missing values
pie chart 2
101 runs0 likes5 downloads5 reach13 impact
745 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach12 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes2 downloads2 reach12 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach12 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
31 runs0 likes1 downloads1 reach12 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach12 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes2 downloads2 reach12 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach12 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach12 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach12 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
0 runs0 likes2 downloads2 reach12 impact
1000000 instances - 37 features - 0 classes - 0 missing values
No data.
324 runs0 likes5 downloads5 reach12 impact
1000000 instances - 37 features - 2 classes - 0 missing values
Source: Ashwin Srinivasan Department of Statistics and Data Modeling University of Strathclyde Glasgow Scotland UK ross '@' uk.ac.turing The original Landsat data for this database was generated from…
1 runs1 likes8 downloads9 reach19 impact
6435 instances - 37 features - 0 classes - 0 missing values
The satellite dataset comprises of features extracted from satellite observations. In particular, each image was taken under four different light wavelength, two in visible light (green and red) and…
2074 runs3 likes70 downloads73 reach33 impact
5100 instances - 37 features - 2 classes - 0 missing values
dataset for feature extraction
0 runs0 likes0 downloads0 reach8 impact
69 instances - 37 features - classes - 0 missing values
The goal of the research is to help the auditors by building a classification model that can predict the fraudulent firm on the basis the present and historical risk factors. The information about the…
0 runs0 likes0 downloads0 reach0 impact
1552 instances - 37 features - 0 classes - 19402 missing values
Author: Alen Shapiro Source: [UCI](https://archive.ics.uci.edu/ml/datasets/Chess+(King-Rook+vs.+King-Pawn)) Please cite: [UCI citation policy](https://archive.ics.uci.edu/ml/citation_policy.html) 1.…
273623 runs1 likes43 downloads44 reach16 impact
3196 instances - 37 features - 2 classes - 0 missing values
The database consists of the multi-spectral values of pixels in 3x3 neighbourhoods in a satellite image, and the classification associated with the central pixel in each neighbourhood. The aim is to…
29713 runs2 likes24 downloads26 reach12 impact
6430 instances - 37 features - 6 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
102 runs0 likes3 downloads3 reach15 impact
527 instances - 37 features - 2 classes - 542 missing values
One of the NASA Metrics Data Program defect data sets. Data from flight software for earth orbiting satellite. Data comes from McCabe and Halstead features extractors of source code. These features…
875 runs0 likes13 downloads13 reach17 impact
5589 instances - 37 features - 2 classes - 0 missing values
2126 fetal cardiotocograms (CTGs) were automatically processed and the respective diagnostic features measured. The CTGs were also classified by three expert obstetricians and a consensus…
24283 runs5 likes29 downloads34 reach56 impact
2126 instances - 36 features - 10 classes - 0 missing values
A 3-class version of Cardiotocography dataset.
134 runs0 likes14 downloads14 reach14 impact
2126 instances - 36 features - 3 classes - 0 missing values
No data.
314 runs1 likes8 downloads9 reach12 impact
1000000 instances - 36 features - 19 classes - 0 missing values
student performance analysis 1
0 runs0 likes1 downloads1 reach6 impact
3892 instances - 36 features - classes - 0 missing values
mydata
0 runs0 likes0 downloads0 reach6 impact
3892 instances - 36 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Andromeda dataset (Hatzikos et al. 2008) concerns the prediction of future values for six…
0 runs0 likes0 downloads0 reach9 impact
49 instances - 36 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Andromeda dataset (Hatzikos et al. 2008) concerns the prediction of future values for six…
0 runs0 likes0 downloads0 reach9 impact
49 instances - 36 features - classes - 0 missing values
The dataset was collected at 'Hospital Universitario de Caracas' in Caracas, Venezuela. The dataset comprises demographic information, habits, and historic medical records of 858 patients. Several…
0 runs0 likes0 downloads0 reach0 impact
858 instances - 36 features - classes - 3622 missing values
The dataset contains 19 attributes regarding ca cervix behavior risk with class label is ca_cervix with 1 and 0 as values which means the respondent with and without ca cervix, respectively. ###…
0 runs0 likes0 downloads0 reach0 impact
858 instances - 36 features - classes - 3622 missing values
This is an experimental data set for trying to classify numbers in a lottery as "Highly likely to be picked" or "Not very likely to be picked". It is based on a little more than a…
0 runs0 likes0 downloads0 reach0 impact
12528 instances - 36 features - classes - 0 missing values
This is the large soybean database from the UCI repository, with its training and test database combined into a single file. There are 19 classes, only the first 15 of which have been used in prior…
40719 runs1 likes54 downloads55 reach13 impact
683 instances - 36 features - 19 classes - 2337 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
722 runs0 likes6 downloads6 reach15 impact
683 instances - 36 features - 2 classes - 2337 missing values
Data on tree growth used in the Case Study published in the September, 1995 issue of the Canadian Journal of Statistics. This data set was been provided by Dr. Fernando Camacho, Ontario Hydro…
18457 runs1 likes15 downloads16 reach39 impact
2796 instances - 35 features - 6 classes - 68100 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
0 runs0 likes0 downloads0 reach11 impact
2796 instances - 35 features - 2 classes - 68100 missing values
No data.
66 runs0 likes3 downloads3 reach13 impact
1000000 instances - 35 features - 6 classes - 0 missing values
No data.
230 runs0 likes4 downloads4 reach12 impact
1000000 instances - 35 features - 2 classes - 0 missing values
No data.
309 runs0 likes6 downloads6 reach12 impact
1000000 instances - 35 features - 6 classes - 0 missing values
--------------------------------------------------------------------------- Short description --------------------------------------------------------------------------- Data on tree growth used in…
0 runs0 likes2 downloads2 reach11 impact
2796 instances - 35 features - 6 classes - 68100 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 35 features - classes - 0 missing values
Automated file upload of BNG(ionosphere)
99 runs1 likes4 downloads5 reach13 impact
1000000 instances - 35 features - 2 classes - 0 missing values
1. Title: Dermatology Database 2. Source Information: (a) Original owners: -- 1. Nilsel Ilter, M.D., Ph.D., Gazi University, School of Medicine 06510 Ankara, Turkey Phone: +90 (312) 214 1080 -- 2. H.…
1756 runs0 likes14 downloads14 reach12 impact
366 instances - 35 features - 6 classes - 8 missing values
This radar data was collected by a system in Goose Bay, Labrador. This system consists of a phased array of 16 high-frequency antennas with a total transmitted power on the order of 6.4 kilowatts. See…
2484 runs3 likes27 downloads30 reach12 impact
351 instances - 35 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
723 runs0 likes6 downloads6 reach15 impact
366 instances - 35 features - 2 classes - 8 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes1 downloads1 reach13 impact
16 instances - 34 features - 0 classes - 0 missing values
Asteroid Dataset
0 runs0 likes1 downloads1 reach10 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes1 downloads1 reach11 impact
126131 instances - 34 features - 2 classes - 99 missing values
Coal mining requires working in hazardous conditions. Miners in an underground coal mine can face several threats, such as, e.g. methane explosions or rock-burst. To provide protection for people…
0 runs0 likes0 downloads0 reach2 impact
9199930 instances - 34 features - classes - 0 missing values
A dataset of steel plates' faults, classified into 7 different types. The goal was to train machine learning for automatic pattern recognition. The dataset consists of 27 features describing each…
277313 runs1 likes49 downloads50 reach25 impact
1941 instances - 34 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
104 runs0 likes7 downloads7 reach15 impact
1302 instances - 34 features - 2 classes - 7830 missing values
No data.
0 runs0 likes1 downloads1 reach9 impact
1000000 instances - 33 features - 0 classes - 0 missing values
Abstract: This data set contains a total 5820 evaluation scores provided by students from Gazi University in Ankara (Turkey). There is a total of 28 course specific questions and additional 5…
0 runs0 likes2 downloads2 reach16 impact
5820 instances - 33 features - classes - 0 missing values
No data.
0 runs0 likes1 downloads1 reach9 impact
1000000 instances - 33 features - 0 classes - 0 missing values
1. Title: Wisconsin Prognostic Breast Cancer (WPBC) 2. Source Information a) Creators: Dr. William H. Wolberg, General Surgery Dept., University of Wisconsin, Clinical Sciences Center, Madison, WI…
5 runs0 likes4 downloads4 reach9 impact
194 instances - 33 features - 0 classes - 0 missing values
No data.
334 runs0 likes4 downloads4 reach12 impact
1000000 instances - 33 features - 2 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes2 downloads2 reach13 impact
195 instances - 33 features - 0 classes - 0 missing values
A family of datasets synthetically generated from a simulation of how bank-customers choose their banks. Tasks are based on predicting the fraction of bank customers who leave the bank because of full…
0 runs0 likes2 downloads2 reach14 impact
8192 instances - 33 features - 0 classes - 0 missing values
This is one of a family of datasets synthetically generated from a realistic simulation of the dynamics of a Unimation Puma 560 robot arm. There are eight datastets in this family . In this repository…
0 runs0 likes6 downloads6 reach14 impact
8192 instances - 33 features - 0 classes - 0 missing values
calendarDOW-pmlb
31 runs0 likes1 downloads1 reach21 impact
399 instances - 33 features - 5 classes - 0 missing values
One of the biggest challenges of an auto dealership purchasing a used car at an auto auction is the risk of that the vehicle might have serious issues that prevent it from being sold to customers. The…
3 runs0 likes3 downloads3 reach13 impact
72983 instances - 33 features - 2 classes - 149271 missing values
Klaverjas is an example of the Jack-Nine card games, which are characterized as trick-taking games where the the Jack and nine of the trump suit are the highest-ranking trumps, and the tens and aces…
0 runs0 likes1 downloads1 reach10 impact
981541 instances - 33 features - 2 classes - 0 missing values
This data approach student achievement in secondary education of two Portuguese schools. The data attributes include student grades, demographic, social and school related features) and it was…
0 runs0 likes0 downloads0 reach8 impact
649 instances - 33 features - 0 classes - 0 missing values
This data approach student achievement in secondary education of two Portuguese schools. The data attributes include student grades, demographic, social and school related features) and it was…
0 runs0 likes1 downloads1 reach8 impact
395 instances - 33 features - 0 classes - 0 missing values
Creators: Renata Cristina Barros Madeo (Madeo, R. C. B.) Priscilla Koch Wagner (Wagner, P. K.) Sarajane Marques Peres (Peres, S. M.) {renata.si, priscilla.wagner, sarajane} at usp.br…
26327 runs1 likes16 downloads17 reach38 impact
9873 instances - 33 features - 5 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
762 runs0 likes13 downloads13 reach15 impact
8192 instances - 33 features - 2 classes - 0 missing values
1. Title: INDUCE Trains Data set 2. Sources: - Donor: GMU, Center for AI, Software Librarian, Eric E. Bloedorn (bloedorn@aic.gmu.edu) - Original owners: Ryszard S. Michalski (michalski@aic.gmu.edu)…
1973 runs0 likes9 downloads9 reach15 impact
10 instances - 33 features - 2 classes - 51 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
490 runs0 likes4 downloads4 reach13 impact
364 instances - 33 features - 6 classes - 101 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
772 runs0 likes8 downloads8 reach15 impact
194 instances - 33 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
744 runs0 likes12 downloads12 reach15 impact
8192 instances - 33 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs0 likes6 downloads6 reach15 impact
364 instances - 33 features - 2 classes - 80 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
194 instances - 32 features - classes - 0 missing values
#sbox
0 runs0 likes0 downloads0 reach7 impact
10000 instances - 32 features - classes - 0 missing values
Data file: This data from "Problem-Solving" on "backache in pregnancy" is in somewhat different format from that listed in the book. Each integer is preceded by a space. This makes it easier to read.…
174 runs0 likes6 downloads6 reach15 impact
180 instances - 32 features - 2 classes - 0 missing values
White Clover Persistence Trials Data source: Ian Tarbotton AgResearch, Whatawhata Research Centre, Hamilton, New Zealand The objective was to determine the mechanisms which influence the persistence…
858 runs0 likes5 downloads5 reach15 impact
63 instances - 32 features - 4 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
732 runs0 likes5 downloads5 reach14 impact
63 instances - 32 features - 2 classes - 0 missing values
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
0 runs0 likes0 downloads0 reach13 impact
31 instances - 31 features - 0 classes - 33 missing values
Om algos te testen
74 runs0 likes6 downloads6 reach15 impact
14240 instances - 31 features - 2 classes - 0 missing values
Phishing website 1
0 runs0 likes0 downloads0 reach2 impact
11055 instances - 31 features - 0 classes - 0 missing values
Testing dataset
0 runs0 likes1 downloads1 reach3 impact
134731 instances - 31 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 31 features - classes - 0 missing values
The YouTube personality dataset consists of a collection of behavorial features, speech transcriptions, and personality impression scores for a set of 404 YouTube vloggers that explicitly show…
0 runs0 likes1 downloads1 reach9 impact
404 instances - 31 features - classes - 0 missing values
Anonymized data of dating profiles from OkCupid
0 runs0 likes3 downloads3 reach8 impact
59946 instances - 31 features - 0 classes - 273249 missing values