Data
Filter results by:
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
180 runs0 likes5 downloads5 reach21 impact
294 instances - 12 features - 2 classes - 0 missing values
No data.
0 runs0 likes1 downloads1 reach1 impact
1000000 instances - 14 features - 0 classes - 0 missing values
No data.
0 runs0 likes1 downloads1 reach1 impact
17496 instances - 10 features - 0 classes - 0 missing values
"The sulfur recovery unit (SRU) removes environmental pollutants from acid gas streams before they are released into the atmosphere. Furthermore, elemental sulfur is recovered as a valuable…
0 runs0 likes1 downloads1 reach3 impact
10081 instances - 7 features - 0 classes - 0 missing values
This data set is also obtained from the task of controlling the ailerons of a F16 aircraft, although the target variable and attributes are different from the ailerons domain. The target variable here…
2 runs0 likes3 downloads3 reach1 impact
9517 instances - 7 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Survival treated as the class attribute As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
12 runs0 likes2 downloads2 reach1 impact
130 instances - 10 features - 0 classes - 97 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
10 runs0 likes0 downloads0 reach1 impact
294 instances - 14 features - 0 classes - 782 missing values
No data.
163 runs0 likes5 downloads5 reach1 impact
1000000 instances - 28 features - 2 classes - 0 missing values
No data.
68 runs0 likes4 downloads4 reach1 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
225 runs0 likes7 downloads7 reach2 impact
1000000 instances - 21 features - 2 classes - 0 missing values
Automated file upload of BNG(credit-g)
99 runs0 likes3 downloads3 reach2 impact
1000000 instances - 21 features - 2 classes - 0 missing values
ecoli-pmlb
31 runs0 likes1 downloads1 reach11 impact
327 instances - 8 features - 5 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach1 impact
1000000 instances - 14 features - 0 classes - 0 missing values
exercises
0 runs0 likes0 downloads0 reach1 impact
15000 instances - 8 features - classes - 0 missing values
exercises
0 runs0 likes0 downloads0 reach1 impact
15000 instances - 8 features - classes - 0 missing values
Test
0 runs0 likes0 downloads0 reach0 impact
6330 instances - 8 features - classes - 0 missing values
* Title: seeds Data Set * Abstract: Measurements of geometrical properties of kernels belonging to three different varieties of wheat. A soft X-ray technique and GRAINS package were used to construct…
190 runs0 likes5 downloads5 reach6 impact
210 instances - 8 features - 3 classes - 0 missing values
* Title: seismic-bumps Data Set * Abstract: The data describe the problem of high energy (higher than 10^4 J) seismic bumps forecasting in a coal mine. Data come from two of longwalls located in a…
152 runs0 likes37 downloads37 reach6 impact
210 instances - 8 features - 3 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach9 impact
209 instances - 8 features - classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistic methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs0 likes2 downloads2 reach6 impact
48 instances - 8 features - 0 classes - 0 missing values
Daily electric energy dataset The dee problem involves predicting the daily average price of TkWhe electricity energy in Spain. The data set contains real values from 2003 about the daily consumption…
0 runs0 likes0 downloads0 reach1 impact
365 instances - 7 features - 0 classes - 0 missing values
Prediction of residuary resistance of sailing yachts at the initial design stage is of a great value for evaluating the ship’s performance and for estimating the required propulsive…
0 runs0 likes0 downloads0 reach1 impact
308 instances - 7 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
8 runs0 likes2 downloads2 reach6 impact
50 instances - 8 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Cholesterol treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
160 runs0 likes3 downloads3 reach2 impact
303 instances - 14 features - 0 classes - 6 missing values
Donor: David W. Aha (aha@ics.uci.edu) This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one…
37 runs0 likes5 downloads5 reach2 impact
303 instances - 14 features - 0 classes - 6 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) The infamous Longley data, "An appraisal of least-squares programs from the point of view of the user", JASA, 62(1967) p819-841. Variables are: Number of…
3 runs0 likes1 downloads1 reach2 impact
16 instances - 7 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach6 impact
400 instances - 7 features - 0 classes - 0 missing values
This file contains data from Regression Analysis By Example, 2nd Edition, by Samprit Chatterjee and Bertram Price, John Wiley, 1991. Data sets have names of the form 'rabe.xxx' where xxx is the page…
0 runs0 likes0 downloads0 reach6 impact
51 instances - 7 features - 0 classes - 0 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
0 runs0 likes0 downloads0 reach6 impact
30 instances - 8 features - 0 classes - 6 missing values
The problem concerns Relative CPU Performance Data. More information can be obtained in the UCI Machine Learning repository (http://www.ics.uci.edu/~mlearn/MLSummary.html). The used attributes are :…
2 runs0 likes2 downloads2 reach4 impact
209 instances - 7 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Attributes 2 and 8 deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
2 runs0 likes2 downloads2 reach11 impact
209 instances - 8 features - 0 classes - 0 missing values
This classic dataset contains the prices and other attributes of almost 54,000 diamonds. It's a great dataset for beginners learning to work with data analysis and visualization. Content price price…
0 runs0 likes1 downloads1 reach0 impact
53940 instances - 10 features - 0 classes - 0 missing values
This database has been artificially generated. It describes the structure of the capital letters A, C, D, E, F, G, H, L, P, R, indicated by a number 1-10, in that order (A=1,C=2,...). Each letter's…
24309 runs0 likes10 downloads10 reach50 impact
10218 instances - 8 features - 10 classes - 0 missing values
This simple domain contains 7 Boolean attributes and 10 classes, the set of decimal digits. Recall that LED displays contain 7 light-emitting diodes -- hence the reason for 7 attributes. The class…
13006 runs0 likes9 downloads9 reach11 impact
500 instances - 8 features - 10 classes - 0 missing values
* Title: Wholesale customers Data Set * Abstract: The data set refers to clients of a wholesale distributor. It includes the annual spending in monetary units (m.u.) on diverse product categories *…
161 runs0 likes10 downloads10 reach7 impact
440 instances - 9 features - 2 classes - 0 missing values
Experiment data obtained by running random configurations of glmnet through mlr on 114 different classification tasks from openml.
0 runs0 likes0 downloads0 reach0 impact
104820 instances - 10 features - classes - 0 missing values
Geographical Analysis Spatial Data This georeferenced data set was used in: Pace, R. Kelley, and Ronald Barry, Quick Computation of Regressions with a Spatially Autoregressive Dependent Variable,…
4 runs1 likes1 downloads2 reach7 impact
3107 instances - 7 features - 0 classes - 0 missing values
The data consist of annual observations on the level of strike volume (days lost due to industrial disputes per 1000 wage salary earners), and their covariates in 18 OECD countries from 1951-1985. The…
0 runs0 likes2 downloads2 reach7 impact
625 instances - 7 features - 0 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
16598 instances - 11 features - classes - 329 missing values
Sensor data measurements of one Boiler, containing WaterInput/SteamOutput (flow, temperature, pressure) for one month, which is measured every minute.
0 runs0 likes1 downloads1 reach3 impact
44643 instances - 8 features - classes - 44643 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
336 instances - 8 features - classes - 0 missing values
test3
0 runs0 likes0 downloads0 reach0 impact
2 instances - 8 features - classes - 0 missing values
This data is used to test water contamination
0 runs0 likes0 downloads0 reach0 impact
26 instances - 8 features - classes - 0 missing values
No data.
697 runs0 likes7 downloads7 reach13 impact
320 instances - 9 features - 2 classes - 0 missing values
Original data from https://github.com/propublica/compas-analysis/ by ProPublica. The data was subsequently preprocessed and reduced to relevant features for classification. The target variable is…
0 runs0 likes1 downloads1 reach8 impact
5278 instances - 14 features - 2 classes - 0 missing values
The data is related with direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was…
65398 runs2 likes35 downloads37 reach27 impact
45211 instances - 17 features - 2 classes - 0 missing values
* Dataset: Reduced version (10 % of the examples) of bank-marketing dataset.
1254 runs1 likes17 downloads18 reach13 impact
4521 instances - 17 features - 2 classes - 0 missing values
* Abstract: A 3-class version of abalone dataset. * Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and…
176 runs0 likes4 downloads4 reach12 impact
4177 instances - 9 features - 3 classes - 0 missing values
nominal features and target for COMPAS
0 runs0 likes1 downloads1 reach7 impact
5278 instances - 14 features - 2 classes - 0 missing values
Attribute information: ``` sick, negative. | classes age: continuous. sex: M, F. on thyroxine: f, t. query on thyroxine: f, t. on antithyroid medication: f, t. sick: f, t. pregnant: f, t. thyroid…
19940 runs0 likes31 downloads31 reach7 impact
3772 instances - 30 features - 2 classes - 6064 missing values
1. Title: Protein Localization Sites 2. Creator and Maintainer: Kenta Nakai Institue of Molecular and Cellular Biology Osaka, University 1-3 Yamada-oka, Suita 565 Japan nakai@imcb.osaka-u.ac.jp…
1806 runs0 likes13 downloads13 reach9 impact
336 instances - 8 features - 8 classes - 0 missing values
The dataset (originally named ELEC2) contains 45,312 instances dated from 7 May 1996 to 5 December 1998. Each example of the dataset refers to a period of 30 minutes, i.e. there are 48 instances for…
106854 runs3 likes38 downloads41 reach9 impact
45312 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
747 runs0 likes13 downloads13 reach13 impact
4177 instances - 9 features - 2 classes - 0 missing values
February 23, 1982 The 1982 annual meetings of the American Statistical Association (ASA) will be held August 16-19, 1982 in Cincinnati. At that meeting, the ASA Committee on Statistical Graphics plans…
759 runs0 likes9 downloads9 reach21 impact
209 instances - 9 features - 2 classes - 15 missing values
SUMMARY: Data from an experiment on the affects of machine adjustments on the time to count bolts. Data appear as the STATS (Issue 10) Challenge. DATA: Submitted by W. Robert Stephenson, Iowa State…
754 runs0 likes9 downloads9 reach12 impact
40 instances - 8 features - 2 classes - 0 missing values
; ; Thyroid disease records supplied by the Garavan Institute and J. Ross ; Quinlan, New South Wales Institute, Syndney, Australia. ; ; 1987. ; hypothyroid, primary hypothyroid, compensated…
883 runs0 likes11 downloads11 reach7 impact
3772 instances - 30 features - 4 classes - 6064 missing values
1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and Fisheries, Tasmania…
34899 runs0 likes18 downloads18 reach7 impact
4177 instances - 9 features - 28 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
698 runs0 likes6 downloads6 reach12 impact
97 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
739 runs0 likes11 downloads11 reach13 impact
4052 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
755 runs0 likes4 downloads4 reach12 impact
54 instances - 8 features - 2 classes - 120 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
554 runs0 likes10 downloads10 reach13 impact
40768 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
819 runs0 likes10 downloads10 reach13 impact
500 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
791 runs0 likes7 downloads7 reach13 impact
400 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
764 runs0 likes6 downloads6 reach13 impact
400 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
788 runs0 likes7 downloads7 reach13 impact
400 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
779 runs0 likes7 downloads7 reach13 impact
400 instances - 8 features - 2 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
722 runs0 likes6 downloads6 reach12 impact
60 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
758 runs0 likes8 downloads8 reach13 impact
500 instances - 8 features - 2 classes - 0 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) Database of surgeries on horses. Possible class attributes: 24 (whether lesion is surgical), others include: 23, 25, 26, and 27 Notes: * Hospital_Number…
236 runs0 likes9 downloads9 reach7 impact
368 instances - 27 features - 2 classes - 1927 missing values