OpenML
sd vfv
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4 instances - 50 features - 2 classes - 0 missing values
A dataset of steel plates' faults, classified into 7 different types. The goal was to train machine learning for automatic pattern recognition. The dataset consists of 27 features describing each…
277313 runs - 1 likes - 42 downloads - 43 reach - 19 impact
1941 instances - 34 features - 2 classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-subjects-stemmed * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set…
71 runs - 0 likes - 2 downloads - 2 reach - 7 impact
64 instances - 230 features - 2 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 0.1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
63420 runs - 0 likes - 15 downloads - 15 reach - 19 impact
39948 instances - 12 features - 2 classes - 0 missing values
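The majority-class downsampling mentioned in this description can be sketched roughly as follows. This is only an illustration on toy data, not the procedure actually used to build the dataset: the function name, the `keep_fraction` value, and the row layout are all assumptions.

```python
import random

def downsample_majority(rows, label_key, majority_label, keep_fraction, seed=0):
    """Keep every minority row; keep each majority row with probability keep_fraction."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    kept = []
    for row in rows:
        if row[label_key] != majority_label or rng.random() < keep_fraction:
            kept.append(row)
    return kept

# Toy data (hypothetical): 1000 non-clicks (click=0) and 50 clicks (click=1).
rows = [{"click": 0} for _ in range(1000)] + [{"click": 1} for _ in range(50)]
sample = downsample_majority(rows, "click", majority_label=0, keep_fraction=0.1)
clicks = sum(r["click"] for r in sample)
```

All minority rows survive, so the class ratio in `sample` is far less skewed than in `rows`.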
Source: David Gil, dgil '@' dtic.ua.es, Lucentia Research Group, Department of Computer Technology, University of Alicante Jose Luis Girela, girela '@' ua.es, Department of Biotechnology, University…
451 runs - 0 likes - 7 downloads - 7 reach - 7 impact
100 instances - 10 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 2 downloads - 2 reach - 3 impact
14 instances - 5 features - 2 classes - 0 missing values
* Title: Thoracic Surgery Data Data Set * Abstract: The data is dedicated to classification problem related to the post-operative life expectancy in the lung cancer patients: class 1 - death within…
145 runs - 0 likes - 8 downloads - 8 reach - 8 impact
470 instances - 17 features - 2 classes - 0 missing values
The data is cleaned, regularized and encrypted global equity data. The first 21 columns (feature1 - feature21) are features, and target is the binary class you’re trying to predict.
1083 runs - 1 likes - 3 downloads - 4 reach - 9 impact
96320 instances - 22 features - 2 classes - 0 missing values
Source: http://www.ijcaonline.org/archives/volume47/number18/7291-0509 Data Set Information: In this paper, we seek to identify the reasons users are drawn to cyberspace in Kohkiloye and Boyer…
373 runs - 0 likes - 7 downloads - 7 reach - 7 impact
100 instances - 6 features - 2 classes - 0 missing values
PMLB version of the Titanic dataset, which only uses 3 features. See version 1 for the complete version: https://www.openml.org/d/40945
35 runs - 0 likes - 1 downloads - 1 reach - 16 impact
2201 instances - 4 features - 2 classes - 0 missing values
A dataset relating characteristics of telephony account features and usage and whether or not the customer churned. Originally used in [Discovering Knowledge in Data: An Introduction to Data…
7482 runs - 2 likes - 7 downloads - 9 reach - 18 impact
5000 instances - 21 features - 2 classes - 0 missing values
Forecasting skewed biased stochastic ozone days: analyses, solutions and beyond, Knowledge and Information Systems, Vol. 14, No. 3, 2008. 1. Abstract: Two ground ozone level data sets are included in…
187955 runs - 0 likes - 16 downloads - 16 reach - 21 impact
2534 instances - 73 features - 2 classes - 0 missing values
QSAR biodegradation Data Set * Abstract: Data set containing values for 41 attributes (molecular descriptors) used to classify 1055 chemicals into 2 classes (ready and not ready biodegradable). *…
267507 runs - 1 likes - 20 downloads - 21 reach - 20 impact
1055 instances - 42 features - 2 classes - 0 missing values
__Major changes w.r.t. version 1: deactivated first two variables as they describe the batch of the experiments and should not be used for prediction. Also transformed the target from numeric to…
8775 runs - 0 likes - 3 downloads - 3 reach - 7 impact
540 instances - 21 features - 2 classes - 0 missing values
__Changes w.r.t. version 1: renamed variables such that they match description.__ ### Dataset: Wilt Data Set ### Abstract: High-resolution Remote Sensing data set (Quickbird). Small number of training…
10946 runs - 0 likes - 1 downloads - 1 reach - 14 impact
4839 instances - 6 features - 2 classes - 0 missing values
The aim of this dataset is to distinguish between nasal (class 0) and oral sounds (class 1). Five different attributes were chosen to characterize each vowel: they are the amplitudes of the five first…
218302 runs - 5 likes - 34 downloads - 39 reach - 23 impact
5404 instances - 6 features - 2 classes - 0 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code ID's were removed. ! Note that there is also a related Breast Cancer Wisconsin (Original) Data Set with a different set of…
226562 runs - 3 likes - 35 downloads - 38 reach - 21 impact
569 instances - 31 features - 2 classes - 0 missing values
Author: Volker Lohweg (University of Applied Sciences, Ostwestfalen-Lippe) Source: [UCI](https://archive.ics.uci.edu/ml/datasets/banknote+authentication) - 2012 Please cite:…
137675 runs - 5 likes - 29 downloads - 34 reach - 24 impact
1372 instances - 5 features - 2 classes - 0 missing values
### Description __Changes to version 1:__ all categorical features transformed as such. This dataset represents a set of possible advertisements on Internet pages. ### Sources (a) Creator and donor:…
1430 runs - 0 likes - 3 downloads - 3 reach - 15 impact
3279 instances - 1559 features - 2 classes - 0 missing values
#### 1. Summary This dataset contains attributes of dresses and their recommendations according to their sales. Sales are monitored on alternate days. The attributes analyzed are:…
19054 runs - 1 likes - 6 downloads - 7 reach - 12 impact
500 instances - 13 features - 2 classes - 835 missing values
This data set contains 416 liver patient records and 167 non-liver patient records. The data set was collected from the north east of Andhra Pradesh, India. The class label divides the patients into 2…
154859 runs - 1 likes - 21 downloads - 22 reach - 20 impact
583 instances - 11 features - 2 classes - 0 missing values
#### Abstract: MADELON is an artificial dataset, which was part of the NIPS 2003 feature selection challenge. This is a two-class classification problem with continuous input variables. The difficulty…
100887 runs - 0 likes - 17 downloads - 17 reach - 20 impact
2600 instances - 501 features - 2 classes - 0 missing values
### Description The data consists of real historical data collected from 2010 & 2011. Employees are manually allowed or denied access to resources over time. The data is used to create an algorithm…
35323 runs - 0 likes - 18 downloads - 18 reach - 21 impact
32769 instances - 10 features - 2 classes - 0 missing values
### Description Cylinder bands UCI dataset - Process delays known as cylinder banding in rotogravure printing were substantially mitigated using control rules discovered by decision tree induction.…
21477 runs - 0 likes - 8 downloads - 8 reach - 20 impact
540 instances - 40 features - 2 classes - 999 missing values
The data is related to direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was…
65398 runs - 2 likes - 33 downloads - 35 reach - 23 impact
45211 instances - 17 features - 2 classes - 0 missing values
Context "Predict behavior to retain customers. You can analyze all relevant customer data and develop focused customer retention programs." [IBM Sample Data Sets] Content Each row represents a…
0 runs - 1 likes - 1 downloads - 2 reach - 2 impact
7043 instances - 20 features - 2 classes - 0 missing values
xxx
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
891 instances - 8 features - 2 classes - 689 missing values
"The speech dataset was also provided by (see citation request) and contains real world data from recorded English language. The normal class contains data from persons having an American accent…
1599 runs - 0 likes - 6 downloads - 6 reach - 15 impact
3686 instances - 401 features - 2 classes - 0 missing values
This data was gathered from participants in experimental speed dating events from 2002-2004. During the events, the attendees would have a four-minute "first date" with every other participant of the…
28060 runs - 19 likes - 157 downloads - 176 reach - 31 impact
8378 instances - 123 features - 2 classes - 18372 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
11 runs - 1 likes - 1 downloads - 2 reach - 12 impact
20000 instances - 4297 features - 2 classes - 0 missing values
Predict a biological response of molecules from their chemical properties. Each row in this data set represents a molecule. The first column contains experimental data describing an actual biological…
48338 runs - 2 likes - 37 downloads - 39 reach - 27 impact
3751 instances - 1777 features - 2 classes - 0 missing values
DESCRIPTIVE ABSTRACT: The data set contains the oral, written and combined test scores for 2003 New Haven Fire Department promotion exams. The Race and Position for each test taker are also given.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
118 instances - 6 features - 2 classes - 0 missing values
No data.
867 runs - 0 likes - 11 downloads - 11 reach - 7 impact
39366 instances - 10 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
694 runs - 0 likes - 7 downloads - 7 reach - 12 impact
83 instances - 4 features - 2 classes - 0 missing values
Data on educational transitions for a sample of 500 Irish schoolchildren aged 11 in 1967. The data were collected by Greaney and Kelleghan (1984), and reanalyzed by Raftery and Hout (1985, 1993). ###…
16022 runs - 0 likes - 16 downloads - 16 reach - 23 impact
500 instances - 6 features - 2 classes - 32 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
886 runs - 0 likes - 8 downloads - 8 reach - 13 impact
264 instances - 5 features - 2 classes - 0 missing values
NAME: Sonar, Mines vs. Rocks SUMMARY: This is the data set used by Gorman and Sejnowski in their study of the classification of sonar signals using a neural network [1]. The task is to train a network…
2366 runs - 1 likes - 25 downloads - 26 reach - 7 impact
208 instances - 61 features - 2 classes - 0 missing values
Attribute information: ``` sick, negative. | classes age: continuous. sex: M, F. on thyroxine: f, t. query on thyroxine: f, t. on antithyroid medication: f, t. sick: f, t. pregnant: f, t. thyroid…
19940 runs - 0 likes - 31 downloads - 31 reach - 7 impact
3772 instances - 30 features - 2 classes - 6064 missing values
### Description Scene recognition dataset - It contains characteristics about images and their classes. The original dataset is a multi-label classification problem with 6 different labels: {Beach,…
89930 runs - 0 likes - 23 downloads - 23 reach - 23 impact
2407 instances - 300 features - 2 classes - 0 missing values
Oil dataset Past Usage: 1. Kubat, M., Holte, R.,
204 runs - 3 likes - 19 downloads - 22 reach - 22 impact
937 instances - 50 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
108666 runs - 1 likes - 14 downloads - 15 reach - 32 impact
554 instances - 7 features - 2 classes - 0 missing values
Yeast dataset Past Usage: André Elisseeff and Jason Weston. A kernel method for multi-labelled classification. In Thomas G. Dietterich, Susan Becker, and Zoubin Ghahramani, editors, Advances in…
139 runs - 0 likes - 8 downloads - 8 reach - 12 impact
2417 instances - 117 features - 2 classes - 0 missing values
Mammography dataset Past Usage: 1. Woods, K., Doss, C., Bowyer, K., Solka, J., Priebe, C.,
218 runs - 4 likes - 47 downloads - 51 reach - 22 impact
11183 instances - 7 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
358449 runs - 2 likes - 18 downloads - 20 reach - 37 impact
556 instances - 7 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1034 runs - 0 likes - 10 downloads - 10 reach - 12 impact
100 instances - 7 features - 2 classes - 0 missing values
Schizophrenic Eye-Tracking Data in Rubin and Wu (1997) Biometrics. Yingnian Wu (wu@hustat.harvard.edu) [14/Oct/97] Information about the dataset CLASSTYPE: nominal CLASSINDEX: last
748 runs - 0 likes - 7 downloads - 7 reach - 21 impact
340 instances - 15 features - 2 classes - 834 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
104 runs - 0 likes - 4 downloads - 4 reach - 12 impact
52 instances - 10 features - 2 classes - 0 missing values
Data file: This data from "Problem-Solving" on "backache in pregnancy" is in somewhat different format from that listed in the book. Each integer is preceded by a space. This makes it easier to read.…
174 runs - 0 likes - 6 downloads - 6 reach - 13 impact
180 instances - 33 features - 2 classes - 0 missing values
Dataset from `Pattern Recognition and Neural Networks' by B.D. Ripley. Cambridge University Press (1996) ISBN 0-521-46086-7. The background to the datasets is described in section 1.4; this file…
1105 runs - 0 likes - 7 downloads - 7 reach - 13 impact
250 instances - 3 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
698 runs - 0 likes - 6 downloads - 6 reach - 12 impact
97 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
965 runs - 0 likes - 9 downloads - 9 reach - 12 impact
137 instances - 8 features - 2 classes - 0 missing values
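The mean-threshold binarization these descriptions refer to can be sketched as below. Because the description is cut off before it says which side of the mean maps to which class, the neutral label names here are placeholders and not the dataset's actual class values; the function name is also hypothetical.

```python
from statistics import mean

def binarize_by_mean(targets):
    """Turn a numeric target into two nominal classes split at the mean."""
    threshold = mean(targets)
    # Label names are illustrative placeholders, not the dataset's class values.
    return ["below_mean" if t < threshold else "at_or_above_mean" for t in targets]

# Mean of the toy targets is 4.0, so only the last value falls on the upper side.
labels = binarize_by_mean([1.0, 2.0, 3.0, 10.0])
```

Splitting at the mean rather than the median means a skewed target can produce imbalanced classes, as the toy example shows.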
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
747 runs - 0 likes - 12 downloads - 12 reach - 13 impact
4177 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
747 runs - 0 likes - 6 downloads - 6 reach - 13 impact
200 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
766 runs - 0 likes - 7 downloads - 7 reach - 12 impact
100 instances - 51 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
753 runs - 0 likes - 8 downloads - 8 reach - 13 impact
508 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1043 runs - 0 likes - 10 downloads - 10 reach - 12 impact
125 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
608 runs - 0 likes - 9 downloads - 9 reach - 13 impact
1000 instances - 26 features - 2 classes - 0 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code ID's were removed. ! Note that there is also a related Breast Cancer Wisconsin (Diagnosis) Data Set with a different set of…
28321 runs - 1 likes - 20 downloads - 21 reach - 7 impact
699 instances - 10 features - 2 classes - 16 missing values
Citation Request: This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
2009 runs - 1 likes - 35 downloads - 36 reach - 7 impact
286 instances - 10 features - 2 classes - 9 missing values
1. Title: Hepatitis Domain 2. Sources: (a) unknown (b) Donor: G.Gong (Carnegie-Mellon University) via Bojan Cestnik Jozef Stefan Institute Jamova 39 61000 Ljubljana Yugoslavia (tel.: (38)(+61) 214-399…
2134 runs - 1 likes - 12 downloads - 13 reach - 7 impact
155 instances - 20 features - 2 classes - 167 missing values
This radar data was collected by a system in Goose Bay, Labrador. This system consists of a phased array of 16 high-frequency antennas with a total transmitted power on the order of 6.4 kilowatts. See…
2484 runs - 3 likes - 27 downloads - 30 reach - 9 impact
351 instances - 35 features - 2 classes - 0 missing values
This database contains 13 attributes (which have been extracted from a larger set of 75). Attribute Information: 1. age; 2. sex; 3. chest pain type (4 values); 4.…
3214 runs - 0 likes - 19 downloads - 19 reach - 9 impact
270 instances - 14 features - 2 classes - 0 missing values
1. Title: 1984 United States Congressional Voting Records Database 2. Source Information: (a) Source: Congressional Quarterly Almanac, 98th Congress, 2nd session 1984, Volume XL: Congressional…
2262 runs - 0 likes - 17 downloads - 17 reach - 7 impact
435 instances - 17 features - 2 classes - 392 missing values
1. Title: Chess End-Game -- King+Rook versus King+Pawn on a7 (usually abbreviated KRKPA7). The pawn on a7 means it is one square away from queening. It is the King+Rook's side (white) to move. 2.…
273605 runs - 0 likes - 38 downloads - 38 reach - 11 impact
3196 instances - 37 features - 2 classes - 0 missing values
Date: Tue, 15 Nov 88 15:44:08 EST From: stan To: aha@ICS.UCI.EDU 1. Title: Final settlements in labor negotiations in Canadian industry 2. Source Information -- Creators:…
7681 runs - 0 likes - 16 downloads - 16 reach - 9 impact
57 instances - 17 features - 2 classes - 326 missing values
This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect the confidentiality of the data. This dataset is interesting because…
25075 runs - 1 likes - 33 downloads - 34 reach - 9 impact
690 instances - 16 features - 2 classes - 67 missing values
### Description This dataset describes mushrooms in terms of their physical characteristics. They are classified into: poisonous or edible. ### Source ``` (a) Origin: Mushroom records are drawn from…
16392 runs - 1 likes - 42 downloads - 43 reach - 10 impact
8124 instances - 23 features - 2 classes - 2480 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) Database of surgeries on horses. Possible class attributes: 24 (whether lesion is surgical), others include: 23, 25, 26, and 27 Notes: * Hospital_Number…
236 runs - 0 likes - 9 downloads - 9 reach - 7 impact
368 instances - 27 features - 2 classes - 1927 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) In this version (version 2), some features were removed. It is unclear why or how this was done.
1883 runs - 1 likes - 10 downloads - 11 reach - 7 impact
368 instances - 23 features - 2 classes - 1927 missing values
Prediction task is to determine whether a person makes over 50K a year. Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the…
2671 runs - 1 likes - 32 downloads - 33 reach - 9 impact
48842 instances - 15 features - 2 classes - 6465 missing values
Compilation of promoters with known transcriptional start points for E. coli genes. The task is to recognize promoters in strings that represent nucleotides (one of A, G, T, or C). A promoter is a…
138 runs - 1 likes - 9 downloads - 10 reach - 9 impact
106 instances - 59 features - 2 classes - 0 missing values
The dataset (originally named ELEC2) contains 45,312 instances dated from 7 May 1996 to 5 December 1998. Each example of the dataset refers to a period of 30 minutes, i.e. there are 48 instances for…
106854 runs - 3 likes - 36 downloads - 39 reach - 9 impact
45312 instances - 9 features - 2 classes - 0 missing values
1. Title: Space Shuttle Autolanding Domain 2. Sources: (a) Original source: unknown -- NASA: Mr. Roger Burke's autolander design team (b) Donor: Bojan Cestnik Jozef Stefan Institute Jamova 39 61000…
1466 runs - 0 likes - 9 downloads - 9 reach - 7 impact
15 instances - 7 features - 2 classes - 26 missing values
No data.
1457 runs - 0 likes - 12 downloads - 12 reach - 7 impact
39366 instances - 10 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1030 runs - 0 likes - 8 downloads - 8 reach - 12 impact
132 instances - 4 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1116 runs - 0 likes - 9 downloads - 9 reach - 12 impact
120 instances - 4 features - 2 classes - 0 missing values
1. Title: INDUCE Trains Data set 2. Sources: - Donor: GMU, Center for AI, Software Librarian, Eric E. Bloedorn (bloedorn@aic.gmu.edu) - Original owners: Ryszard S. Michalski (michalski@aic.gmu.edu)…
1973 runs - 0 likes - 9 downloads - 9 reach - 12 impact
10 instances - 33 features - 2 classes - 51 missing values
Publication Request: This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1789 runs - 0 likes - 12 downloads - 12 reach - 7 impact
294 instances - 14 features - 2 classes - 782 missing values
This database encodes the complete set of possible board configurations at the end of tic-tac-toe games, where "x" is assumed to have played first. The target concept is "win for x" (i.e., true when…
386329 runs - 2 likes - 75 downloads - 77 reach - 8 impact
958 instances - 10 features - 2 classes - 0 missing values
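As a rough illustration of the "win for x" target concept, the check below assumes the usual tic-tac-toe rule (three "x" marks in a row, column, or diagonal), since the description is cut off before stating it. The nine-cell row-major board and the cell values "x", "o", and "b" (blank) are assumptions made for this sketch.

```python
# Index triples for the eight winning lines on a 3x3 board stored row-major.
LINES = [
    (0, 1, 2), (3, 4, 5), (6, 7, 8),  # rows
    (0, 3, 6), (1, 4, 7), (2, 5, 8),  # columns
    (0, 4, 8), (2, 4, 6),             # diagonals
]

def win_for_x(board):
    """Return True if any winning line is filled entirely with 'x'."""
    return any(all(board[i] == "x" for i in line) for line in LINES)

# Hypothetical end-game boards: the first has x across the top row.
top_row_win = ["x", "x", "x", "o", "o", "b", "b", "b", "b"]
drawn_board = ["x", "o", "x", "o", "x", "o", "o", "x", "o"]
results = (win_for_x(top_row_win), win_for_x(drawn_board))
```

Here `results` pairs a winning board with a drawn one, mirroring the two classes of the dataset.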
Publication Request: This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1763 runs - 0 likes - 10 downloads - 10 reach - 7 impact
303 instances - 14 features - 2 classes - 7 missing values
1. Title: Haberman's Survival Data 2. Sources: (a) Donor: Tjen-Sien Lim (limt@stat.wisc.edu) (b) Date: March 4, 1999 3. Past Usage: 1. Haberman, S. J. (1976). Generalized Residuals for Log-Linear…
3241 runs - 1 likes - 19 downloads - 20 reach - 7 impact
306 instances - 4 features - 2 classes - 0 missing values
SPECTF heart data This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks. NOTE: See the…
1103 runs - 0 likes - 12 downloads - 12 reach - 13 impact
349 instances - 45 features - 2 classes - 0 missing values
SPECT heart data This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks. Sources: --…
1296 runs - 1 likes - 12 downloads - 13 reach - 14 impact
267 instances - 23 features - 2 classes - 0 missing values
Vehicle classification in distributed sensor networks. Journal of Parallel and Distributed Computing, 64(7):826-838, July 2004. This is the SensIT Vehicle (combined) dataset, retrieved 2013-11-14 from…
403 runs - 0 likes - 22 downloads - 22 reach - 14 impact
98528 instances - 101 features - 2 classes - 0 missing values
Fast training of support vector machines using sequential minimal optimization. In Bernhard Schölkopf, Christopher J. C. Burges, and Alexander J. Smola, editors, Advances in Kernel Methods - Support…
564 runs - 0 likes - 11 downloads - 11 reach - 21 impact
36974 instances - 124 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
103 runs - 0 likes - 4 downloads - 4 reach - 12 impact
92 instances - 11 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
119 runs - 0 likes - 5 downloads - 5 reach - 12 impact
50 instances - 7 features - 2 classes - 0 missing values
87 persons with lupus nephritis. Followed up 15+ years. 35 deaths. Var = duration of disease. Over 40 baseline variables available from authors. Description : For description of this data set arising…
737 runs - 0 likes - 9 downloads - 9 reach - 12 impact
87 instances - 4 features - 2 classes - 0 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
15930 runs - 0 likes - 19 downloads - 19 reach - 23 impact
672 instances - 10 features - 2 classes - 1200 missing values
February 23, 1982 The 1982 annual meetings of the American Statistical Association (ASA) will be held August 16-19, 1982 in Cincinnati. At that meeting, the ASA Committee on Statistical Graphics plans…
759 runs - 0 likes - 9 downloads - 9 reach - 21 impact
209 instances - 9 features - 2 classes - 15 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
722 runs - 0 likes - 6 downloads - 6 reach - 12 impact
60 instances - 8 features - 2 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
668 runs - 0 likes - 6 downloads - 6 reach - 12 impact
87 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
118 runs - 0 likes - 3 downloads - 3 reach - 13 impact
195 instances - 12 features - 2 classes - 2 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
670 runs - 0 likes - 4 downloads - 4 reach - 12 impact
62 instances - 8 features - 2 classes - 8 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
602 runs - 1 likes - 12 downloads - 13 reach - 13 impact
13750 instances - 41 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
753 runs - 0 likes - 10 downloads - 10 reach - 13 impact
8192 instances - 13 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
575 runs - 0 likes - 9 downloads - 9 reach - 13 impact
1000 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1190 runs - 0 likes - 9 downloads - 9 reach - 12 impact
111 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
856 runs - 0 likes - 11 downloads - 11 reach - 13 impact
209 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
745 runs - 0 likes - 10 downloads - 10 reach - 13 impact
3107 instances - 7 features - 2 classes - 0 missing values