Data
Filter results by:
Experiment data obtained by running random configurations of glmnet through mlr on 114 different classification tasks from openml.
0 runs0 likes0 downloads0 reach0 impact
104820 instances - 10 features - classes - 0 missing values
Geographical Analysis Spatial Data This georeferenced data set was used in: Pace, R. Kelley, and Ronald Barry, Quick Computation of Regressions with a Spatially Autoregressive Dependent Variable,…
4 runs1 likes1 downloads2 reach7 impact
3107 instances - 7 features - 0 classes - 0 missing values
The data consist of annual observations on the level of strike volume (days lost due to industrial disputes per 1000 wage salary earners), and their covariates in 18 OECD countries from 1951-1985. The…
0 runs0 likes2 downloads2 reach7 impact
625 instances - 7 features - 0 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
16598 instances - 11 features - classes - 329 missing values
Sensor data measurements of one Boiler, containing WaterInput/SteamOutput (flow, temperature, pressure) for one month, which is measured every minute.
0 runs0 likes1 downloads1 reach3 impact
44643 instances - 8 features - classes - 44643 missing values
test
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
336 instances - 8 features - classes - 0 missing values
test3
0 runs0 likes0 downloads0 reach0 impact
2 instances - 8 features - classes - 0 missing values
This data is used to test water contamination
0 runs0 likes0 downloads0 reach0 impact
26 instances - 8 features - classes - 0 missing values
No data.
697 runs0 likes7 downloads7 reach13 impact
320 instances - 9 features - 2 classes - 0 missing values
test 123
0 runs0 likes0 downloads0 reach1 impact
26 instances - 8 features - classes - 0 missing values
Original data from https://github.com/propublica/compas-analysis/ by ProPublica. The data was subsequently preprocessed and reduced to relevant features for classification. The target variable is…
0 runs0 likes1 downloads1 reach8 impact
5278 instances - 14 features - 2 classes - 0 missing values
The data is related with direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was…
65398 runs2 likes35 downloads37 reach27 impact
45211 instances - 17 features - 2 classes - 0 missing values
* Dataset: Reduced version (10 % of the examples) of bank-marketing dataset.
1254 runs1 likes17 downloads18 reach13 impact
4521 instances - 17 features - 2 classes - 0 missing values
* Abstract: A 3-class version of abalone dataset. * Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and…
176 runs0 likes4 downloads4 reach12 impact
4177 instances - 9 features - 3 classes - 0 missing values
nominal features and target for COMPAS
0 runs0 likes1 downloads1 reach7 impact
5278 instances - 14 features - 2 classes - 0 missing values
Attribute information: ``` sick, negative. | classes age: continuous. sex: M, F. on thyroxine: f, t. query on thyroxine: f, t. on antithyroid medication: f, t. sick: f, t. pregnant: f, t. thyroid…
19940 runs0 likes31 downloads31 reach7 impact
3772 instances - 30 features - 2 classes - 6064 missing values
1. Title: Protein Localization Sites 2. Creator and Maintainer: Kenta Nakai Institue of Molecular and Cellular Biology Osaka, University 1-3 Yamada-oka, Suita 565 Japan nakai@imcb.osaka-u.ac.jp…
1806 runs0 likes13 downloads13 reach9 impact
336 instances - 8 features - 8 classes - 0 missing values
The dataset (originally named ELEC2) contains 45,312 instances dated from 7 May 1996 to 5 December 1998. Each example of the dataset refers to a period of 30 minutes, i.e. there are 48 instances for…
106854 runs3 likes38 downloads41 reach9 impact
45312 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
747 runs0 likes13 downloads13 reach13 impact
4177 instances - 9 features - 2 classes - 0 missing values
February 23, 1982 The 1982 annual meetings of the American Statistical Association (ASA) will be held August 16-19, 1982 in Cincinnati. At that meeting, the ASA Committee on Statistical Graphics plans…
759 runs0 likes9 downloads9 reach21 impact
209 instances - 9 features - 2 classes - 15 missing values
SUMMARY: Data from an experiment on the affects of machine adjustments on the time to count bolts. Data appear as the STATS (Issue 10) Challenge. DATA: Submitted by W. Robert Stephenson, Iowa State…
754 runs0 likes9 downloads9 reach12 impact
40 instances - 8 features - 2 classes - 0 missing values
; ; Thyroid disease records supplied by the Garavan Institute and J. Ross ; Quinlan, New South Wales Institute, Syndney, Australia. ; ; 1987. ; hypothyroid, primary hypothyroid, compensated…
883 runs0 likes11 downloads11 reach7 impact
3772 instances - 30 features - 4 classes - 6064 missing values
1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and Fisheries, Tasmania…
34899 runs0 likes18 downloads18 reach7 impact
4177 instances - 9 features - 28 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
698 runs0 likes6 downloads6 reach12 impact
97 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
739 runs0 likes11 downloads11 reach13 impact
4052 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
755 runs0 likes4 downloads4 reach12 impact
54 instances - 8 features - 2 classes - 120 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
554 runs0 likes10 downloads10 reach13 impact
40768 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
819 runs0 likes10 downloads10 reach13 impact
500 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
791 runs0 likes7 downloads7 reach13 impact
400 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
764 runs0 likes6 downloads6 reach13 impact
400 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
788 runs0 likes7 downloads7 reach13 impact
400 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
779 runs0 likes7 downloads7 reach13 impact
400 instances - 8 features - 2 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
722 runs0 likes6 downloads6 reach12 impact
60 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
758 runs0 likes8 downloads8 reach13 impact
500 instances - 8 features - 2 classes - 0 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) Database of surgeries on horses. Possible class attributes: 24 (whether lesion is surgical), others include: 23, 25, 26, and 27 Notes: * Hospital_Number…
236 runs0 likes9 downloads9 reach7 impact
368 instances - 27 features - 2 classes - 1927 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) In this version (version 2), some features were removed. It is unclear why of how this was done.
1883 runs1 likes10 downloads11 reach7 impact
368 instances - 23 features - 2 classes - 1927 missing values
This dataset classifies people described by a set of attributes as good or bad credit risks. This dataset comes with a cost matrix: ``` Good Bad (predicted) Good 0 1 (actual) Bad 5 0 ``` It is worse…
505929 runs19 likes236 downloads255 reach26 impact
1000 instances - 21 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
670 runs0 likes4 downloads4 reach12 impact
62 instances - 8 features - 2 classes - 8 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
737 runs0 likes5 downloads5 reach12 impact
47 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
756 runs0 likes6 downloads6 reach13 impact
310 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
815 runs0 likes8 downloads8 reach13 impact
336 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
737 runs0 likes9 downloads9 reach13 impact
3772 instances - 30 features - 2 classes - 6064 missing values
No data.
697 runs0 likes5 downloads5 reach12 impact
89 instances - 9 features - 2 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
68 runs0 likes7 downloads7 reach22 impact
32561 instances - 16 features - 2 classes - 4262 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
0 runs0 likes0 downloads0 reach5 impact
47 instances - 8 features - 0 classes - 0 missing values
Normalized form of codrna (351) Andrew V Uzilov, Joshua M Keegan, and David H Mathews. Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change. BMC…
309 runs0 likes5 downloads5 reach1 impact
488565 instances - 9 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach1 impact
78732 instances - 11 features - 0 classes - 0 missing values
This is the hip measurement data from Table B.13 in Chatfield's Problem Solving (1995, 2nd edn, Chapman and Hall). It is given in 8 columns. First 4 columns are for Control Group. Last 4 columns are…
0 runs0 likes0 downloads0 reach3 impact
54 instances - 8 features - classes - 120 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach5 impact
4052 instances - 8 features - 0 classes - 0 missing values
"The debutanizer column is part of a desulfuring and naphtha splitter plant." u1 Top temperature u2 Top pressure u3 Reflux flow u4 Flow to next process u5 6th tray temperature u6 Bottom…
0 runs0 likes1 downloads1 reach3 impact
2394 instances - 8 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) SUMMARY: Data from an experiment on the affects of machine adjustments on the time to count bolts. Data appear as the STATS (Issue 10) Challenge. DATA:…
4 runs0 likes0 downloads0 reach1 impact
40 instances - 8 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by Allison, T. and Cicchetti, D.…
0 runs0 likes1 downloads1 reach1 impact
62 instances - 8 features - 0 classes - 12 missing values
Andrew V Uzilov, Joshua M Keegan, and David H Mathews. Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change. BMC Bioinformatics, 7(173), 2006. This…
31 runs0 likes10 downloads10 reach7 impact
488565 instances - 9 features - 2 classes - 0 missing values
No data.
332 runs0 likes4 downloads4 reach2 impact
1000000 instances - 17 features - 2 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
0 runs0 likes0 downloads0 reach5 impact
31406 instances - 23 features - 2 classes - 29756 missing values
uci adult partitioned
0 runs0 likes0 downloads0 reach1 impact
48844 instances - 17 features - classes - 6495 missing values
nfl_games
0 runs0 likes0 downloads0 reach1 impact
16274 instances - 12 features - classes - 0 missing values
#modelage
28 runs0 likes0 downloads0 reach1 impact
202 instances - 13 features - 3 classes - 202 missing values
test001
0 runs1 likes0 downloads1 reach1 impact
768 instances - 9 features - classes - 0 missing values
Source: The dataset was created by Angeliki Xifara (angxifara @ gmail.com, Civil/Structural Engineer) and was processed by Athanasios Tsanas (tsanasthanasis @ gmail.com, Oxford Centre for Industrial…
103 runs1 likes4 downloads5 reach6 impact
768 instances - 10 features - 37 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-700 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of…
4537 runs0 likes7 downloads7 reach20 impact
700 instances - 13 features - 3 classes - 0 missing values
``**Author**: Cigdem Inan Aci","Mehmet Fatih Akay ### Data Set Information All simulations have done under the software named OPNET Modeler. Message passing is used as the communication mechanism in…
0 runs0 likes0 downloads0 reach1 impact
640 instances - 10 features - classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach6 impact
400 instances - 8 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
22 runs0 likes2 downloads2 reach7 impact
400 instances - 8 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach6 impact
400 instances - 8 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach6 impact
400 instances - 8 features - 0 classes - 0 missing values
The data are a subsample of 500 observations from a data set that originate in a study where air pollution at a road is related to traffic volume and meteorological variables, collected by the…
2 runs0 likes1 downloads1 reach6 impact
500 instances - 8 features - 0 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-300-drift-au7-cpd1-800 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and…
7130 runs0 likes11 downloads11 reach28 impact
1100 instances - 13 features - 5 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-cpd1-500 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity…
7145 runs0 likes7 downloads7 reach28 impact
500 instances - 13 features - 5 classes - 0 missing values