Data
Filter results by:
uci adult partitioned
0 runs0 likes0 downloads0 reach8 impact
48844 instances - 17 features - classes - 6495 missing values
diabetes
0 runs0 likes0 downloads0 reach6 impact
768 instances - 9 features - classes - 0 missing values
No data.
332 runs0 likes4 downloads4 reach11 impact
1000000 instances - 17 features - 2 classes - 0 missing values
The data are a subsample of 500 observations from a data set that originate in a study where air pollution at a road is related to traffic volume and meteorological variables, collected by the…
2 runs0 likes1 downloads1 reach13 impact
500 instances - 8 features - 0 classes - 0 missing values
#modelage
28 runs0 likes0 downloads0 reach8 impact
202 instances - 13 features - 3 classes - 202 missing values
nfl_games
0 runs0 likes0 downloads0 reach8 impact
16274 instances - 12 features - classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach13 impact
400 instances - 8 features - 0 classes - 0 missing values
The data are a subsample of 500 observations from a data set that originate in a study where air pollution at a road is related to traffic volume and meteorological variables, collected by the…
2 runs0 likes1 downloads1 reach13 impact
500 instances - 8 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by Allison, T. and Cicchetti, D.…
0 runs0 likes1 downloads1 reach9 impact
62 instances - 8 features - 0 classes - 12 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach13 impact
400 instances - 8 features - 0 classes - 0 missing values
``**Author**: Cigdem Inan Aci","Mehmet Fatih Akay ### Data Set Information All simulations have done under the software named OPNET Modeler. Message passing is used as the communication mechanism in…
0 runs0 likes0 downloads0 reach8 impact
640 instances - 10 features - classes - 0 missing values
This is an artificial data set with dependencies between the attribute values. The cases are generated using the following method: X1 : uniformly distributed over [-5,5] X2 : uniformly distributed…
3 runs1 likes5 downloads6 reach13 impact
40768 instances - 11 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
0 runs0 likes0 downloads0 reach13 impact
47 instances - 8 features - 0 classes - 0 missing values
this is titanic survival prediction
0 runs0 likes2 downloads2 reach7 impact
891 instances - 8 features - 0 classes - 0 missing values
titanic surviual prediction
0 runs0 likes2 downloads2 reach7 impact
891 instances - 8 features - 0 classes - 0 missing values
this is titanic survival prediction
0 runs0 likes3 downloads3 reach7 impact
891 instances - 8 features - 0 classes - 0 missing values
titanic surviual prediction
0 runs0 likes0 downloads0 reach8 impact
891 instances - 8 features - 0 classes - 0 missing values
titanic surviual prediction
0 runs0 likes0 downloads0 reach7 impact
891 instances - 8 features - classes - 0 missing values
titanic surviual prediction
0 runs0 likes0 downloads0 reach7 impact
891 instances - 8 features - classes - 0 missing values
titanic surviual prediction
0 runs0 likes0 downloads0 reach7 impact
891 instances - 8 features - classes - 0 missing values
titanic surviual prediction
6 runs0 likes0 downloads0 reach8 impact
891 instances - 8 features - classes - 0 missing values
titanic surviual prediction
0 runs0 likes1 downloads1 reach7 impact
891 instances - 8 features - 0 classes - 0 missing values
titanic surviual prediction
0 runs0 likes1 downloads1 reach7 impact
891 instances - 8 features - 0 classes - 0 missing values
Make target (age) numeric**Author**: 1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of…
0 runs0 likes0 downloads0 reach0 impact
4177 instances - 9 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach13 impact
4052 instances - 8 features - 0 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
4 runs0 likes2 downloads2 reach17 impact
2984 instances - 145 features - 2 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
1 runs0 likes0 downloads0 reach14 impact
31406 instances - 23 features - 2 classes - 29756 missing values
Normalized form of codrna (351) Andrew V Uzilov, Joshua M Keegan, and David H Mathews. Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change. BMC…
309 runs0 likes5 downloads5 reach9 impact
488565 instances - 9 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach9 impact
78732 instances - 11 features - 0 classes - 0 missing values
No data.
2198 runs1 likes16 downloads17 reach9 impact
1484 instances - 9 features - 10 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
104 runs0 likes4 downloads4 reach14 impact
52 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
760 runs0 likes13 downloads13 reach15 impact
8192 instances - 9 features - 2 classes - 0 missing values
DATA-SETS FROM DIGGLE, P.J. (1990). TIME SERIES : A BIOSTATISTICAL INTRODUCTION. Oxford University Press. Table: Table A2 Wool prices Information about the dataset CLASSTYPE: numeric CLASSINDEX: none…
626 runs0 likes6 downloads6 reach14 impact
310 instances - 9 features - 9 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach13 impact
400 instances - 8 features - 0 classes - 0 missing values
1. Title: Pima Indians Diabetes Database 2. Sources: (a) Original owners: National Institute of Diabetes and Digestive and Kidney Diseases (b) Donor of database: Vincent Sigillito…
202119 runs6 likes89 downloads95 reach15 impact
768 instances - 9 features - 2 classes - 0 missing values
Date: Tue, 15 Nov 88 15:44:08 EST From: stan To: aha@ICS.UCI.EDU 1. Title: Final settlements in labor negotitions in Canadian industry 2. Source Information -- Creators:…
7681 runs0 likes16 downloads16 reach11 impact
57 instances - 17 features - 2 classes - 326 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
733 runs0 likes4 downloads4 reach14 impact
87 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
112 runs0 likes5 downloads5 reach14 impact
42 instances - 10 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
118 runs0 likes3 downloads3 reach15 impact
228 instances - 9 features - 2 classes - 20 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
589 runs0 likes12 downloads12 reach15 impact
22784 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
581 runs0 likes11 downloads11 reach15 impact
20640 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
761 runs0 likes14 downloads14 reach15 impact
8192 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
796 runs0 likes9 downloads9 reach15 impact
8192 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
107 runs0 likes3 downloads3 reach14 impact
74 instances - 9 features - 2 classes - 0 missing values
No data.
747 runs0 likes7 downloads7 reach15 impact
369 instances - 9 features - 2 classes - 0 missing values
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
485 runs0 likes5 downloads5 reach13 impact
76 instances - 15 features - 7 classes - 37 missing values
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
908 runs0 likes9 downloads9 reach14 impact
130 instances - 9 features - 2 classes - 0 missing values
No data.
748 runs0 likes7 downloads7 reach15 impact
274 instances - 9 features - 2 classes - 0 missing values
A family of datasets synthetically generated from a simulation of how bank-customers choose their banks. Tasks are based on predicting the fraction of bank customers who leave the bank because of full…
0 runs0 likes6 downloads6 reach13 impact
8192 instances - 9 features - 0 classes - 0 missing values
A 4-class version of breast-tissue dataset.
299 runs0 likes4 downloads4 reach13 impact
106 instances - 10 features - 4 classes - 0 missing values
Concrete is the most important material in civil engineering. The concrete compressive strength is a highly nonlinear function of age and ingredients. These ingredients include cement, blast furnace…
3 runs1 likes3 downloads4 reach13 impact
1030 instances - 9 features - classes - 0 missing values
In the early 2000s, Billy Beane and Paul DePodesta worked for the Oakland Athletics. While there, they literally changed the game of baseball. They didn't do it using a bat or glove, and they…
0 runs0 likes7 downloads7 reach12 impact
1232 instances - 15 features - 0 classes - 3600 missing values
Twenty two observations of the Dwarf planet Ceres as observed by Giueseppe Piazzi and published in the September edition of Monatlicher Correspondenz in 1801. These were the measurements used by Gauss…
0 runs0 likes0 downloads0 reach10 impact
22 instances - 9 features - classes - 17 missing values
shuttle-pmlb
10 runs0 likes4 downloads4 reach23 impact
58000 instances - 10 features - 7 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
0 runs0 likes0 downloads0 reach13 impact
42 instances - 10 features - 0 classes - 0 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach14 impact
316 instances - 12 features - 0 classes - 56 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach8 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach8 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach8 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes2 downloads2 reach8 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
This dataset contains 10962 houses to rent with 13 diferent features. Some values in the dataset can be considered as outliers for further analyses. Bear in mind that the Web Crawler was used only to…
0 runs0 likes0 downloads0 reach4 impact
10692 instances - 13 features - 0 classes - 0 missing values
This is a family of datasets synthetically generated from a realistic simulation of the dynamics of a Unimation Puma 560 robot arm. There are eight datastets in this family . In this repository we…
2 runs0 likes5 downloads5 reach9 impact
8192 instances - 9 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach13 impact
74 instances - 9 features - 0 classes - 0 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
12 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
#modelage
87 runs0 likes0 downloads0 reach8 impact
224 instances - 20 features - 6 classes - 205 missing values
No data.
75 runs0 likes3 downloads3 reach9 impact
137781 instances - 10 features - 7 classes - 0 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
This database was designed on the basis of data provided by US Census Bureau [http://www.census.gov] (under Lookup Access [http://www.census.gov/cdrom/lookup]: Summary Tape File 1). The data were…
2 runs1 likes3 downloads4 reach9 impact
22784 instances - 9 features - 0 classes - 0 missing values
UserID
0 runs0 likes0 downloads0 reach8 impact
1974675 instances - 10 features - classes - 1974675 missing values
web services evaluations in this table
0 runs0 likes0 downloads0 reach9 impact
1974675 instances - 10 features - classes - 1974675 missing values
This is data set is concerned with the forward kinematics of an 8 link robot arm. Among the existing variants of this data set we have used the variant 8nm, which is known to be highly non-linear and…
19 runs0 likes7 downloads7 reach9 impact
8192 instances - 9 features - 0 classes - 0 missing values
This dataset is taken from the Places Rated Almanac, by Richard Boyer and David Savageau, copyrighted and published by Rand McNally. This book order (SBN) number is 0-528-88008-X, and it retails for…
2 runs0 likes7 downloads7 reach13 impact
329 instances - 9 features - 0 classes - 0 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
Experiment data obtained by running random configurations of the hnsw kNN through mlr on 116 different classification tasks from openml.
0 runs0 likes0 downloads0 reach7 impact
111753 instances - 13 features - classes - 0 missing values
Experiment data obtained by running random configurations of an SVM through mlr on 106 different classification tasks from openml.
0 runs0 likes0 downloads0 reach7 impact
540576 instances - 15 features - classes - 658962 missing values
Experiment data obtained by running random configurations of rpart through mlr on 115 different classification tasks from openml.
0 runs0 likes0 downloads0 reach7 impact
92067 instances - 12 features - classes - 0 missing values
Bike sharing systems are new generation of traditional bike rentals where whole process from membership, rental and return back has become automatic. Through these systems, user is able to easily rent…
0 runs0 likes0 downloads0 reach3 impact
17379 instances - 13 features - 0 classes - 0 missing values
* Source: JP Marques de Sá, INEB-Instituto de Engenharia Biomédica, Porto, Portugal; e-mail: jpmdesa '@' gmail.com J Jossinet, inserm, Lyon, France * Data Set Information: Impedance measurements…
280 runs0 likes5 downloads5 reach13 impact
106 instances - 10 features - 6 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
0 runs1 likes2 downloads3 reach10 impact
798964 instances - 10 features - 3 classes - 399482 missing values
Data on predicting clicks on ads in a search engine.
0 runs0 likes8 downloads8 reach13 impact
1496391 instances - 10 features - 2 classes - 0 missing values
This data set contains 416 liver patient records and 167 non liver patient records.The data set was collected from north east of Andhra Pradesh, India. The class label divides the patients into 2…
154859 runs2 likes23 downloads25 reach26 impact
583 instances - 11 features - 2 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 0.1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
63420 runs0 likes17 downloads17 reach25 impact
39948 instances - 10 features - 2 classes - 0 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 0.1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
0 runs0 likes0 downloads0 reach0 impact
39948 instances - 10 features - 2 classes - 0 missing values
1. Title: Glass Identification Database 2. Sources: (a) Creator: B. German -- Central Research Establishment Home Office Forensic Science Service Aldermaston, Reading, Berkshire RG7 4PN (b) Donor:…
1776 runs0 likes50 downloads50 reach9 impact
214 instances - 10 features - 6 classes - 0 missing values
No data.
867 runs0 likes12 downloads12 reach9 impact
39366 instances - 10 features - 2 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
668 runs0 likes6 downloads6 reach14 impact
87 instances - 11 features - 2 classes - 0 missing values
Source: David Gil, dgil '@' dtic.ua.es, Lucentia Research Group, Department of Computer Technology, University of Alicante Jose Luis Girela, girela '@' ua.es, Department of Biotechnology, University…
451 runs0 likes7 downloads7 reach13 impact
100 instances - 10 features - 2 classes - 0 missing values
S&P Letters Data We collected information on the variables using all the block groups in California from the 1990 Census. In this sample a block group on average includes 1425.5 individuals living in…
0 runs0 likes6 downloads6 reach13 impact
20640 instances - 9 features - 0 classes - 0 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code ID's were removed. ! Note that there is also a related Breast Cancer Wisconsin (Diagnosis) Data Set with a different set of…
28321 runs1 likes20 downloads21 reach9 impact
699 instances - 10 features - 2 classes - 16 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
773 runs0 likes14 downloads14 reach15 impact
950 instances - 10 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
728 runs0 likes5 downloads5 reach14 impact
52 instances - 10 features - 2 classes - 0 missing values
Datasets for `Pattern Recognition and Neural Networks' by B.D. Ripley ===================================================================== Cambridge University Press (1996) ISBN 0-521-46086-7 The…
640 runs0 likes6 downloads6 reach14 impact
214 instances - 10 features - 6 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
792 runs0 likes10 downloads10 reach15 impact
214 instances - 10 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
772 runs0 likes8 downloads8 reach14 impact
214 instances - 10 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
131 runs1 likes9 downloads10 reach15 impact
990 instances - 14 features - 2 classes - 0 missing values
* Abstract: Purpose is to predict poker hands * Source - Creators: Robert Cattral (cattral '@' gmail.com) Franz Oppacher (oppacher '@' scs.carleton.ca) Carleton University, Department of Computer…
1 runs0 likes5 downloads5 reach15 impact
1025009 instances - 11 features - 10 classes - 0 missing values
* Abstract: 9-class version of poker-hand dataset, it was removed the minority class.
1 runs0 likes3 downloads3 reach14 impact
1025000 instances - 11 features - 9 classes - 0 missing values