Data
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
4324 instances - 9 features - classes - 3360 missing values
This is weather data in ARFF format.
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
14 instances - 5 features - classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Electricity usage is being treated as the…
4 runs - 0 likes - 0 downloads - 0 reach - 9 impact
55 instances - 3 features - 0 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
14 instances - 5 features - 2 classes - 0 missing values
This dataset contains all the player names and player ids, taken from Sofifa
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
11009 instances - 3 features - classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
14 instances - 5 features - 2 classes - 0 missing values
This data represents crime reported to the Seattle Police Department (SPD). Each row contains the record of a unique event where at least one criminal offense was reported by a member of the community…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
523590 instances - 8 features - 144 classes - 6916 missing values
frf r
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 3 features - classes - 0 missing values
User profile data for San Francisco OkCupid users published in [Kim, A. Y., & Escobedo-Land, A. (2015). OKCupid data for introductory statistics and data science courses. Journal of Statistics…
0 runs - 0 likes - 0 downloads - 0 reach - 10 impact
50789 instances - 20 features - 3 classes - 154107 missing values
sample
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
14 instances - 5 features - classes - 0 missing values
No data.
44 runs - 0 likes - 3 downloads - 3 reach - 11 impact
1000000 instances - 15 features - 2 classes - 0 missing values
No data.
0 runs - 0 likes - 3 downloads - 3 reach - 12 impact
116640 instances - 10 features - 0 classes - 0 missing values
User profile data for San Francisco OkCupid users published in [Kim, A. Y., & Escobedo-Land, A. (2015). OKCupid data for introductory statistics and data science courses. Journal of Statistics…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
50789 instances - 20 features - 3 classes - 154107 missing values
Outliers data set extracted from the Illustration (Fig. 3) in "Novelty detection with application to data streams"
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
75 instances - 3 features - 4 classes - 0 missing values
Airlines dataset, inspired by the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given the information of the scheduled departure. For this…
0 runs - 0 likes - 2 downloads - 2 reach - 6 impact
26969 instances - 8 features - 2 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
7 runs - 0 likes - 2 downloads - 2 reach - 9 impact
61 instances - 3 features - 0 classes - 0 missing values
chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S. Simonoff, John Wiley and…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
185 instances - 2 features - classes - 0 missing values
Data Sets for 'Regression Models for Time Series Analysis' by B. Kedem and K. Fokianos, Wiley 2002. Submitted by Kostas Fokianos (fokianos@ucy.ac.cy) [8/Nov/02] (176k) Note: - attribute names were…
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
264 instances - 3 features - 0 classes - 0 missing values
Data originating from the book "Analyzing Categorical Data" by Jeffrey S. Simonoff. ### Attribute Information: 1. Sex - Male/Female 2. Age - numerical integer range 3. Race - categorical 4. AIDS -…
1087 runs - 0 likes - 9 downloads - 9 reach - 15 impact
50 instances - 5 features - 2 classes - 0 missing values
As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning with encoding length selection. In Progress in Connectionist-Based Information Systems.…
10 runs - 1 likes - 2 downloads - 3 reach - 12 impact
195 instances - 11 features - 0 classes - 2 missing values
Tumor-size treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning with encoding length selection. In Progress in…
0 runs - 0 likes - 3 downloads - 3 reach - 12 impact
286 instances - 10 features - 0 classes - 9 missing values
chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S. Simonoff, John Wiley and…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
50 instances - 3 features - classes - 0 missing values
1. Title/Topic: cocomonasa/software cost estimation 2. Sources: -- Creators: 60 NASA projects from different centers for projects from the 1980s and 1990s. Collected by Jairus Hihn, JPL, NASA, Manager…
2 runs - 0 likes - 2 downloads - 2 reach - 13 impact
60 instances - 17 features - 0 classes - 0 missing values
1. Title: Contraceptive Method Choice 2. Sources: (a) Origin: This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey (b) Creator: Tjen-Sien Lim (limt@stat.wisc.edu)…
23889 runs - 0 likes - 19 downloads - 19 reach - 10 impact
1473 instances - 10 features - 3 classes - 0 missing values
Data on educational transitions for a sample of 500 Irish schoolchildren aged 11 in 1967. The data were collected by Greaney and Kelleghan (1984), and reanalyzed by Raftery and Hout (1985, 1993). ###…
16022 runs - 0 likes - 16 downloads - 16 reach - 25 impact
500 instances - 6 features - 2 classes - 32 missing values
Cushing's syndrome. Columns: label of the patient, tetrahydrocortisone (mg/24hr), pregnanetriol (mg/24hr), type. The type of the last six patients (u1 to u6) should be regarded as unknown.…
41 runs - 0 likes - 2 downloads - 2 reach - 14 impact
27 instances - 3 features - 4 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
511 runs - 0 likes - 6 downloads - 6 reach - 15 impact
185 instances - 3 features - 2 classes - 0 missing values
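The mean-based binarization named in the "Binarized version" entries can be sketched in a few lines. This is only an illustration: the listing truncates the rule, so which side of the mean receives which label, and the labels 'P' and 'N' themselves, are assumptions here, and `binarize_by_mean` is a hypothetical helper name.

```python
from statistics import mean


def binarize_by_mean(values, low_label="P", high_label="N"):
    """Turn a numeric target into a two-class nominal target.

    Assumption: values strictly below the mean get `low_label` and all
    others get `high_label`; the listing cuts off before stating this.
    """
    threshold = mean(values)
    return [low_label if v < threshold else high_label for v in values]


# Example: the mean of these four values is 4.0.
labels = binarize_by_mean([1.0, 2.0, 3.0, 10.0])
```

Swapping `low_label` and `high_label` gives the opposite convention if the full description turns out to assign the labels the other way round.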
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1088 runs - 0 likes - 8 downloads - 8 reach - 14 impact
132 instances - 4 features - 2 classes - 0 missing values
The data consist of 2001 observations taken from a balloon about 30 kilometres above the surface of the earth. In the section of the flight shown here the balloon increases in height. As radiation…
0 runs - 1 likes - 2 downloads - 3 reach - 14 impact
2001 instances - 2 features - 0 classes - 0 missing values
1. Title: Haberman's Survival Data 2. Sources: (a) Donor: Tjen-Sien Lim (limt@stat.wisc.edu) (b) Date: March 4, 1999 3. Past Usage: 1. Haberman, S. J. (1976). Generalized Residuals for Log-Linear…
3241 runs - 1 likes - 19 downloads - 20 reach - 9 impact
306 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
775 runs - 0 likes - 7 downloads - 7 reach - 14 impact
43 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
767 runs - 0 likes - 9 downloads - 9 reach - 15 impact
189 instances - 10 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1043 runs - 0 likes - 10 downloads - 10 reach - 14 impact
125 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1038 runs - 0 likes - 9 downloads - 9 reach - 14 impact
147 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
119 runs - 0 likes - 6 downloads - 6 reach - 14 impact
39 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
113 runs - 0 likes - 3 downloads - 3 reach - 15 impact
366 instances - 5 features - 2 classes - 1 missing values
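The majority-class relabeling named in the multi-class "Binarized version" entries can be sketched as follows. The listing only states that the majority class becomes positive ('P'); labeling every other class 'N' is an assumption, and `binarize_by_majority` is a hypothetical helper name.

```python
from collections import Counter


def binarize_by_majority(labels):
    """Relabel the most frequent class as 'P' and, by assumption, all others as 'N'."""
    # most_common(1) returns a one-element list of (label, count) pairs.
    majority, _count = Counter(labels).most_common(1)[0]
    return ["P" if y == majority else "N" for y in labels]


# Example: "a" occurs twice and is therefore the majority class.
relabeled = binarize_by_majority(["a", "b", "a", "c"])
```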
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs - 0 likes - 7 downloads - 7 reach - 15 impact
1473 instances - 10 features - 2 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
731 runs - 0 likes - 5 downloads - 5 reach - 23 impact
151 instances - 7 features - 3 classes - 0 missing values
Grass Grubs and Damage Ranking Data source: R. J. Townsend AgResearch, Lincoln, New Zealand Grass grubs are one of the major insect pests of pasture in Canterbury and can cause severe pasture damage…
988 runs - 0 likes - 8 downloads - 8 reach - 15 impact
155 instances - 9 features - 4 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
771 runs - 0 likes - 9 downloads - 9 reach - 15 impact
468 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
119 runs - 0 likes - 7 downloads - 7 reach - 14 impact
50 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
762 runs - 0 likes - 6 downloads - 6 reach - 14 impact
88 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
140 runs - 0 likes - 6 downloads - 6 reach - 15 impact
194 instances - 29 features - 2 classes - 0 missing values
Dataset from 'Pattern Recognition and Neural Networks' by B.D. Ripley. Cambridge University Press (1996) ISBN 0-521-46086-7. The background to the datasets is described in section 1.4; this file…
1105 runs - 0 likes - 7 downloads - 7 reach - 15 impact
250 instances - 3 features - 2 classes - 0 missing values
Human Development Index [DATA] United Nations Development Program compiled an Index of Human Development. Column 1: Country(character) 2: Index 3: GNP To measure the quality of life in a nation, the…
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
130 instances - 2 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
475 instances - 4 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
475 instances - 4 features - 0 classes - 0 missing values
No data.
965 runs - 0 likes - 9 downloads - 9 reach - 9 impact
55296 instances - 10 features - 3 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
903 runs - 0 likes - 8 downloads - 8 reach - 15 impact
468 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
963 runs - 0 likes - 11 downloads - 11 reach - 15 impact
380 instances - 3 features - 2 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
0 runs - 0 likes - 2 downloads - 2 reach - 11 impact
366 instances - 5 features - classes - 2 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs - 0 likes - 0 downloads - 0 reach - 14 impact
649 instances - 3 features - 0 classes - 0 missing values
Prediction task is to determine whether a person makes over 50K a year. Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the…
2671 runs - 1 likes - 33 downloads - 34 reach - 12 impact
48842 instances - 15 features - 2 classes - 6465 missing values
Data Sets for 'Regression Models for Time Series Analysis' by B. Kedem and K. Fokianos, Wiley 2002. Submitted by Kostas Fokianos (fokianos@ucy.ac.cy) [8/Nov/02] (176k) Note: - attribute names were…
599 runs - 0 likes - 11 downloads - 11 reach - 15 impact
1024 instances - 3 features - 4 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
111 runs - 0 likes - 5 downloads - 5 reach - 14 impact
52 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1164 runs - 0 likes - 7 downloads - 7 reach - 15 impact
222 instances - 3 features - 2 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
108 instances - 4 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
100 instances - 3 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
450 instances - 4 features - 0 classes - 0 missing values
ARFF version of UCI dataset 'flags'. Creators: Collected primarily from the "Collins Gem Guide to Flags": Collins Publishers (1986). Donor: Richard S. Forsyth. Date 5/15/1990 This data file contains…
108 runs - 0 likes - 10 downloads - 10 reach - 18 impact
194 instances - 29 features - 8 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
0 runs - 0 likes - 2 downloads - 2 reach - 11 impact
100 instances - 10 features - classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1073 runs - 0 likes - 10 downloads - 10 reach - 14 impact
140 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1823 runs - 0 likes - 9 downloads - 9 reach - 14 impact
120 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
842 runs - 0 likes - 7 downloads - 7 reach - 15 impact
155 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
123 runs - 0 likes - 6 downloads - 6 reach - 14 impact
46 instances - 4 features - 2 classes - 0 missing values
17x17x2x2 tables of counts in GLIM-ready format used for the analyses in Biblarz, Timothy J., and Adrian E. Raftery. 1993. "The Effects of Family Disruption on Social Mobility." American Sociological…
3 runs - 0 likes - 1 downloads - 1 reach - 15 impact
1156 instances - 6 features - 0 classes - 0 missing values
TIME SERIES USED IN LONG-MEMORY PROCESSES, THE ALLAN VARIANCE AND WAVELETS BY D. B. PERCIVAL AND P. GUTTORP, A CHAPTER…
0 runs - 0 likes - 1 downloads - 1 reach - 13 impact
6875 instances - 1 features - 0 classes - 0 missing values
Abstract: The data was created by a medical expert as a data set to test the expert system, which will perform the presumptive diagnosis of two diseases of the urinary system. This is a…
423 runs - 0 likes - 14 downloads - 14 reach - 13 impact
120 instances - 7 features - 2 classes - 0 missing values
1. Summary: This dataset contains attributes of dresses and their recommendations according to their sales. Sales are monitored on alternate days. The attributes analyzed are:…
19054 runs - 1 likes - 6 downloads - 7 reach - 18 impact
500 instances - 13 features - 2 classes - 835 missing values
DBpedia with top-474 most frequent YAGO types HMC dataset for type prediction. Ingoing and outgoing properties as features
0 runs - 0 likes - 3 downloads - 3 reach - 11 impact
2886305 instances - 2401 features - classes - 0 missing values
analcatdata_happiness-pmlb
31 runs - 0 likes - 0 downloads - 0 reach - 20 impact
60 instances - 4 features - 3 classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
101 instances - 18 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
2580 instances - 7 features - classes - 2541 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
2580 instances - 7 features - classes - 2541 missing values
1. Title: Echocardiogram Data 2. Source Information: -- Donor: Steven Salzberg (salzberg@cs.jhu.edu) -- Collector: -- Dr. Evlin Kinney -- The Reed Institute -- P.O. Box 402603 -- Miami, FL 33140-0603…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
132 instances - 8 features - 4 classes - 103 missing values
No data.
309 runs - 0 likes - 6 downloads - 6 reach - 11 impact
1000000 instances - 35 features - 6 classes - 0 missing values
No data.
291 runs - 0 likes - 4 downloads - 4 reach - 9 impact
1000000 instances - 17 features - 7 classes - 0 missing values
This dataset contains traffic violation information from all electronic traffic violations issued in the County. Any information that can be used to uniquely identify the vehicle, the vehicle owner or…
0 runs - 1 likes - 1 downloads - 2 reach - 9 impact
70340 instances - 21 features - 3 classes - 2288 missing values
sdwd dede
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
44 instances - 2 features - classes - 0 missing values
swd dced
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
589 instances - 3 features - classes - 0 missing values
sdsw frfr
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
1556 instances - 3 features - classes - 0 missing values
dd fgrfg
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 3 features - classes - 0 missing values
efef ffrf
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
9 instances - 3 features - classes - 0 missing values
ssc vdv
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
1556 instances - 2 features - classes - 0 missing values
swd cdef
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
sxd cde
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
wded def
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
sds dcdcc
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
Download test
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
xsxs cdf
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
6 instances - 2 features - classes - 0 missing values
wdwd cd
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
efef fdfef
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
zaxa xcdc
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
scs
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
wdede
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
swdw
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
werr
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
ssf
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values