OpenML
Filter results by:
This S dump contains 22 data sets from the book Visualizing Data published by Hobart Press (books@hobart.com). The dump was created by data.dump() and can be read back into S by data.restore(). The…
2 runs0 likes1 downloads1 reach14 impact
8641 instances - 5 features - 0 classes - 0 missing values
Data Sets for 'Regression Models for Time Series Analysis' by B. Kedem and K. Fokianos, Wiley 2002. Submitted by Kostas Fokianos (fokianos@ucy.ac.cy) [8/Nov/02] (176k) Note: - attribute names were…
2 runs0 likes0 downloads0 reach13 impact
264 instances - 3 features - 0 classes - 0 missing values
This file contains data from Regression Analysis By Example, 2nd Edition, by Samprit Chatterjee and Bertram Price, John Wiley, 1991. Data sets have names of the form 'rabe.xxx' where xxx is the page…
0 runs0 likes1 downloads1 reach13 impact
40 instances - 3 features - 0 classes - 0 missing values
This file contains data from Regression Analysis By Example, 2nd Edition, by Samprit Chatterjee and Bertram Price, John Wiley, 1991. Data sets have names of the form 'rabe.xxx' where xxx is the page…
0 runs0 likes0 downloads0 reach13 impact
50 instances - 6 features - 0 classes - 0 missing values
This S dump contains 22 data sets from the book Visualizing Data published by Hobart Press (books@hobart.com). The dump was created by data.dump() and can be read back into S by data.restore(). The…
0 runs0 likes1 downloads1 reach15 impact
323 instances - 5 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
0 runs0 likes0 downloads0 reach13 impact
62 instances - 6 features - 0 classes - 0 missing values
DATA-SETS FROM DIGGLE, P.J. (1990). TIME SERIES : A BIOSTATISTICAL INTRODUCTION. Oxford University Press. Table: Table A1 Lutenizing hormone Information about the dataset CLASSTYPE: numeric…
0 runs0 likes0 downloads0 reach13 impact
48 instances - 5 features - 0 classes - 0 missing values
This submission consists of 38 files, plus this README file. Each file represents a data set analyzed in the book. The names of the files correspond to the names used in the book. The data files are…
0 runs0 likes0 downloads0 reach13 impact
400 instances - 8 features - 0 classes - 0 missing values
1. Title: Employee Rejection\Acceptance (Original ERA) 2. Source Information: Donor: Arie Ben David MIS, Dept. of Technology Management Holon Academic Inst. of Technology 52 Golomb St. Holon 58102…
5 runs0 likes1 downloads1 reach14 impact
1000 instances - 5 features - 0 classes - 0 missing values
1. Title: Social Workers Decisions (Ordinal SWD) 2. Source Information: Donor: Arie Ben David MIS, Dept. of Technology Management Holon Academic Inst. of Technology 52 Golomb St. Holon 58102 Israel…
0 runs0 likes1 downloads1 reach14 impact
1000 instances - 11 features - 0 classes - 0 missing values
1. Title: Employee Selection (Ordinal ESL) 2. Source Informaion: Donor: Arie Ben David MIS, Dept. of Technology Management Holon Academic Inst. of Technology 52 Golomb St. Holon 58102 Israel…
0 runs0 likes0 downloads0 reach13 impact
488 instances - 5 features - 0 classes - 0 missing values
Background: ========== In this paper we develop an approach to data disclosure in survey settings by adopting a probabilistic definition of disclosure due to Dalenius. Our approach is based on the…
0 runs0 likes0 downloads0 reach14 impact
662 instances - 4 features - 0 classes - 0 missing values
This software can be freely used for non-commercial purposes and can be freely distributed. Readme file =========== The data sets in this directory are taken from the above book. The data are…
2 runs0 likes0 downloads0 reach13 impact
70 instances - 8 features - 0 classes - 0 missing values
This submission consists of 38 files, plus this README file. Each file represents a data set analyzed in the book. The names of the files correspond to the names used in the book. The data files are…
22 runs0 likes2 downloads2 reach14 impact
400 instances - 8 features - 0 classes - 0 missing values
This submission consists of 38 files, plus this README file. Each file represents a data set analyzed in the book. The names of the files correspond to the names used in the book. The data files are…
0 runs0 likes0 downloads0 reach13 impact
27 instances - 11 features - 0 classes - 0 missing values
This submission consists of 38 files, plus this README file. Each file represents a data set analyzed in the book. The names of the files correspond to the names used in the book. The data files are…
0 runs0 likes0 downloads0 reach13 impact
400 instances - 8 features - 0 classes - 0 missing values
This submission consists of 38 files, plus this README file. Each file represents a data set analyzed in the book. The names of the files correspond to the names used in the book. The data files are…
0 runs0 likes0 downloads0 reach13 impact
222 instances - 3 features - 0 classes - 0 missing values
This submission consists of 38 files, plus this README file. Each file represents a data set analyzed in the book. The names of the files correspond to the names used in the book. The data files are…
0 runs0 likes0 downloads0 reach13 impact
400 instances - 8 features - 0 classes - 0 missing values
This S dump contains 22 data sets from the book Visualizing Data published by Hobart Press (books@hobart.com). The dump was created by data.dump() and can be read back into S by data.restore(). The…
0 runs0 likes1 downloads1 reach13 impact
111 instances - 4 features - 0 classes - 0 missing values
This S dump contains 22 data sets from the book Visualizing Data published by Hobart Press (books@hobart.com). The dump was created by data.dump() and can be read back into S by data.restore(). The…
0 runs0 likes2 downloads2 reach13 impact
44 instances - 4 features - 0 classes - 0 missing values
1. Title: Assessing the Reliability of a Human Estimator 2. Sources (a) Creator: Gary D. Boetticher (b) Date: February 20, 2007 (c) Contact: boetticher AT uhcl DOT edu Phone: +1 (281) 283 8305 3.…
0 runs0 likes0 downloads0 reach13 impact
75 instances - 15 features - 0 classes - 0 missing values
1. Title: Class-level data for KC1 This one includes a numeric attribute (NUMDEFECTS) to indicate defectiveness. 2. Sources (a) Creator: A. Gunes Koru (b) Date: February 21, 2005 (c) Contact: gkoru AT…
0 runs0 likes1 downloads1 reach13 impact
145 instances - 95 features - 0 classes - 0 missing values
1. Title/Topic: COCOMO NASA 2 / Software cost estimation 2. Sources: -- 93 NASA projects from different centers for projects from the following years: n year --- ---- 1 1971 1 1974 2 1975 2 1976 10…
2 runs0 likes2 downloads2 reach13 impact
93 instances - 24 features - 0 classes - 0 missing values
#### Project Data Incorporating Qualitative Factors for Improved Software Defect Qualitative and quantitative data about 31 projects completed in a consumer electronics company (one row per project).…
0 runs0 likes0 downloads0 reach13 impact
31 instances - 31 features - 0 classes - 33 missing values
Ask a home buyer to describe their dream house, and they probably won't begin with the height of the basement ceiling or the proximity to an east-west railroad. But this playground competition's…
0 runs0 likes4 downloads4 reach8 impact
1460 instances - 81 features - 0 classes - 6965 missing values
Ask a home buyer to describe their dream house, and they probably won't begin with the height of the basement ceiling or the proximity to an east-west railroad. But this playground competition's…
0 runs0 likes2 downloads2 reach8 impact
1460 instances - 80 features - 0 classes - 6965 missing values
D. Harrington\ D. Harrington, (1991), published by John Wiley & Sons NAME: PBC Data SIZE: 418 observations, 20 variables DESCRIPTIVE ABSTRACT: Below is a description of the variables recorded from the…
10 runs0 likes1 downloads1 reach12 impact
418 instances - 19 features - 0 classes - 1239 missing values
Wisconsin Prognostic Breast Cancer (WPBC) Various versions of this data have been used in the following publications: - W. N. Street, O. L. Mangasarian, and W.H. Wolberg. An inductive learning…
5 runs0 likes4 downloads4 reach9 impact
194 instances - 33 features - 0 classes - 0 missing values
University of Sao Paulo, School of Art, Sciences and Humanities, Sao Paulo, SP, Brazil ### LIBRAS Movement Database LIBRAS, acronym of the Portuguese name "LIngua BRAsileira de Sinais", is the…
0 runs0 likes4 downloads4 reach19 impact
360 instances - 91 features - 0 classes - 0 missing values
1. Title: Ozone Level Detection 2. Source: Kun Zhang zhang.kun05 '@' gmail.com Department of Computer Science, Xavier University of Lousiana Wei Fan wei.fan '@' gmail.com IBM T.J.Watson Research…
0 runs0 likes1 downloads1 reach13 impact
2536 instances - 73 features - 0 classes - 0 missing values
This data set addresses a control problem, namely flying a F16 aircraft. The attributes describe the status of the aeroplane, while the goal is to predict the control action on the ailerons of the…
0 runs0 likes6 downloads6 reach14 impact
13750 instances - 41 features - 0 classes - 0 missing values
1. Hungarian Institute of Cardiology. Budapest: Andras Janosi, M.D. 2. University Hospital, Zurich, Switzerland: William Steinbrunn, M.D. 3. University Hospital, Basel, Switzerland: Matthias…
10 runs0 likes0 downloads0 reach9 impact
294 instances - 14 features - 0 classes - 782 missing values
The Computer Activity databases are a collection of computer systems activity measures. The data was collected from a Sun Sparcstation 20/712 with 128 Mbytes of memory running in a multi-user…
5 runs1 likes2 downloads3 reach9 impact
8192 instances - 13 features - 0 classes - 0 missing values
sjoear. Laengelmaevesi. T.H.Jaervi: Finlands Fiskeriet Band 4, Meddelanden utgivna av fiskerifoereningen i Finland. Helsingfors 1917 Weight treated as the class attribute. Identifier deleted. As used…
10 runs0 likes2 downloads2 reach12 impact
158 instances - 8 features - 0 classes - 87 missing values
The problem concerns Relative CPU Performance Data. The used attributes are : ``` MYCT: machine cycle time in nanoseconds (integer) MMIN: minimum main memory in kilobytes (integer) MMAX: maximum main…
2 runs0 likes2 downloads2 reach12 impact
209 instances - 7 features - 0 classes - 0 missing values
This is a family of datasets synthetically generated from a realistic simulation of the dynamics of a Unimation Puma 560 robot arm. There are eight datastets in this family. In this repository we only…
2 runs0 likes5 downloads5 reach10 impact
8192 instances - 9 features - 0 classes - 0 missing values
This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one that has been used by ML researchers to…
37 runs0 likes5 downloads5 reach9 impact
303 instances - 14 features - 0 classes - 6 missing values
This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics; (b) its assigned insurance risk rating,; (c) its normalized losses in use as…
11 runs1 likes4 downloads5 reach10 impact
159 instances - 16 features - 0 classes - 0 missing values
Auto-Mpg Data This dataset was taken from the StatLib library which is maintained at Carnegie Mellon University. The dataset was used in the 1983 American Statistical Association Exposition. -…
2 runs0 likes2 downloads2 reach12 impact
398 instances - 8 features - 0 classes - 6 missing values
Data from StatLib A manufacturer of automotive accessories provides hardware, e.g. nuts, bolts, washers and screws, to fasten the accessory to the car or truck. Hardware is counted and packaged…
4 runs0 likes0 downloads0 reach9 impact
40 instances - 7 features - 0 classes - 0 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach14 impact
316 instances - 12 features - 0 classes - 56 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Attributes 2,4, and 6 deleted. Midrange price treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M.…
0 runs0 likes0 downloads0 reach18 impact
93 instances - 23 features - 0 classes - 14 missing values
1. Title: meta-data 2. Sources: (a) Creator: LIACC - University of Porto R.Campo Alegre 823 4150 PORTO (b) Donor: P.B.Brazdil or J.Gama Tel.: +351 600 1672 LIACC, University of Porto Fax.: +351 600…
32 runs0 likes2 downloads2 reach21 impact
528 instances - 22 features - 0 classes - 504 missing values
The Computer Activity databases are a collection of computer systems activity measures. The data was collected from a Sun Sparcstation 20/712 with 128 Mbytes of memory running in a multi-user…
0 runs0 likes6 downloads6 reach14 impact
8192 instances - 22 features - 0 classes - 0 missing values
### Bank FM Dataset A family of datasets synthetically generated from a simulation of how bank-customers choose their banks. Tasks are based on predicting the fraction of bank customers who leave the…
0 runs0 likes6 downloads6 reach14 impact
8192 instances - 9 features - 0 classes - 0 missing values
Short Summary: Lists estimates of the percentage of body fat determined by underwater weighing and various body circumference measurements for 252 men. Classroom use of this data set: This data set…
25 runs0 likes6 downloads6 reach19 impact
252 instances - 15 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Attributes 2 and 8 deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
2 runs0 likes2 downloads2 reach19 impact
209 instances - 8 features - 0 classes - 0 missing values
The Computer Activity databases are a collection of computer systems activity measures. The data was collected from a Sun Sparcstation 20/712 with 128 Mbytes of memory running in a multi-user…
12 runs0 likes4 downloads4 reach15 impact
8192 instances - 13 features - 0 classes - 0 missing values
This is an artificial data set used in Friedman (1991) and also described in Breiman (1996,p.139). The cases are generated using the following method: Generate the values of 10 attributes, X1, ...,…
0 runs2 likes7 downloads9 reach14 impact
40768 instances - 11 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10984, and it has 20 rows and 1026 features (including…
1 runs0 likes2 downloads2 reach11 impact
20 instances - 1026 features - 0 classes - 0 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
12 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
%%%%%%%%%%%%%%%%%%% Data-Description % %%%%%%%%%%%%%%%%%%% COIL 1999 Competition Data Data Type multivariate Abstract This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
### Data Set Information: This database was designed on the basis of data provided by US Census Bureau [http://www.census.gov] (under Lookup Access [http://www.census.gov/cdrom/lookup]: Summary Tape…
0 runs1 likes7 downloads8 reach15 impact
22784 instances - 17 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes2 downloads2 reach13 impact
250 instances - 6 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Description taken from this web site:…
0 runs0 likes0 downloads0 reach13 impact
47 instances - 8 features - 0 classes - 0 missing values
This file contains data from Regression Analysis By Example, 2nd Edition, by Samprit Chatterjee and Bertram Price, John Wiley, 1991. Data sets have names of the form 'rabe.xxx' where xxx is the page…
0 runs0 likes0 downloads0 reach13 impact
51 instances - 7 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) description taken from this web site:…
2 runs0 likes0 downloads0 reach13 impact
147 instances - 7 features - 0 classes - 0 missing values
A shar archive of data from the book Data Analysis: An Introduction(1992) Prentice Hall by Jeff Witmer. Submitted by Jeff Witmer (fwitmer@ocvaxa.cc.oberlin.edu) 28/Jun/94] (29 kbytes) Description…
2 runs0 likes0 downloads0 reach13 impact
50 instances - 5 features - 0 classes - 0 missing values
Source: Ashwin Srinivasan Department of Statistics and Data Modeling University of Strathclyde Glasgow Scotland UK ross '@' uk.ac.turing The original Landsat data for this database was generated from…
1 runs1 likes7 downloads8 reach19 impact
6435 instances - 37 features - 0 classes - 0 missing values
Modeling wine preferences by data mining from physicochemical properties. In Decision Support Systems, Elsevier, 47(4):547-553, 2009. 1. Title: Wine Quality 2. Sources Created by: Paulo Cortez (Univ.…
0 runs1 likes13 downloads14 reach15 impact
6497 instances - 12 features - 0 classes - 0 missing values
1. U. S. Department of Commerce, Bureau of the Census, Census Of Population And Housing 1990 United States: Summary Tape File 1a & 3a (Computer Files), 2. U.S. Department Of Commerce, Bureau Of The…
0 runs1 likes3 downloads4 reach13 impact
1994 instances - 128 features - 0 classes - 39202 missing values
University Hospital, Basel, Switzerland: Matthias Pfisterer, M.D. V.A. Medical Center, Long Beach and Cleveland Clinic Foundation: Robert Detrano, M.D., Ph.D. Heart Disease Databases Cholesterol…
160 runs0 likes4 downloads4 reach9 impact
303 instances - 14 features - 0 classes - 6 missing values
Fruitflies" by Linda Partridge and Marion Farquhar. _Nature_, 294, 580-581, 1981 NAME: Sexual activity and the lifespan of male fruitflies\ TYPE: Designed (almost factorial) experiment\ SIZE: 125…
4 runs0 likes2 downloads2 reach12 impact
125 instances - 5 features - 0 classes - 0 missing values
The Computer Activity databases are a collection of computer systems activity measures. The data was collected from a Sun Sparcstation 20/712 with 128 Mbytes of memory running in a multi-user…
2 runs1 likes1 downloads2 reach9 impact
8192 instances - 22 features - 0 classes - 0 missing values
This data set is also obtained from the task of controlling the ailerons of a F16 aircraft, although the target variable and attributes are different from the ailerons domain. The target variable here…
2 runs0 likes3 downloads3 reach10 impact
9517 instances - 7 features - 0 classes - 0 missing values
Small dataset with time series of RAM prices over the years.
0 runs1 likes5 downloads6 reach11 impact
333 instances - 3 features - 0 classes - 0 missing values
As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning with encoding length selection. In Progress in Connectionist-Based Information Systems.…
10 runs1 likes2 downloads3 reach12 impact
195 instances - 11 features - 0 classes - 2 missing values
1) 1985 Model Import Car and Truck Specifications, 1985 Ward's Automotive Yearbook. 2) Personal Auto Manuals, Insurance Services Office, 160 Water Street, New York, NY 10038 3) Insurance Collision…
2 runs0 likes2 downloads2 reach12 impact
159 instances - 16 features - 0 classes - 0 missing values
Survival treated as the class attribute As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning with encoding length selection. In Progress in…
12 runs0 likes2 downloads2 reach12 impact
130 instances - 10 features - 0 classes - 97 missing values
Tumor-size treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning with encoding length selection. In Progress in…
0 runs0 likes3 downloads3 reach12 impact
286 instances - 10 features - 0 classes - 9 missing values
Detroit: The Role of Firearms", Criminology, vol.14, 387-400 (1976) This is the data set called 'DETROIT' in the book 'Subset selection in regression' by Alan J. Miller published in the Chapman & Hall…
2 runs0 likes0 downloads0 reach10 impact
13 instances - 14 features - 0 classes - 0 missing values
These data are those collected in a cloud-seeding experiment in Tasmania between mid-1964 and January 1971. Their analysis, using regression techniques and permutation tests, is discussed in: Miller,…
66 runs0 likes2 downloads2 reach9 impact
108 instances - 6 features - 0 classes - 0 missing values
(with variance 1 instead of 2). This is an artificial data set described in Breiman et al. (1984,p.238) (with variance 1 instead of 2). Generate the values of the 10 attributes independently using the…
2 runs1 likes4 downloads5 reach11 impact
40768 instances - 11 features - 0 classes - 0 missing values
This data set concerns the study of the factors affecting patterns of insulin-dependent diabetes mellitus in children. The objective is to investigate the dependence of the level of serum C-peptide on…
2 runs0 likes1 downloads1 reach9 impact
43 instances - 3 features - 0 classes - 0 missing values
Background: ========== In this paper we develop an approach to data disclosure in survey settings by adopting a probabilistic definition of disclosure due to Dalenius. Our approach is based on the…
0 runs0 likes0 downloads0 reach14 impact
662 instances - 4 features - 0 classes - 0 missing values
Background: ========== In this paper we develop an approach to data disclosure in survey settings by adopting a probabilistic definition of disclosure due to Dalenius. Our approach is based on the…
0 runs0 likes1 downloads1 reach14 impact
662 instances - 4 features - 0 classes - 0 missing values
Background: ========== In this paper we develop an approach to data disclosure in survey settings by adopting a probabilistic definition of disclosure due to Dalenius. Our approach is based on the…
0 runs0 likes0 downloads0 reach14 impact
662 instances - 4 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
0 runs0 likes0 downloads0 reach13 impact
42 instances - 10 features - 0 classes - 0 missing values
chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S. Simonoff, John Wiley and…
14 runs0 likes0 downloads0 reach13 impact
526 instances - 6 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
8 runs0 likes2 downloads2 reach13 impact
50 instances - 7 features - 0 classes - 0 missing values
1. Title/Topic: cocomonasa/software cost estimation 2. Sources: -- Creators: 60 NASA projects from different centers for projects from the 1980s and 1990s. Collected by Jairus Hihn, JPL, NASA, Manager…
2 runs0 likes2 downloads2 reach13 impact
60 instances - 17 features - 0 classes - 0 missing values
The happiness scores and rankings use data from the Gallup World Poll. The scores are based on answers to the main life evaluation question asked in the poll. This question, known as the Cantril…
2 runs0 likes3 downloads3 reach12 impact
158 instances - 12 features - 0 classes - 0 missing values
Data on fluctuating proportions of marked cells in marrow from heterozygous Safari cats from a study of early hematopoiesis. The data included below are 11 time series of proportions of marked…
2 runs0 likes2 downloads2 reach13 impact
140 instances - 4 features - 0 classes - 0 missing values
One of two multivariate regression data sets from paper industry, from an experiment at the paper plant Saugbruksforeningen, Norway. They have been described and analysed in: Aldrin, M. (1996),…
0 runs0 likes0 downloads0 reach13 impact
30 instances - 41 features - 0 classes - 0 missing values
Dataset listing all-time NFL passers through 1994 by the NFL passing efficiency rating. Associated passing statistics from which this rating is computed are included. The dataset lists statistics for…
0 runs0 likes0 downloads0 reach13 impact
26 instances - 6 features - 0 classes - 0 missing values
Primary Biliary Cirrhosis This data set is a follow-up to the original PBC data set, as discussed in appendix D of Fleming and Harrington, Counting Processes and Survival Analysis, Wiley, 1991. An…
0 runs0 likes5 downloads5 reach13 impact
1945 instances - 19 features - 0 classes - 1133 missing values
This dataset is taken from the Places Rated Almanac, by Richard Boyer and David Savageau, copyrighted and published by Rand McNally. This book order (SBN) number is 0-528-88008-X. The nine rating…
2 runs0 likes8 downloads8 reach13 impact
329 instances - 9 features - 0 classes - 0 missing values
This dataset contains 3 more features compared to version 1 of the same dataset. Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by…
0 runs0 likes0 downloads0 reach13 impact
62 instances - 10 features - 0 classes - 38 missing values
This analysis describes and summarizes the relationships between 1987 salaries of major league baseball players and the player's performance. The salary data were taken from Sports Illustrated, April…
0 runs1 likes1 downloads2 reach13 impact
26 instances - 8 features - 0 classes - 0 missing values
daily average wind speeds for 1961-1978 at 12 synoptic meteorological stations in the Republic of Ireland (Haslett and raftery 1989). These data were analyzed in detail in the following article:…
0 runs0 likes6 downloads6 reach14 impact
6574 instances - 15 features - 0 classes - 0 missing values
Veteran's Administration Lung Cancer Trial Taken from Kalbfleisch and Prentice, pages 223-224 ``` Variables Treatment 1=standard, 2=test Celltype 1=squamous, 2=smallcell, 3=adeno, 4=large Survival in…
2 runs0 likes1 downloads1 reach13 impact
137 instances - 8 features - 0 classes - 0 missing values
The data consist of 2001 observations taken from a balloon about 30 kilometres above the surface of the earth. In the section of the flight shown here the balloon increases in height. As radiation…
0 runs1 likes2 downloads3 reach14 impact
2001 instances - 2 features - 0 classes - 0 missing values
Geographical Analysis Spatial Data This georeferenced data set was used in: Pace, R. Kelley, and Ronald Barry, Quick Computation of Regressions with a Spatially Autoregressive Dependent Variable,…
4 runs1 likes2 downloads3 reach15 impact
3107 instances - 7 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
0 runs0 likes1 downloads1 reach13 impact
365 instances - 4 features - 0 classes - 30 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
74 instances - 9 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
120 instances - 20 features - 0 classes - 0 missing values