Data
Filter results by:
Detroit: The Role of Firearms", Criminology, vol.14, 387-400 (1976) This is the data set called 'DETROIT' in the book 'Subset selection in regression' by Alan J. Miller published in the Chapman & Hall…
2 runs0 likes0 downloads0 reach10 impact
13 instances - 14 features - 0 classes - 0 missing values
# The Oxford-IIIT Pet Dataset Number of classes = 37, we have used LabelEncoder on top of classes. The mapping for the classes are: 'abyssinian': 0, 'american_bulldog': 1, 'american_pit_bull_terrier':…
0 runs0 likes0 downloads0 reach0 impact
7349 instances - 49153 features - 37 classes - 0 missing values
Background: ========== In this paper we develop an approach to data disclosure in survey settings by adopting a probabilistic definition of disclosure due to Dalenius. Our approach is based on the…
0 runs0 likes0 downloads0 reach14 impact
662 instances - 4 features - 0 classes - 0 missing values
Background: ========== In this paper we develop an approach to data disclosure in survey settings by adopting a probabilistic definition of disclosure due to Dalenius. Our approach is based on the…
0 runs0 likes0 downloads0 reach14 impact
662 instances - 4 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
0 runs0 likes0 downloads0 reach13 impact
42 instances - 10 features - 0 classes - 0 missing values
chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S. Simonoff, John Wiley and…
14 runs0 likes0 downloads0 reach13 impact
526 instances - 6 features - 0 classes - 0 missing values
chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S. Simonoff, John Wiley and…
0 runs0 likes0 downloads0 reach11 impact
50 instances - 3 features - classes - 0 missing values
One of two multivariate regression data sets from paper industry, from an experiment at the paper plant Saugbruksforeningen, Norway. They have been described and analysed in: Aldrin, M. (1996),…
0 runs0 likes0 downloads0 reach13 impact
30 instances - 41 features - 0 classes - 0 missing values
These data are estimated correlations between daily 3 p.m. wind measurements during September and October 1997 for a network of 45 stations in the Sydney region. The first column below gives a list of…
0 runs0 likes0 downloads0 reach11 impact
45 instances - 47 features - classes - 0 missing values
Dataset listing all-time NFL passers through 1994 by the NFL passing efficiency rating. Associated passing statistics from which this rating is computed are included. The dataset lists statistics for…
0 runs0 likes0 downloads0 reach13 impact
26 instances - 6 features - 0 classes - 0 missing values
This dataset contains 3 more features compared to version 1 of the same dataset. Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by…
0 runs0 likes0 downloads0 reach13 impact
62 instances - 10 features - 0 classes - 38 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
74 instances - 9 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
120 instances - 20 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
0 runs0 likes0 downloads0 reach13 impact
60 instances - 11 features - 0 classes - 14 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
163 instances - 6 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
18 runs0 likes0 downloads0 reach14 impact
159 instances - 10 features - 0 classes - 6 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
0 runs0 likes0 downloads0 reach14 impact
4052 instances - 8 features - 0 classes - 0 missing values
Following are data on the shooting of Vinnie Johnson of the Detroit Pistons during the 1985-1986 through 1988-1989 seasons. Source was the New York Times. The data are analyzed in the Carnegie Mellon…
0 runs0 likes0 downloads0 reach13 impact
380 instances - 3 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach14 impact
66 instances - 12 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
0 runs0 likes0 downloads0 reach13 impact
30 instances - 7 features - 0 classes - 6 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
108 instances - 4 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
67 instances - 16 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
132 instances - 4 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
100 instances - 3 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
450 instances - 4 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
100 instances - 4 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach14 impact
649 instances - 3 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
0 runs0 likes0 downloads0 reach13 impact
48 instances - 5 features - 0 classes - 0 missing values
COVI-19, Economic and population data for all Nigerian States
0 runs0 likes0 downloads0 reach0 impact
37 instances - 19 features - classes - 0 missing values
Human Development Index [DATA] United Nations Development Program compiled an Index of Human Development. Column 1: Country(character) 2: Index 3: GNP To measure the quality of life in a nation, the…
2 runs0 likes0 downloads0 reach13 impact
130 instances - 2 features - 0 classes - 0 missing values
This file contains the data in "The MU284 Population" from Appendix B of the book "Model Assisted Survey Sampling" by Sarndal, Swensson and Wretman, published by Springer-Verlag, New York, 1992. The…
0 runs0 likes0 downloads0 reach13 impact
284 instances - 10 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
475 instances - 4 features - 0 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
2 runs0 likes0 downloads0 reach13 impact
475 instances - 4 features - 0 classes - 0 missing values
Data on patient deaths within 30 days of surgery in 131 U.S. hospitals. See Christiansen and Morris, Bayesian Biostatistics, D. Berry and D. Stangl, editors, 1996, Marcel Dekker, Inc. Data on 131…
0 runs0 likes0 downloads0 reach13 impact
131 instances - 3 features - 0 classes - 0 missing values
Data on the homicide rate in Detroit for the years 1961-1973. This is the data set called DETROIT in the book 'Subset selection in regression' by Alan J. Miller published in the Chapman & Hall series…
0 runs0 likes0 downloads0 reach13 impact
13 instances - 14 features - 0 classes - 0 missing values
speeddating
0 runs0 likes0 downloads0 reach0 impact
8378 instances - 123 features - classes - 18372 missing values
This dataset originates from the bioinformatics fields considering two types of genetic data, namely phylogenetic profiles and microarray expression data for the yeast genome.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 31 features - classes - 0 missing values
This dataset originates from the bioinformatics fields considering two types of genetic data, namely phylogenetic profiles and microarray expression data for the yeast genome.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 28 features - classes - 0 missing values
This dataset originates from the bioinformatics fields considering two types of genetic data, namely phylogenetic profiles and microarray expression data for the yeast genome.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 28 features - classes - 0 missing values
This dataset measures freshly excised breast tissues, which, plotted in the plane, constitute the impedance spectrum from where the breast tissue features are computed.
0 runs0 likes0 downloads0 reach0 impact
106 instances - 15 features - classes - 0 missing values
This dataset contains estimates of the percentage of body fat determined by underwater weighting and body circumference measurements for men.
0 runs0 likes0 downloads0 reach0 impact
252 instances - 14 features - classes - 0 missing values
This dataset measure computer systems activity by means of (restricted) attributes and the objective is to predict when the CPU is free in a certain portion of time.
0 runs0 likes0 downloads0 reach0 impact
8192 instances - 11 features - classes - 0 missing values
This dataset was derived from the 1990 U.S. census, using one row per census block group (a block group is the smallest geographical unit for which the U.S. Census Bureau publishes sample data).
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 8 features - classes - 0 missing values
Author: Sven Peeters, Vitalik Melnikov, Eyke Hüllermeier Source: userbenchmark.com, fpsbenchmark.com, techpowerup.com June 2020 Please cite: Peeters, Sven, Vitalik Melnikov, and Eyke…
0 runs0 likes0 downloads0 reach1 impact
425833 instances - 45 features - 0 classes - 1299988 missing values
This dataset belongs to a collection of datasets used to analyze categorical data.
0 runs0 likes0 downloads0 reach0 impact
841 instances - 74 features - classes - 0 missing values
This dataset contains features computed from a digitized image of a fine needle aspirate of a breast mass. They describe characteristics of the cell nuclei present in the image.
0 runs0 likes0 downloads0 reach0 impact
194 instances - 32 features - classes - 0 missing values
This dataset is the result of a chemical analysis of wines grown in the same region in Italy but derived from three different cultivars. The analysis determined the quantities of constituents found in…
0 runs0 likes0 downloads0 reach0 impact
178 instances - 16 features - classes - 0 missing values
This dataset consists of predicting the cellular localization sites of proteins.
0 runs0 likes0 downloads0 reach0 impact
1484 instances - 18 features - classes - 0 missing values
This is perhaps the best known dataset to be found in the pattern recognition literature. Fisher's paper is a classic in the field and is referenced frequently to this day.
0 runs0 likes0 downloads0 reach0 impact
150 instances - 7 features - classes - 0 missing values
This dataset belongs to a collection of datasets used to analyze categorical data.
0 runs0 likes0 downloads0 reach0 impact
841 instances - 74 features - classes - 0 missing values
This dataset was obtained by a segmentation process of all the blocks of the page layout of a document.
0 runs0 likes0 downloads0 reach0 impact
5472 instances - 15 features - classes - 0 missing values
This dataset consists of classyfing references to a hand movement type according to a mapping operation representing the coordinates of movement.
0 runs0 likes0 downloads0 reach0 impact
360 instances - 105 features - classes - 0 missing values
This dataset contains a large number of black-and-white rectangular pixel displays as one of the capital letters in the English alphabet.
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 42 features - classes - 0 missing values
This dataset consists of the multi-spectral values of pixels in 3x3 neighborhoods in a satellite image, and the classification associated with the central pixel in each neighborhood.
0 runs0 likes0 downloads0 reach0 impact
6435 instances - 42 features - classes - 0 missing values
This dataset contains image data described by high-level attributes of outdoor images (hand-segmented to create a classification for every pixel).
0 runs0 likes0 downloads0 reach0 impact
2310 instances - 25 features - classes - 0 missing values
This dataset contains image data described by high-level attributes of outdoor images (hand-segmented to create a classification for every pixel).
0 runs0 likes0 downloads0 reach0 impact
2310 instances - 25 features - classes - 0 missing values
This dataset contains samples arising from handwritten digits characterized by pen trajectories (successive pen points on a coordinate system).
0 runs0 likes0 downloads0 reach0 impact
10992 instances - 26 features - classes - 0 missing values
This dataset contains samples arising from handwritten digits characterized by pen trajectories (successive pen points on a coordinate system).
0 runs0 likes0 downloads0 reach0 impact
10992 instances - 26 features - classes - 0 missing values
This dataset originates from the bioinformatics fields considering two types of genetic data, namely phylogenetic profiles and microarray expression data for the yeast genome.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 30 features - classes - 0 missing values
This dataset is obtained from the task of controlling a F16 aircraft, and the objective is related to an action taken on the elevators of the aircraft according to the status attributes of the…
0 runs0 likes0 downloads0 reach0 impact
16599 instances - 18 features - classes - 0 missing values
This dataset was motivated by criminological investigation to study the classification of types of glass according to their chemical properties.
0 runs0 likes0 downloads0 reach0 impact
214 instances - 15 features - classes - 0 missing values
This dataset was motivated by criminological investigation to study the classification of types of glass according to their chemical properties.
0 runs0 likes0 downloads0 reach0 impact
214 instances - 15 features - classes - 0 missing values
This dataset contains protein localization sites.
0 runs0 likes0 downloads0 reach0 impact
336 instances - 15 features - classes - 0 missing values
This dataset contains information collected by the U.S Census Service concerning housing in the area of Boston Mass.
0 runs0 likes0 downloads0 reach0 impact
506 instances - 12 features - classes - 0 missing values
This is an artificial dataset consisting of independent attributes which are uniformly distributed.
0 runs0 likes0 downloads0 reach0 impact
40768 instances - 14 features - classes - 0 missing values
This is perhaps the best known dataset to be found in the pattern recognition literature. Fisher's paper is a classic in the field and is referenced frequently to this day.
0 runs0 likes0 downloads0 reach0 impact
150 instances - 7 features - classes - 0 missing values
This dataset originates from the bioinformatics fields considering two types of genetic data, namely phylogenetic profiles and microarray expression data for the yeast genome.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 35 features - classes - 0 missing values
This dataset purpose is to classify a given silhouette as one of four types of vehicle, using a set of features extracted from the silhouette (the vehicle may be viewed from one of many different…
0 runs0 likes0 downloads0 reach0 impact
846 instances - 22 features - classes - 0 missing values
This dataset consists of a three dimensional array: speaker, vowel and input. The speakers and vowels are indexed by integers and, for each utterance, there are floating-point input values.
0 runs0 likes0 downloads0 reach0 impact
528 instances - 21 features - classes - 0 missing values
This dataset purpose is to classify a given silhouette as one of four types of vehicle, using a set of features extracted from the silhouette (the vehicle may be viewed from one of many different…
0 runs0 likes0 downloads0 reach0 impact
846 instances - 22 features - classes - 0 missing values
This dataset consists of a three dimensional array: speaker, vowel and input. The speakers and vowels are indexed by integers and, for each utterance, there are floating-point input values.
0 runs0 likes0 downloads0 reach0 impact
528 instances - 21 features - classes - 0 missing values
This dataset is the result of a chemical analysis of wines grown in the same region in Italy but derived from three different cultivars. The analysis determined the quantities of constituents found in…
0 runs0 likes0 downloads0 reach0 impact
178 instances - 16 features - classes - 0 missing values
This dataset provides daily stock prices from January 1988 through October 1991, for 10 aerospace companies.
0 runs0 likes0 downloads0 reach0 impact
950 instances - 10 features - classes - 0 missing values
iris dataset test upload
0 runs0 likes0 downloads0 reach11 impact
150 instances - 5 features - 3 classes - 0 missing values
The dataset freMTPL2sev contains claim amounts for 26,639 motor third-part liability policies.
0 runs0 likes0 downloads0 reach9 impact
26639 instances - 2 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1750 instances - 7 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1750 instances - 7 features - classes - 0 missing values
This dataset contains 206 attributes of 70 children with physical and motor disability based on ICF-CY. In particular, the SCADI dataset is the only one that has been used by ML researchers for…
0 runs0 likes0 downloads0 reach0 impact
70 instances - 206 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
7027 instances - 65 features - classes - 5835 missing values
This dataset describes 100,000 realistic, synthetically generated worker compensation insurance claims. Along the ultimate payments, each claim is described by a initial case estimate, dates of…
0 runs0 likes0 downloads0 reach0 impact
100000 instances - 14 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
704 instances - 21 features - classes - 192 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
704 instances - 21 features - classes - 192 missing values
INTRUSION DETECTOR LEARNING Software to detect network intrusions protects a computer network from unauthorized users, including perhaps insiders. The intrusion detector learning task is to build a…
0 runs1 likes0 downloads1 reach1 impact
4898431 instances - 42 features - 23 classes - 0 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
0 runs0 likes0 downloads0 reach11 impact
163 instances - 27 features - 5 classes - 9 missing values
A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on…
0 runs0 likes0 downloads0 reach13 impact
379 instances - 8 features - 4 classes - 1418 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1473 instances - 10 features - classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
0 runs0 likes1 downloads1 reach13 impact
1. Title: Faults in a urban waste water treatment plant 2. Source Information: -- Creators: Manel Poch (igte2@cc.uab.es) Unitat d'Enginyeria Quimica Universitat Autonoma de Barcelona. Bellaterra.…
0 runs0 likes1 downloads1 reach13 impact
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17035, and it has 17 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
17 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100951, and it has 81 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
81 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10444, and it has 44 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
44 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101557, and it has 732 rows and 1026 features…
1 runs0 likes1 downloads1 reach11 impact
732 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11556, and it has 39 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
39 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12886, and it has 81 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
81 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10113, and it has 131 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
131 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12514, and it has 692 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
692 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17106, and it has 127 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
127 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10878, and it has 427 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
427 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12949, and it has 389 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
389 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12415, and it has 395 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
395 instances - 1026 features - 0 classes - 0 missing values