Data
Filter results by:
cast metal 1
111 runs0 likes8 downloads8 reach5 impact
327 instances - 38 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
77 runs0 likes2 downloads2 reach7 impact
329 instances - 10937 features - 2 classes - 0 missing values
This dataset is taken from the Places Rated Almanac, by Richard Boyer and David Savageau, copyrighted and published by Rand McNally. This book order (SBN) number is 0-528-88008-X, and it retails for…
2 runs0 likes7 downloads7 reach6 impact
329 instances - 10 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12214, and it has 330 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
330 instances - 1026 features - 0 classes - 0 missing values
Small dataset with time series of RAM prices over the years.
0 runs1 likes4 downloads5 reach4 impact
333 instances - 3 features - 0 classes - 0 missing values
No data.
220 runs0 likes7 downloads7 reach11 impact
336 instances - 7903 features - 6 classes - 0 missing values
1. Title: Protein Localization Sites 2. Creator and Maintainer: Kenta Nakai Institue of Molecular and Cellular Biology Osaka, University 1-3 Yamada-oka, Suita 565 Japan nakai@imcb.osaka-u.ac.jp…
1803 runs0 likes12 downloads12 reach4 impact
336 instances - 8 features - 8 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
815 runs0 likes8 downloads8 reach8 impact
336 instances - 8 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
77 runs0 likes3 downloads3 reach7 impact
337 instances - 10937 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12679, and it has 337 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
337 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 18046, and it has 337 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
337 instances - 1026 features - 0 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Airline Ticket Price dataset concerns the prediction of airline ticket prices. The rows are a…
0 runs0 likes0 downloads0 reach2 impact
337 instances - 417 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Airline Ticket Price dataset concerns the prediction of airline ticket prices. The rows are a…
0 runs0 likes0 downloads0 reach2 impact
337 instances - 417 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11968, and it has 338 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
338 instances - 1026 features - 0 classes - 0 missing values
Citation Request: This primary tumor domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
1261 runs0 likes16 downloads16 reach4 impact
339 instances - 18 features - 21 classes - 225 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
752 runs0 likes7 downloads7 reach8 impact
339 instances - 18 features - 2 classes - 225 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12568, and it has 340 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
340 instances - 1026 features - 0 classes - 0 missing values
Abstract: This dataset consists in a collection of shape and texture features extracted from digital images of leaf specimens originating from a total of 40 different plant species. Source: This…
112 runs0 likes9 downloads9 reach6 impact
340 instances - 16 features - 30 classes - 0 missing values
Schizophrenic Eye-Tracking Data in Rubin and Wu (1997) Biometrics. Yingnian Wu (wu@hustat.harvard.edu) [14/Oct/97] Information about the dataset CLASSTYPE: nominal CLASSINDEX: last
748 runs0 likes7 downloads7 reach16 impact
340 instances - 15 features - 2 classes - 834 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101199, and it has 341 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
341 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10294, and it has 342 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach4 impact
342 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11054, and it has 344 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
344 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12171, and it has 344 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
344 instances - 1026 features - 0 classes - 0 missing values
The first 5 variables are all blood tests which are thought to be sensitive to liver disorders that might arise from excessive alcohol consumption. Each line in the dataset constitutes the record of a…
182 runs2 likes30 downloads32 reach1 impact
345 instances - 6 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
65 runs0 likes2 downloads2 reach7 impact
347 instances - 10937 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103726, and it has 348 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
348 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 102420, and it has 349 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
349 instances - 1026 features - 0 classes - 0 missing values
SPECTF heart data This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks. NOTE: See the…
1103 runs0 likes12 downloads12 reach8 impact
349 instances - 45 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100863, and it has 350 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
350 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10542, and it has 351 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
351 instances - 1026 features - 0 classes - 0 missing values
This radar data was collected by a system in Goose Bay, Labrador. This system consists of a phased array of 16 high-frequency antennas with a total transmitted power on the order of 6.4 kilowatts. See…
2484 runs3 likes27 downloads30 reach4 impact
351 instances - 35 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
77 runs0 likes3 downloads3 reach7 impact
355 instances - 10937 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 64, and it has 358 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
358 instances - 1026 features - 0 classes - 0 missing values
This dataset contains 358 lyrics of songs for the rock bands 'The Rolling Stones' and 'Deep Purple'. The bands are equally represented in the dataset (179 songs for each band). This dataset was…
8 runs0 likes1 downloads1 reach12 impact
358 instances - 2 features - 2 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Jura (Goovaerts 1997) dataset consists of measurements of concentrations of seven heavy…
0 runs0 likes0 downloads0 reach2 impact
359 instances - 18 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Jura (Goovaerts 1997) dataset consists of measurements of concentrations of seven heavy…
0 runs0 likes0 downloads0 reach2 impact
359 instances - 18 features - classes - 0 missing values
University of Sao Paulo, School of Art, Sciences and Humanities, Sao Paulo, SP, Brazil ### LIBRAS Movement Database LIBRAS, acronym of the Portuguese name "LIngua BRAsileira de Sinais", is the…
0 runs0 likes4 downloads4 reach7 impact
360 instances - 91 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12474, and it has 360 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
360 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100186, and it has 360 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
360 instances - 1026 features - 0 classes - 0 missing values
* Dataset Title: MicroMass - Mixed (mixed spectra version) * Abstract: A dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data. * Source:…
64 runs1 likes4 downloads5 reach5 impact
360 instances - 1301 features - 10 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
78 runs0 likes2 downloads2 reach7 impact
363 instances - 10937 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
490 runs0 likes4 downloads4 reach6 impact
364 instances - 33 features - 6 classes - 101 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs0 likes6 downloads6 reach8 impact
364 instances - 33 features - 2 classes - 80 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes1 downloads1 reach5 impact
365 instances - 4 features - 0 classes - 30 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
708 runs0 likes5 downloads5 reach8 impact
365 instances - 4 features - 2 classes - 30 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12000, and it has 366 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
366 instances - 1026 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes2 downloads2 reach4 impact
366 instances - 5 features - classes - 2 missing values
testing
0 runs0 likes0 downloads0 reach0 impact
366 instances - 3 features - classes - 0 missing values
1. Title: Dermatology Database 2. Source Information: (a) Original owners: -- 1. Nilsel Ilter, M.D., Ph.D., Gazi University, School of Medicine 06510 Ankara, Turkey Phone: +90 (312) 214 1080 -- 2. H.…
1752 runs0 likes13 downloads13 reach4 impact
366 instances - 35 features - 6 classes - 8 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
113 runs0 likes3 downloads3 reach8 impact
366 instances - 6 features - 2 classes - 1 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
723 runs0 likes6 downloads6 reach8 impact
366 instances - 35 features - 2 classes - 8 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) Database of surgeries on horses. Possible class attributes: 24 (whether lesion is surgical), others include: 23, 25, 26, and 27 Notes: * Hospital_Number…
234 runs0 likes9 downloads9 reach2 impact
368 instances - 27 features - 2 classes - 1927 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) In this version (version 2), some features were removed. It is unclear why of how this was done.
1883 runs0 likes9 downloads9 reach2 impact
368 instances - 23 features - 2 classes - 1927 missing values
No data.
747 runs0 likes7 downloads7 reach8 impact
369 instances - 9 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17024, and it has 373 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
373 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10473, and it has 376 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
376 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10074, and it has 377 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
377 instances - 1026 features - 0 classes - 0 missing values
The dataset contains the serie a matches for season 2015-2016
0 runs0 likes0 downloads0 reach1 impact
379 instances - 38 features - classes - 44 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach6 impact
379 instances - 9 features - 4 classes - 1418 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
104 runs0 likes6 downloads6 reach8 impact
379 instances - 9 features - 2 classes - 1368 missing values
Following are data on the shooting of Vinnie Johnson of the Detroit Pistons during the 1985-1986 through 1988-1989 seasons. Source was the New York Times. The data are analyzed in the Carnegie Mellon…
0 runs0 likes0 downloads0 reach5 impact
380 instances - 3 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11192, and it has 380 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
380 instances - 1026 features - 0 classes - 0 missing values
The dataset contains the premier league matches for the season 2014-2015.
0 runs0 likes0 downloads0 reach1 impact
380 instances - 38 features - classes - 9 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
963 runs0 likes11 downloads11 reach8 impact
380 instances - 3 features - 2 classes - 0 missing values
Data from the RSCTC 2010 Discovery Challenge. All datasets contain between 100 and 400 samples, characterized by values of 20,000 - 65,000 attributes. Samples are assigned to several (2-10) classes.…
1 runs0 likes2 downloads2 reach6 impact
383 instances - 54676 features - 9 classes - 0 missing values
simple engine data
52 runs0 likes5 downloads5 reach4 impact
383 instances - 6 features - 3 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
77 runs0 likes2 downloads2 reach7 impact
384 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
66 runs0 likes1 downloads1 reach7 impact
386 instances - 10937 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10919, and it has 386 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
386 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10579, and it has 388 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
388 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10692, and it has 388 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
388 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12949, and it has 389 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
389 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17073, and it has 391 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
391 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11046, and it has 392 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
392 instances - 1026 features - 0 classes - 0 missing values
cars1-pmlb
31 runs0 likes2 downloads2 reach11 impact
392 instances - 8 features - 3 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 143, and it has 393 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
393 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11680, and it has 395 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
395 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12862, and it has 395 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
395 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12415, and it has 395 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach4 impact
395 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11036, and it has 396 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
396 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10244, and it has 396 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
396 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10577, and it has 397 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
397 instances - 1026 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Identifier attribute deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
2 runs0 likes2 downloads2 reach1 impact
398 instances - 8 features - 0 classes - 6 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
705 runs0 likes5 downloads5 reach8 impact
398 instances - 8 features - 2 classes - 6 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10116, and it has 399 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
399 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101598, and it has 399 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
399 instances - 1026 features - 0 classes - 0 missing values
calendarDOW-pmlb
31 runs0 likes1 downloads1 reach11 impact
399 instances - 33 features - 5 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30028, and it has 399 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
399 instances - 1026 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach5 impact
400 instances - 8 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach5 impact
400 instances - 8 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
22 runs0 likes2 downloads2 reach5 impact
400 instances - 8 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach5 impact
400 instances - 7 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach5 impact
400 instances - 8 features - 0 classes - 0 missing values
This dataset contains a set of face images taken between April 1992 and April 1994 at AT&T Laboratories Cambridge. As described on the original website: There are ten different images of each of 40…
53 runs0 likes0 downloads0 reach5 impact
400 instances - 4097 features - 40 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-cd1-400 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity…
144 runs0 likes3 downloads3 reach6 impact
400 instances - 41 features - 8 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
581 runs0 likes5 downloads5 reach7 impact
400 instances - 6 features - 4 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
788 runs0 likes7 downloads7 reach8 impact
400 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
817 runs0 likes8 downloads8 reach8 impact
400 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
764 runs0 likes6 downloads6 reach8 impact
400 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
779 runs0 likes7 downloads7 reach8 impact
400 instances - 8 features - 2 classes - 0 missing values