OpenML
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 9, and it has 5013 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
5013 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 259, and it has 4332 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
4332 instances - 1026 features - 0 classes - 0 missing values
This version has feature names based on https://www2.1010data.com/documentationcenter/beta/Tutorials/MachineLearningExamples/CensusIncomeDataSet.html Missing data is also properly encoded in this…
0 runs - 0 likes - 1 downloads - 1 reach - 0 impact
199523 instances - 42 features - 2 classes - 415717 missing values
Bike sharing systems are a new generation of traditional bike rentals in which the whole process, from membership to rental and return, has become automatic. Through these systems, a user is able to easily rent…
0 runs - 0 likes - 2 downloads - 2 reach - 3 impact
17379 instances - 13 features - 0 classes - 0 missing values
Bike sharing systems are a new generation of traditional bike rentals in which the whole process, from membership to rental and return, has become automatic. Through these systems, a user is able to easily rent…
0 runs - 0 likes - 1 downloads - 1 reach - 3 impact
17379 instances - 13 features - 0 classes - 0 missing values
Online advertisement clicking rates, where the metrics are cost-per-click (CPC) and cost per thousand impressions (CPM).
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1643 instances - 3 features - classes - 0 missing values
Online advertisement clicking rates, where the metrics are cost-per-click (CPC) and cost per thousand impressions (CPM).
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1624 instances - 3 features - classes - 0 missing values
Online advertisement clicking rates, where the metrics are cost-per-click (CPC) and cost per thousand impressions (CPM).
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1538 instances - 3 features - classes - 0 missing values
Online advertisement clicking rates, where the metrics are cost-per-click (CPC) and cost per thousand impressions (CPM).
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1643 instances - 2 features - classes - 0 missing values
https://archive.ics.uci.edu/ml/datasets/Diabetes
0 runs - 0 likes - 1 downloads - 1 reach - 0 impact
768 instances - 9 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101211, and it has 82 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
82 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11150, and it has 84 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
84 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30009, and it has 102 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
102 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10264, and it has 146 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
146 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10773, and it has 525 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
525 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10579, and it has 388 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
388 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11636, and it has 411 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
411 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 138, and it has 1426 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
1426 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12699, and it has 789 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
789 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 188, and it has 2230 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
2230 instances - 1026 features - 0 classes - 0 missing values
Wikidata with top-474 most frequent types and ingoing/outgoing properties as features
0 runs - 0 likes - 15 downloads - 15 reach - 11 impact
19254100 instances - 2331 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11361, and it has 79 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
79 instances - 1026 features - 0 classes - 0 missing values
The data were collected as the SCITOS G5 robot navigates through the room following the wall in a clockwise direction, for 4 rounds, using 24 ultrasound sensors arranged circularly around its 'waist'.…
25199 runs - 0 likes - 22 downloads - 22 reach - 34 impact
5456 instances - 25 features - 4 classes - 0 missing values
Andrew V Uzilov, Joshua M Keegan, and David H Mathews. Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change. BMC Bioinformatics, 7(173), 2006. This…
31 runs - 0 likes - 11 downloads - 11 reach - 15 impact
488565 instances - 9 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11066, and it has 25 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
25 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10364, and it has 64 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
64 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12898, and it has 12 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
12 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11049, and it has 22 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
22 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100914, and it has 80 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
80 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101312, and it has 80 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
80 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100990, and it has 81 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
81 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101504, and it has 99 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
99 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100948, and it has 139 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
139 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11279, and it has 522 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
522 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11602, and it has 53 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
53 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12327, and it has 42 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
42 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12173, and it has 114 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
114 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12793, and it has 85 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
85 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10682, and it has 23 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
23 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20162, and it has 156 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
156 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12264, and it has 24 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
24 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11634, and it has 569 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
569 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 261, and it has 542 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
542 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10647, and it has 890 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
890 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11896, and it has 35 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
35 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11287, and it has 100 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
100 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10473, and it has 376 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
376 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30004, and it has 84 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
84 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100909, and it has 81 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
81 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10197, and it has 3058 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
3058 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10696, and it has 1070 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
1070 instances - 1026 features - 0 classes - 0 missing values
DBpedia with top-474 most frequent YAGO types HMC dataset for type prediction. Ingoing and outgoing properties as features
0 runs - 0 likes - 3 downloads - 3 reach - 11 impact
2886305 instances - 2401 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101090, and it has 48 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
48 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10803, and it has 37 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
37 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10621, and it has 99 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
99 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101276, and it has 25 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
25 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103062, and it has 74 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
74 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101056, and it has 79 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
79 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10967, and it has 101 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
101 instances - 1026 features - 0 classes - 0 missing values
This dataset summarizes a heterogeneous set of features about articles published by Mashable in a period of two years. The goal is to predict the number of shares in social networks (popularity). *…
0 runs - 0 likes - 5 downloads - 5 reach - 12 impact
39644 instances - 61 features - 0 classes - 0 missing values
Predicting the Geographical Origin of Music, ICDM, 2014 Abstract: Instances in this dataset contain audio features extracted from 1059 wave files. The task associated with the data is to predict the…
0 runs - 0 likes - 4 downloads - 4 reach - 14 impact
1059 instances - 118 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103800, and it has 91 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
91 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11802, and it has 13 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
13 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100042, and it has 14 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
14 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101226, and it has 79 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
79 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17113, and it has 12 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
12 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 282, and it has 479 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
479 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11110, and it has 969 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
969 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10950, and it has 889 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
889 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100129, and it has 513 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
513 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10434, and it has 3650 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
3650 instances - 1026 features - 0 classes - 0 missing values
Multiclass cancer diagnosis using 16063 tumor gene expression signatures. PNAS, VOL 98, no 26, pp. 15149-15154, December 18, 2001. S. Ramaswamy, P. Tamayo, R. Rifkin, S. Mukherjee, C.-H. Yeang, M.…
116 runs - 0 likes - 10 downloads - 10 reach - 23 impact
190 instances - 16064 features - 14 classes - 0 missing values
Test dataset
0 runs - 0 likes - 2 downloads - 2 reach - 13 impact
15547 instances - 61 features - 0 classes - 280 missing values
This is an experimental data set for trying to classify numbers in a lottery as "Highly likely to be picked" or "Not very likely to be picked". It is based on a little more than a…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
12528 instances - 36 features - classes - 0 missing values
ARFF Training Data
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
177640 instances - 40 features - classes - 0 missing values
The aim is to determine the type of arrhythmia from the ECG recordings. This database contains 279 attributes, 206 of which are linear valued and the rest are nominal. Concerning the study of H. Altay…
4430 runs - 0 likes - 50 downloads - 50 reach - 15 impact
452 instances - 280 features - 13 classes - 408 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
37368 runs - 0 likes - 18 downloads - 18 reach - 15 impact
2000 instances - 217 features - 10 classes - 0 missing values
This database is a standardized version of the original audiology database (see audiology.* in this directory). The non-standard set of attributes have been converted to a standard set of attributes…
7303 runs - 0 likes - 13 downloads - 13 reach - 12 impact
226 instances - 70 features - 24 classes - 317 missing values
Citation Request: This lymphography domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
1972 runs - 0 likes - 30 downloads - 30 reach - 12 impact
148 instances - 19 features - 4 classes - 0 missing values
Date: Tue, 15 Nov 88 15:44:08 EST From: stan To: aha@ICS.UCI.EDU 1. Title: Final settlements in labor negotiations in Canadian industry 2. Source Information -- Creators:…
7681 runs - 0 likes - 17 downloads - 17 reach - 12 impact
57 instances - 17 features - 2 classes - 326 missing values
The original Annealing dataset from UCI. The exact meaning of the features and classes is largely unknown. Annealing, in metallurgy and materials science, is a heat treatment that alters the physical…
13779 runs - 0 likes - 16 downloads - 16 reach - 13 impact
898 instances - 39 features - 5 classes - 22175 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
354 runs - 0 likes - 7 downloads - 7 reach - 14 impact
7485 instances - 61 features - 7 classes - 52048 missing values
The Committee on Statistical Graphics of the American Statistical Association (ASA) invites you to participate in its Second (1983) Exposition of Statistical Graphics Technology. The purposes of the…
164 runs - 0 likes - 3 downloads - 3 reach - 14 impact
406 instances - 8 features - 3 classes - 14 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
379 instances - 8 features - 4 classes - 1418 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
41 runs - 0 likes - 2 downloads - 2 reach - 15 impact
1340 instances - 17 features - 3 classes - 20 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
28846 runs - 0 likes - 8 downloads - 8 reach - 34 impact
841 instances - 71 features - 4 classes - 0 missing values
Data file: This data from "Problem-Solving" on "backache in pregnancy" is in a somewhat different format from that listed in the book. Each integer is preceded by a space. This makes it easier to read.…
174 runs - 0 likes - 6 downloads - 6 reach - 15 impact
180 instances - 32 features - 2 classes - 0 missing values
Datasets for `Pattern Recognition and Neural Networks' by B.D. Ripley, Cambridge University Press (1996), ISBN 0-521-46086-7. The…
41 runs - 0 likes - 2 downloads - 2 reach - 14 impact
27 instances - 3 features - 4 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
35 runs - 0 likes - 2 downloads - 2 reach - 14 impact
23 instances - 5 features - 3 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1034 runs - 0 likes - 10 downloads - 10 reach - 14 impact
100 instances - 7 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
694 runs - 0 likes - 7 downloads - 7 reach - 14 impact
83 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
800 runs - 0 likes - 10 downloads - 10 reach - 15 impact
209 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
737 runs - 0 likes - 5 downloads - 5 reach - 15 impact
303 instances - 14 features - 2 classes - 6 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
766 runs - 0 likes - 5 downloads - 5 reach - 14 impact
55 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
620 runs - 0 likes - 10 downloads - 10 reach - 15 impact
1000 instances - 51 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
812 runs - 0 likes - 7 downloads - 7 reach - 15 impact
250 instances - 26 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
791 runs - 0 likes - 6 downloads - 6 reach - 15 impact
250 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
802 runs - 0 likes - 8 downloads - 8 reach - 15 impact
662 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
806 runs - 0 likes - 8 downloads - 8 reach - 15 impact
500 instances - 6 features - 2 classes - 0 missing values
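The binarized entries above describe turning a numeric target into a two-class nominal target by thresholding at the mean. The following is a minimal sketch of that idea, not the procedure actually used to build these datasets. The function name `binarize_by_mean`, the labels "P" and "N", and the choice of which side of the mean receives which label are all assumptions, since the listing truncates the description before stating them.

```python
from statistics import mean


def binarize_by_mean(values, below_label="P", other_label="N"):
    """Map numeric target values to two nominal classes using the mean as threshold.

    The label names and the direction (below the mean -> below_label) are
    illustrative assumptions, not taken from the truncated dataset description.
    """
    threshold = mean(values)
    # Values strictly below the mean get below_label; everything else gets other_label.
    return [below_label if v < threshold else other_label for v in values]


if __name__ == "__main__":
    # The mean of this toy target is 4.0, so only the last value is not below it.
    print(binarize_by_mean([1.0, 2.0, 3.0, 10.0]))
```

Because the threshold is the mean rather than the median, a skewed target can give unbalanced classes, as in the toy example where three of four values fall below the mean.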
The ISOLET (Isolated Letter Speech Recognition) dataset was generated as follows: 150 subjects spoke the name of each letter of the alphabet twice. Hence, there are 52 training examples…
50748 runs - 0 likes - 70 downloads - 70 reach - 132 impact
7797 instances - 618 features - 26 classes - 0 missing values