Data
Filter results by:
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12919, and it has 79 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
79 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 102815, and it has 34 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
34 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11188, and it has 39 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
39 instances - 1026 features - 0 classes - 0 missing values
## Guess which points belong to signal track [COMET](http://comet.kek.jp/Introduction.html) is an experiment being constructed at the J-PARC proton beam laboratory in Japan. It will search for…
0 runs0 likes1 downloads1 reach11 impact
7619400 instances - 6 features - 0 classes - 0 missing values
## Guess which points belong to signal track [COMET](http://comet.kek.jp/Introduction.html) is an experiment being constructed at the J-PARC proton beam laboratory in Japan. It will search for…
0 runs0 likes1 downloads1 reach11 impact
7619400 instances - 6 features - 0 classes - 0 missing values
The dataset and this description is made available on http://www-stat.stanford.edu/~tibs/ElemStatLearn/data.html. Normalized handwritten digits, automatically scanned from envelopes by the U.S. Postal…
57 runs0 likes1 downloads1 reach11 impact
9298 instances - 257 features - 10 classes - 0 missing values
Multi-label dataset. The genbase dataset contains protein sequences that can be assigned to several classes of protein families.
0 runs0 likes1 downloads1 reach11 impact
662 instances - 1213 features - 2 classes - 0 missing values
The langLog dataset includes 1004 textual predictors and was originally compiled in the doctorial thesis of Read (2010). It consists of 956 text samples that can be assigned to one or more topics such…
0 runs0 likes4 downloads4 reach11 impact
1460 instances - 1079 features - 2 classes - 0 missing values
Multi-label dataset. The scene dataset is an image classification task where labels like Beach, Mountain, Field, Urban are assigned to each image.
0 runs0 likes12 downloads12 reach11 impact
2407 instances - 300 features - 2 classes - 0 missing values
Multi-label dataset. The yeast dataset (Elisseeff and Weston, 2002) consists of micro-array expression data, as well as phylogenetic profiles of yeast, and includes 2417 genes and 103 predictors. In…
0 runs0 likes2 downloads2 reach11 impact
2417 instances - 117 features - 2 classes - 0 missing values
Small dataset with time series of RAM prices over the years.
0 runs1 likes4 downloads5 reach11 impact
333 instances - 3 features - 0 classes - 0 missing values
Historical Rainfall data of Bangladesh
0 runs0 likes0 downloads0 reach11 impact
16755 instances - 4 features - 0 classes - 0 missing values
Customer purchases on Black Friday
0 runs0 likes1 downloads1 reach11 impact
166821 instances - 10 features - 0 classes - 0 missing values
Testing this plattform
0 runs0 likes0 downloads0 reach11 impact
36203 instances - 18 features - 0 classes - 8971 missing values
Asteroid Dataset
0 runs0 likes1 downloads1 reach11 impact
126131 instances - 34 features - 2 classes - 99 missing values
Fixed dataset for autoHorse.csv I suggest...
0 runs0 likes0 downloads0 reach11 impact
201 instances - 69 features - 186 classes - 0 missing values
No data.
206 runs0 likes3 downloads3 reach11 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
51 runs1 likes4 downloads5 reach11 impact
1000000 instances - 48 features - 10 classes - 0 missing values
This is the hip measurement data from Table B.13 in Chatfield's Problem Solving (1995, 2nd edn, Chapman and Hall). It is given in 8 columns. First 4 columns are for Control Group. Last 4 columns are…
0 runs0 likes0 downloads0 reach11 impact
54 instances - 8 features - classes - 120 missing values
No data.
73 runs0 likes5 downloads5 reach11 impact
1000000 instances - 16 features - 2 classes - 0 missing values
No data.
67 runs0 likes2 downloads2 reach11 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
70 runs0 likes2 downloads2 reach11 impact
1000000 instances - 14 features - 2 classes - 0 missing values
No data.
314 runs1 likes8 downloads9 reach11 impact
1000000 instances - 36 features - 19 classes - 0 missing values
No data.
326 runs0 likes4 downloads4 reach11 impact
1000000 instances - 16 features - 2 classes - 0 missing values
Normalized version of the pokerhand data set. Automated file upload of pokerhand-normalized.arff
314 runs0 likes12 downloads12 reach11 impact
829201 instances - 11 features - 10 classes - 0 missing values
No data.
307 runs0 likes3 downloads3 reach11 impact
1000000 instances - 41 features - 3 classes - 0 missing values
No data.
68 runs0 likes4 downloads4 reach11 impact
1000000 instances - 21 features - 2 classes - 0 missing values
Normalized version of the Forest Covertype dataset (see version 1), so that the numerical values are between 0 and 1. Contains the forest cover type for 30 x 30 meter cells obtained from US Forest…
342 runs1 likes39 downloads40 reach11 impact
581012 instances - 55 features - 7 classes - 0 missing values
No data.
332 runs0 likes4 downloads4 reach11 impact
1000000 instances - 17 features - 2 classes - 0 missing values
No data.
308 runs0 likes2 downloads2 reach11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
309 runs0 likes3 downloads3 reach11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
60 runs0 likes2 downloads2 reach11 impact
1000000 instances - 17 features - 26 classes - 0 missing values
--------------------------------------------------------------------------- Short description --------------------------------------------------------------------------- Data on tree growth used in…
0 runs0 likes2 downloads2 reach11 impact
2796 instances - 35 features - 6 classes - 68100 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
0 runs0 likes0 downloads0 reach11 impact
2796 instances - 35 features - 2 classes - 68100 missing values
No data.
90 runs2 likes3 downloads5 reach11 impact
663552 instances - 13 features - 2 classes - 0 missing values
No data.
309 runs0 likes6 downloads6 reach11 impact
1000000 instances - 35 features - 6 classes - 0 missing values
No data.
67 runs0 likes2 downloads2 reach11 impact
1000000 instances - 17 features - 10 classes - 0 missing values
Predicting forest cover type from cartographic variables only (no remotely sensed data). The actual forest cover type for a given observation (30 x 30 meter cell) was determined from US Forest Service…
216 runs0 likes11 downloads11 reach11 impact
110393 instances - 55 features - 7 classes - 0 missing values
No data.
194 runs0 likes3 downloads3 reach11 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
311 runs0 likes3 downloads3 reach11 impact
1000000 instances - 17 features - 26 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes1 downloads1 reach11 impact
228 instances - 8 features - classes - 20 missing values
No data.
87 runs0 likes5 downloads5 reach11 impact
295245 instances - 11 features - 5 classes - 0 missing values
These data are estimated correlations between daily 3 p.m. wind measurements during September and October 1997 for a network of 45 stations in the Sydney region. The first column below gives a list of…
0 runs0 likes0 downloads0 reach11 impact
45 instances - 47 features - classes - 0 missing values
No data.
66 runs0 likes2 downloads2 reach11 impact
1000000 instances - 39 features - 6 classes - 0 missing values
Binarized version of the isolet dataset (see version 1). Only instances with class labels 1 and 2 from the original dataset are considered.
0 runs0 likes0 downloads0 reach11 impact
600 instances - 618 features - 2 classes - 0 missing values
No data.
305 runs0 likes2 downloads2 reach11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
48 runs1 likes4 downloads5 reach11 impact
1000000 instances - 77 features - 10 classes - 0 missing values
No data.
211 runs0 likes3 downloads3 reach11 impact
1000000 instances - 20 features - 7 classes - 0 missing values
No data.
298 runs0 likes3 downloads3 reach11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
330 runs0 likes5 downloads5 reach11 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
63 runs0 likes2 downloads2 reach11 impact
1000000 instances - 41 features - 3 classes - 0 missing values
No data.
326 runs1 likes5 downloads6 reach11 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
290 runs0 likes5 downloads5 reach11 impact
1000000 instances - 77 features - 10 classes - 0 missing values
No data.
71 runs0 likes5 downloads5 reach11 impact
1000000 instances - 17 features - 2 classes - 0 missing values
The first 5 variables are all blood tests which are thought to be sensitive to liver disorders that might arise from excessive alcohol consumption. Each line in the dataset constitutes the record of a…
191 runs2 likes30 downloads32 reach11 impact
345 instances - 6 features - 0 classes - 0 missing values
No data.
63 runs0 likes4 downloads4 reach11 impact
1000000 instances - 19 features - 4 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach11 impact
185 instances - 2 features - classes - 0 missing values
No data.
324 runs0 likes5 downloads5 reach11 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
50 runs0 likes1 downloads1 reach11 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
315 runs0 likes2 downloads2 reach11 impact
295245 instances - 11 features - 5 classes - 0 missing values
No data.
307 runs0 likes2 downloads2 reach11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
143 runs0 likes4 downloads4 reach11 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
310 runs0 likes4 downloads4 reach11 impact
1000000 instances - 11 features - 2 classes - 0 missing values
No data.
167 runs0 likes8 downloads8 reach11 impact
399940 instances - 1002 features - 2 classes - 0 missing values
Synthetic dataset. Almost identical to [dataset 152](https://www.openml.org/d/153/edit)
319 runs0 likes4 downloads4 reach11 impact
1000000 instances - 11 features - 2 classes - 0 missing values
No data.
225 runs0 likes7 downloads7 reach11 impact
1000000 instances - 21 features - 2 classes - 0 missing values
No data.
230 runs0 likes4 downloads4 reach11 impact
1000000 instances - 35 features - 2 classes - 0 missing values
No data.
293 runs0 likes2 downloads2 reach11 impact
1000000 instances - 17 features - 10 classes - 0 missing values
No data.
328 runs0 likes3 downloads3 reach11 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
219 runs0 likes4 downloads4 reach11 impact
1000000 instances - 58 features - 2 classes - 0 missing values
No data.
310 runs0 likes4 downloads4 reach11 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
326 runs0 likes4 downloads4 reach11 impact
1000000 instances - 14 features - 2 classes - 0 missing values
No data.
304 runs0 likes7 downloads7 reach11 impact
1000000 instances - 25 features - 10 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach11 impact
50 instances - 3 features - classes - 0 missing values
No data.
52 runs0 likes3 downloads3 reach11 impact
1000000 instances - 48 features - 10 classes - 0 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
0 runs0 likes1 downloads1 reach11 impact
31 instances - 16 features - classes - 150 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes2 downloads2 reach11 impact
100 instances - 10 features - classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes2 downloads2 reach11 impact
366 instances - 5 features - classes - 2 missing values
No data.
334 runs0 likes4 downloads4 reach11 impact
1000000 instances - 33 features - 2 classes - 0 missing values
No data.
44 runs0 likes3 downloads3 reach11 impact
1000000 instances - 15 features - 2 classes - 0 missing values
Automated file upload of BNG(credit-g)
99 runs0 likes3 downloads3 reach11 impact
1000000 instances - 21 features - 2 classes - 0 missing values
Automated file upload of BNG(spambase)
98 runs0 likes3 downloads3 reach11 impact
1000000 instances - 58 features - 2 classes - 0 missing values
Automated file upload of BNG(optdigits)
100 runs1 likes1 downloads2 reach11 impact
1000000 instances - 65 features - 10 classes - 0 missing values
Automated file upload of BNG(segment)
99 runs0 likes1 downloads1 reach11 impact
1000000 instances - 20 features - 7 classes - 0 missing values
Multi-label dataset. The birds dataset consists of 327 audio recordings of 12 different vocalizing bird species. Each sound can be assigned to various bird species.
0 runs0 likes6 downloads6 reach11 impact
645 instances - 279 features - 2 classes - 0 missing values
Multi-label dataset. Audio dataset (emotions) consists of 593 musical files with 6 clustered emotional labels and 72 predictors. Each song can be labeled with one or more of the labels…
0 runs2 likes5 downloads7 reach11 impact
593 instances - 78 features - 2 classes - 0 missing values
No data.
311 runs0 likes5 downloads5 reach11 impact
1000000 instances - 10 features - 2 classes - 0 missing values
No data.
307 runs0 likes5 downloads5 reach11 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach11 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes2 downloads2 reach11 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
30 runs0 likes1 downloads1 reach11 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
28 runs0 likes2 downloads2 reach11 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach11 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
32 runs0 likes1 downloads1 reach11 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
305 runs0 likes3 downloads3 reach11 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
337 runs1 likes2 downloads3 reach11 impact
1000000 instances - 13 features - 3 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach11 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach11 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
31 runs0 likes1 downloads1 reach11 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
34 runs0 likes2 downloads2 reach11 impact
1000000 instances - 17 features - 26 classes - 0 missing values