Data
No data.
304 runs - 0 likes - 3 downloads - 3 reach - 9 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
304 runs - 0 likes - 7 downloads - 7 reach - 11 impact
1000000 instances - 25 features - 10 classes - 0 missing values
No data.
305 runs - 0 likes - 2 downloads - 2 reach - 11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
305 runs - 0 likes - 3 downloads - 3 reach - 11 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
306 runs - 0 likes - 3 downloads - 3 reach - 9 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
306 runs - 0 likes - 4 downloads - 4 reach - 11 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
307 runs - 0 likes - 3 downloads - 3 reach - 11 impact
1000000 instances - 41 features - 3 classes - 0 missing values
No data.
307 runs - 0 likes - 2 downloads - 2 reach - 11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
307 runs - 0 likes - 5 downloads - 5 reach - 11 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
308 runs - 0 likes - 2 downloads - 2 reach - 11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
309 runs - 0 likes - 3 downloads - 3 reach - 11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
309 runs - 0 likes - 6 downloads - 6 reach - 11 impact
1000000 instances - 35 features - 6 classes - 0 missing values
Normalized form of codrna (351) Andrew V Uzilov, Joshua M Keegan, and David H Mathews. Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change. BMC…
309 runs - 0 likes - 5 downloads - 5 reach - 9 impact
488565 instances - 9 features - 2 classes - 0 missing values
No data.
310 runs - 0 likes - 2 downloads - 2 reach - 9 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
310 runs - 0 likes - 4 downloads - 4 reach - 11 impact
1000000 instances - 11 features - 2 classes - 0 missing values
No data.
310 runs - 0 likes - 4 downloads - 4 reach - 11 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
311 runs - 0 likes - 3 downloads - 3 reach - 11 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
311 runs - 0 likes - 5 downloads - 5 reach - 11 impact
1000000 instances - 10 features - 2 classes - 0 missing values
No data.
312 runs - 1 likes - 5 downloads - 6 reach - 12 impact
1000000 instances - 14 features - 3 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
313 runs - 0 likes - 36 downloads - 36 reach - 15 impact
399482 instances - 12 features - 2 classes - 0 missing values
No data.
313 runs - 0 likes - 3 downloads - 3 reach - 9 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
314 runs - 1 likes - 8 downloads - 9 reach - 11 impact
1000000 instances - 36 features - 19 classes - 0 missing values
Normalized version of the pokerhand data set. Automated file upload of pokerhand-normalized.arff ### Data Set Information: Each record is an example of a hand consisting of five playing cards drawn…
314 runs - 0 likes - 12 downloads - 12 reach - 12 impact
829201 instances - 11 features - 10 classes - 0 missing values
No data.
315 runs - 0 likes - 2 downloads - 2 reach - 11 impact
295245 instances - 11 features - 5 classes - 0 missing values
Synthetic dataset. Almost identical to [dataset 152](https://www.openml.org/d/153/edit)
319 runs - 0 likes - 4 downloads - 4 reach - 11 impact
1000000 instances - 11 features - 2 classes - 0 missing values
No data.
324 runs - 0 likes - 5 downloads - 5 reach - 11 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
326 runs - 0 likes - 4 downloads - 4 reach - 11 impact
1000000 instances - 16 features - 2 classes - 0 missing values
No data.
326 runs - 1 likes - 5 downloads - 6 reach - 11 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
326 runs - 0 likes - 4 downloads - 4 reach - 11 impact
1000000 instances - 14 features - 2 classes - 0 missing values
No data.
328 runs - 0 likes - 3 downloads - 3 reach - 11 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
330 runs - 0 likes - 5 downloads - 5 reach - 11 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
331 runs - 0 likes - 7 downloads - 7 reach - 9 impact
1000000 instances - 20 features - 2 classes - 0 missing values
No data.
332 runs - 0 likes - 4 downloads - 4 reach - 11 impact
1000000 instances - 17 features - 2 classes - 0 missing values
Dataset created to study concept drift in stream mining. It is constructed by combining the Covertype, Poker-Hand, and Electricity datasets. More details can be found in: Albert Bifet, Geoff Holmes,…
332 runs - 0 likes - 27 downloads - 27 reach - 12 impact
1455525 instances - 73 features - 10 classes - 0 missing values
No data.
334 runs - 0 likes - 4 downloads - 4 reach - 11 impact
1000000 instances - 33 features - 2 classes - 0 missing values
No data.
337 runs - 1 likes - 2 downloads - 3 reach - 11 impact
1000000 instances - 13 features - 3 classes - 0 missing values
### Data Set Information: Predicting forest cover type from cartographic variables only (no remotely sensed data). The actual forest cover type for a given observation (30 x 30 meter cell) was…
342 runs - 1 likes - 39 downloads - 40 reach - 12 impact
581012 instances - 55 features - 7 classes - 0 missing values
Embryonal tumours of the central nervous system Prediction of Central Nervous System Embryonal Tumour Outcome based on Gene Expression. Nature, VOL 415, pp. 436-442, 24 January 2002. Scott L. Pomeroy,…
343 runs - 0 likes - 6 downloads - 6 reach - 14 impact
60 instances - 7130 features - 2 classes - 0 missing values
No data.
353 runs - 0 likes - 17 downloads - 17 reach - 13 impact
120919 instances - 1002 features - 2 classes - 0 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
354 runs - 0 likes - 7 downloads - 7 reach - 14 impact
7485 instances - 61 features - 7 classes - 52048 missing values
The dataset contains transactions made by credit cards in September 2013 by European cardholders. This dataset presents transactions that occurred over two days, with 492 frauds out of 284,807…
355 runs - 1 likes - 56 downloads - 57 reach - 20 impact
284807 instances - 31 features - 2 classes - 0 missing values
No data.
356 runs - 0 likes - 8 downloads - 8 reach - 9 impact
131072 instances - 17 features - 2 classes - 0 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
366 runs - 0 likes - 10 downloads - 10 reach - 14 impact
8844 instances - 61 features - 7 classes - 51515 missing values
Normalized version of vehicle dataset (http://www.openml.org/d/54) NAME vehicle silhouettes PURPOSE to classify a given silhouette as one of four types of vehicle, using a set of features extracted…
372 runs - 0 likes - 10 downloads - 10 reach - 11 impact
98528 instances - 101 features - 2 classes - 0 missing values
Source: http://www.ijcaonline.org/archives/volume47/number18/7291-0509 Data Set Information: In this paper, we seek to identify the reasons users are drawn to cyberspace in Kohkiloye and Boyer…
373 runs - 0 likes - 7 downloads - 7 reach - 13 impact
100 instances - 6 features - 2 classes - 0 missing values
No data.
373 runs - 0 likes - 9 downloads - 9 reach - 62 impact
918 instances - 3013 features - 10 classes - 0 missing values
No data.
377 runs - 0 likes - 10 downloads - 10 reach - 62 impact
913 instances - 3101 features - 10 classes - 0 missing values
This is a 20,000-instance sample of the original CIFAR-10 dataset, sampled randomly and stratified, with 2,000 examples per class. Training and test sets are merged. Find the corresponding task for the…
380 runs - 0 likes - 4 downloads - 4 reach - 21 impact
20000 instances - 3073 features - 10 classes - 0 missing values
Hayes-Roth Database. This is a merged version of the separate train and test sets that are usually distributed. On OpenML, this train-test split can be found as one of the possible tasks. Source…
384 runs - 0 likes - 4 downloads - 4 reach - 25 impact
160 instances - 5 features - 3 classes - 0 missing values
* Abstract: The data was created by a medical expert as a data set to test the expert system, which will perform the presumptive diagnosis of two diseases of the urinary system. * Source: Jacek…
391 runs - 0 likes - 11 downloads - 11 reach - 13 impact
120 instances - 7 features - 2 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
396 runs - 0 likes - 17 downloads - 17 reach - 15 impact
3468 instances - 785 features - 10 classes - 0 missing values
Vehicle classification in distributed sensor networks. Journal of Parallel and Distributed Computing, 64(7):826-838, July 2004. This is the SensIT Vehicle (combined) dataset, retrieved 2013-11-14 from…
403 runs - 0 likes - 22 downloads - 22 reach - 16 impact
98528 instances - 101 features - 2 classes - 0 missing values
No data.
405 runs - 0 likes - 7 downloads - 7 reach - 11 impact
45164 instances - 75 features - 11 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
406 runs - 1 likes - 11 downloads - 12 reach - 16 impact
4229 instances - 1618 features - 2 classes - 0 missing values
No data.
414 runs - 0 likes - 8 downloads - 8 reach - 62 impact
690 instances - 8262 features - 10 classes - 0 missing values
No data.
416 runs - 1 likes - 13 downloads - 14 reach - 63 impact
1050 instances - 3239 features - 10 classes - 0 missing values
* Abstract: The data was created by a medical expert as a data set to test the expert system, which will perform the presumptive diagnosis of two diseases of the urinary system. This is a…
423 runs - 0 likes - 14 downloads - 14 reach - 13 impact
120 instances - 7 features - 2 classes - 0 missing values
No data.
426 runs - 0 likes - 15 downloads - 15 reach - 84 impact
2463 instances - 2001 features - 17 classes - 0 missing values
No data.
428 runs - 0 likes - 12 downloads - 12 reach - 63 impact
1003 instances - 3183 features - 10 classes - 0 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
434 runs - 0 likes - 10 downloads - 10 reach - 14 impact
7019 instances - 61 features - 8 classes - 48089 missing values
Jarkko Salojarvi, Kai Puolamaki, Jaana Simola, Lauri Kovanen, Ilpo Kojo, Samuel Kaski. Inferring Relevance from Eye Movements: Feature Extraction. Helsinki University of Technology, Publications in…
440 runs - 0 likes - 12 downloads - 12 reach - 15 impact
10936 instances - 28 features - 3 classes - 0 missing values
Source: David Gil, dgil '@' dtic.ua.es, Lucentia Research Group, Department of Computer Technology, University of Alicante Jose Luis Girela, girela '@' ua.es, Department of Biotechnology, University…
451 runs - 0 likes - 7 downloads - 7 reach - 13 impact
100 instances - 10 features - 2 classes - 0 missing values
Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring. Science, VOL 286, pp. 531-537, 15 October 1999. Web supplement to the article T.R. Golub, D. K.…
451 runs - 0 likes - 13 downloads - 13 reach - 15 impact
72 instances - 7130 features - 2 classes - 0 missing values
Fashion-MNIST is a dataset of Zalando's article images, consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a…
452 runs - 0 likes - 12 downloads - 12 reach - 25 impact
70000 instances - 785 features - 10 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
455 runs - 0 likes - 5 downloads - 5 reach - 14 impact
108 instances - 4 features - 2 classes - 0 missing values
GISETTE is a handwritten digit recognition problem. The problem is to separate the highly confusable digits '4' and '9'. This dataset is one of five datasets of the NIPS 2003 feature selection…
466 runs - 0 likes - 52 downloads - 52 reach - 25 impact
7000 instances - 5001 features - 2 classes - 0 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
485 runs - 0 likes - 5 downloads - 5 reach - 13 impact
76 instances - 15 features - 7 classes - 37 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
486 runs - 0 likes - 14 downloads - 14 reach - 16 impact
14395 instances - 109 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
490 runs - 0 likes - 4 downloads - 4 reach - 13 impact
364 instances - 33 features - 6 classes - 101 missing values
No data.
496 runs - 0 likes - 6 downloads - 6 reach - 22 impact
45 instances - 4027 features - 2 classes - 5948 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
511 runs - 0 likes - 6 downloads - 6 reach - 15 impact
185 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
514 runs - 0 likes - 7 downloads - 7 reach - 14 impact
130 instances - 2 features - 2 classes - 0 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
519 runs - 0 likes - 7 downloads - 7 reach - 14 impact
203 instances - 17 features - 11 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
537 runs - 0 likes - 4 downloads - 4 reach - 14 impact
285 instances - 8 features - 7 classes - 27 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
548 runs - 0 likes - 9 downloads - 9 reach - 16 impact
3468 instances - 785 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
554 runs - 0 likes - 10 downloads - 10 reach - 15 impact
40768 instances - 11 features - 2 classes - 0 missing values
CODING: ITEM 1 = BUSINESS CONDITIONS 6 MONTHS FROM NOW (CONFERENCE BOARD) ITEM 2 = JOBS 6 MONTHS FROM NOW (CONFERENCE BOARD) ITEM 3 = FAMILY INCOME 6 MONTHS FROM NOW (CONFERENCE BOARD) ITEM 4 =…
560 runs - 0 likes - 4 downloads - 4 reach - 14 impact
72 instances - 4 features - 6 classes - 0 missing values
Fast training of support vector machines using sequential minimal optimization. In Bernhard Schölkopf, Christopher J. C. Burges, and Alexander J. Smola, editors, Advances in Kernel Methods - Support…
564 runs - 0 likes - 11 downloads - 11 reach - 23 impact
36974 instances - 124 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
567 runs - 0 likes - 14 downloads - 14 reach - 15 impact
40768 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
572 runs - 0 likes - 6 downloads - 6 reach - 14 impact
100 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
575 runs - 0 likes - 9 downloads - 9 reach - 15 impact
1000 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
581 runs - 0 likes - 11 downloads - 11 reach - 15 impact
20640 instances - 9 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
581 runs - 0 likes - 5 downloads - 5 reach - 14 impact
400 instances - 6 features - 4 classes - 0 missing values
Dataset from 'Pattern Recognition and Neural Networks' by B.D. Ripley, Cambridge University Press (1996), ISBN 0-521-46086-7. The background to the datasets is described in section 1.4; this file…
587 runs - 0 likes - 5 downloads - 5 reach - 14 impact
61 instances - 19 features - 4 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
589 runs - 0 likes - 12 downloads - 12 reach - 15 impact
22784 instances - 9 features - 2 classes - 0 missing values
Datasets from the Data and Story Library (DASL), a project illustrating the use of basic statistical methods, converted to ARFF format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
593 runs - 0 likes - 7 downloads - 7 reach - 14 impact
478 instances - 11 features - 3 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
594 runs - 0 likes - 8 downloads - 8 reach - 15 impact
1000 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
598 runs - 0 likes - 8 downloads - 8 reach - 15 impact
1000 instances - 6 features - 2 classes - 0 missing values
Data Sets for 'Regression Models for Time Series Analysis' by B. Kedem and K. Fokianos, Wiley 2002. Submitted by Kostas Fokianos (fokianos@ucy.ac.cy) [8/Nov/02] (176k) Note: - attribute names were…
599 runs - 0 likes - 11 downloads - 11 reach - 15 impact
1024 instances - 3 features - 4 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
600 runs - 0 likes - 12 downloads - 12 reach - 15 impact
1000 instances - 101 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
602 runs - 1 likes - 12 downloads - 13 reach - 15 impact
13750 instances - 41 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
604 runs - 0 likes - 14 downloads - 14 reach - 15 impact
22784 instances - 17 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
604 runs - 0 likes - 9 downloads - 9 reach - 15 impact
1000 instances - 26 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
608 runs - 1 likes - 9 downloads - 10 reach - 15 impact
1000 instances - 26 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
608 runs - 0 likes - 9 downloads - 9 reach - 15 impact
1000 instances - 26 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
614 runs - 0 likes - 9 downloads - 9 reach - 15 impact
1000 instances - 51 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
615 runs - 0 likes - 9 downloads - 9 reach - 15 impact
1000 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
617 runs - 0 likes - 11 downloads - 11 reach - 15 impact
1000 instances - 26 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
618 runs - 0 likes - 12 downloads - 12 reach - 15 impact
40768 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
620 runs - 0 likes - 10 downloads - 10 reach - 15 impact
1000 instances - 51 features - 2 classes - 0 missing values