Data
Filter results by:
* Dataset: DBworld e-mails data set Task: dbworld-subjects-stemmed * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set…
71 runs0 likes3 downloads3 reach13 impact
64 instances - 230 features - 2 classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-subjects * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set…
40 runs0 likes3 downloads3 reach13 impact
64 instances - 243 features - 2 classes - 0 missing values
* Abstract: Purpose is to predict poker hands * Source - Creators: Robert Cattral (cattral '@' gmail.com) Franz Oppacher (oppacher '@' scs.carleton.ca) Carleton University, Department of Computer…
1 runs0 likes5 downloads5 reach15 impact
1025009 instances - 11 features - 10 classes - 0 missing values
* Abstract: 9-class version of poker-hand dataset, it was removed the minority class.
1 runs0 likes3 downloads3 reach14 impact
1025000 instances - 11 features - 9 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: We used binary encoding for each feature (o, b, x), so the number of features is 42*3 = 126
0 runs0 likes3 downloads3 reach16 impact
67557 instances - 127 features - 0 classes - 0 missing values
#Dataset from the LIBSVM multiclass data repository.
0 runs0 likes3 downloads3 reach18 impact
108000 instances - 129 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: Regenerate features by the authors' matlab scripts (see Sec. C of Appendix A), then randomly select 10% instances from the…
0 runs0 likes2 downloads2 reach16 impact
98528 instances - 101 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository.
1 runs0 likes2 downloads2 reach16 impact
1025010 instances - 11 features - 0 classes - 0 missing values
This is a corrected version of the previous data file in version 1, which contained a dataset (349 instances) incorrectly merged from the original training and test sets available on UCI (there are…
0 runs0 likes3 downloads3 reach12 impact
267 instances - 45 features - 2 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository.
0 runs0 likes1 downloads1 reach16 impact
49749 instances - 301 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository.
0 runs0 likes1 downloads1 reach16 impact
49749 instances - 301 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository.
0 runs0 likes1 downloads1 reach16 impact
49749 instances - 301 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository.
0 runs0 likes0 downloads0 reach16 impact
49749 instances - 301 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository.
0 runs0 likes0 downloads0 reach16 impact
49749 instances - 301 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository.
0 runs0 likes0 downloads0 reach16 impact
49749 instances - 301 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository.
0 runs0 likes0 downloads0 reach16 impact
49749 instances - 301 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository.
0 runs0 likes0 downloads0 reach16 impact
64700 instances - 301 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: Original data: someone from Germany working with the car industry.
0 runs0 likes1 downloads1 reach16 impact
1243 instances - 23 features - 0 classes - 0 missing values
1. Title: Wisconsin Prognostic Breast Cancer (WPBC) 2. Source Information a) Creators: Dr. William H. Wolberg, General Surgery Dept., University of Wisconsin, Clinical Sciences Center, Madison, WI…
5 runs0 likes4 downloads4 reach9 impact
194 instances - 33 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
2 runs0 likes0 downloads0 reach9 impact
52 instances - 3 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) SUMMARY: Data from an experiment on the affects of machine adjustments on the time to count bolts. Data appear as the STATS (Issue 10) Challenge. DATA:…
4 runs0 likes0 downloads0 reach9 impact
40 instances - 7 features - 0 classes - 0 missing values
The Computer Activity databases are a collection of computer systems activity measures. The data was collected from a Sun Sparcstation 20/712 with 128 Mbytes of memory running in a multi-user…
2 runs1 likes1 downloads2 reach9 impact
8192 instances - 22 features - 0 classes - 0 missing values
No data.
328 runs0 likes3 downloads3 reach12 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
330 runs0 likes5 downloads5 reach12 impact
1000000 instances - 4 features - 2 classes - 0 missing values
This is a commercial application described in Weiss & Indurkhya (1995). The data describes a telecommunication problem. No further information is available. Characteristics: (10000+5000) cases, 49…
2 runs0 likes4 downloads4 reach11 impact
15000 instances - 49 features - 0 classes - 0 missing values
The problem is to learn a regression equation/rule/tree to predict the activity from the descriptive structural attributes. The data and methodology is described in detail in: - King, Ross .D., Hurst,…
5 runs0 likes1 downloads1 reach9 impact
186 instances - 61 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) This is the data set called `DETROIT' in the book `Subset selection in regression' by Alan J. Miller published in the Chapman & Hall series of monographs…
2 runs0 likes0 downloads0 reach10 impact
13 instances - 14 features - 0 classes - 0 missing values
Synthetic dataset. Almost identical to [dataset 152](https://www.openml.org/d/153/edit)
319 runs0 likes4 downloads4 reach12 impact
1000000 instances - 11 features - 2 classes - 0 missing values
No data.
304 runs0 likes7 downloads7 reach12 impact
1000000 instances - 25 features - 10 classes - 0 missing values
Normalized version of the pokerhand data set. Automated file upload of pokerhand-normalized.arff
314 runs0 likes12 downloads12 reach12 impact
829201 instances - 11 features - 10 classes - 0 missing values
No data.
298 runs0 likes3 downloads3 reach12 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
305 runs0 likes2 downloads2 reach12 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
308 runs0 likes2 downloads2 reach12 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
307 runs0 likes2 downloads2 reach12 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
309 runs0 likes3 downloads3 reach12 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
143 runs0 likes4 downloads4 reach12 impact
1000000 instances - 39 features - 6 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Gasoline comnsumption is being treated as…
2 runs0 likes0 downloads0 reach9 impact
27 instances - 5 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Electicity usage is being treated as the…
4 runs0 likes0 downloads0 reach9 impact
55 instances - 3 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) The infamous Longley data, "An appraisal of least-squares programs from the point of view of the user", JASA, 62(1967) p819-841. Variables are: Number of…
3 runs0 likes1 downloads1 reach9 impact
16 instances - 7 features - 0 classes - 0 missing values
This data set concerns the study of the factors affecting patterns of insulin-dependent diabetes mellitus in children. The objective is to investigate the dependence of the level of serum C-peptide on…
2 runs0 likes1 downloads1 reach9 impact
43 instances - 3 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Points scored per minute is being treated as…
2 runs0 likes0 downloads0 reach9 impact
96 instances - 5 features - 0 classes - 0 missing values
This is an artificial data set described in Breiman et al. (1984,p.238) (with variance 1 instead of 2). Generate the values of the 10 attributes independently using the following probabilities: P(X_1…
2 runs1 likes4 downloads5 reach11 impact
40768 instances - 11 features - 0 classes - 0 missing values
This data set is also obtained from the task of controlling a F16 aircraft, although the target variable and attributes are different from the ailerons domain. In this case the goal variable is…
2 runs0 likes7 downloads7 reach11 impact
16599 instances - 19 features - 0 classes - 0 missing values
This database was designed on the basis of data provided by US Census Bureau [http://www.census.gov] (under Lookup Access [http://www.census.gov/cdrom/lookup]: Summary Tape File 1). The data were…
2 runs1 likes3 downloads4 reach10 impact
22784 instances - 9 features - 0 classes - 0 missing values
No data.
307 runs0 likes3 downloads3 reach12 impact
1000000 instances - 41 features - 3 classes - 0 missing values
No data.
291 runs0 likes4 downloads4 reach9 impact
1000000 instances - 17 features - 7 classes - 0 missing values
No data.
167 runs0 likes9 downloads9 reach12 impact
399940 instances - 1002 features - 2 classes - 0 missing values
No data.
70 runs0 likes3 downloads3 reach9 impact
1000000 instances - 28 features - 2 classes - 0 missing values
No data.
72 runs0 likes3 downloads3 reach9 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
194 runs0 likes3 downloads3 reach12 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
73 runs0 likes5 downloads5 reach12 impact
1000000 instances - 16 features - 2 classes - 0 missing values
No data.
87 runs0 likes5 downloads5 reach12 impact
295245 instances - 11 features - 5 classes - 0 missing values
No data.
68 runs0 likes4 downloads4 reach12 impact
1000000 instances - 21 features - 2 classes - 0 missing values
No data.
67 runs0 likes2 downloads2 reach12 impact
1000000 instances - 17 features - 10 classes - 0 missing values
No data.
65 runs0 likes4 downloads4 reach10 impact
1000000 instances - 40 features - 2 classes - 0 missing values
No data.
66 runs0 likes3 downloads3 reach13 impact
1000000 instances - 35 features - 6 classes - 0 missing values
No data.
211 runs0 likes3 downloads3 reach12 impact
1000000 instances - 20 features - 7 classes - 0 missing values
No data.
68 runs0 likes2 downloads2 reach9 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
69 runs0 likes4 downloads4 reach9 impact
1000000 instances - 20 features - 2 classes - 0 missing values
No data.
356 runs0 likes8 downloads8 reach9 impact
131072 instances - 17 features - 2 classes - 0 missing values
No data.
65 runs0 likes5 downloads5 reach9 impact
1000000 instances - 30 features - 4 classes - 0 missing values
No data.
230 runs0 likes4 downloads4 reach12 impact
1000000 instances - 35 features - 2 classes - 0 missing values
No data.
63 runs0 likes2 downloads2 reach12 impact
1000000 instances - 41 features - 3 classes - 0 missing values
No data.
65 runs1 likes2 downloads3 reach9 impact
1000000 instances - 18 features - 7 classes - 0 missing values
Dataset created to study concept drift in stream mining. It is constructed by combining the Covertype, Poker-Hand, and Electricity datasets. More details can be found in: Albert Bifet, Geoff Holmes,…
332 runs0 likes27 downloads27 reach13 impact
1455525 instances - 73 features - 10 classes - 0 missing values
No data.
293 runs0 likes2 downloads2 reach12 impact
1000000 instances - 17 features - 10 classes - 0 missing values
No data.
65 runs0 likes3 downloads3 reach9 impact
1000000 instances - 40 features - 2 classes - 0 missing values
No data.
309 runs0 likes6 downloads6 reach12 impact
1000000 instances - 35 features - 6 classes - 0 missing values
No data.
296 runs0 likes7 downloads7 reach9 impact
1000000 instances - 61 features - 2 classes - 0 missing values
No data.
75 runs0 likes3 downloads3 reach9 impact
137781 instances - 10 features - 7 classes - 0 missing values
No data.
310 runs0 likes2 downloads2 reach9 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
304 runs0 likes3 downloads3 reach9 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
331 runs0 likes7 downloads7 reach9 impact
1000000 instances - 20 features - 2 classes - 0 missing values
No data.
52 runs0 likes2 downloads2 reach11 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
306 runs0 likes3 downloads3 reach9 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
52 runs0 likes3 downloads3 reach12 impact
1000000 instances - 48 features - 10 classes - 0 missing values
No data.
163 runs0 likes5 downloads5 reach9 impact
1000000 instances - 28 features - 2 classes - 0 missing values
No data.
68 runs0 likes4 downloads4 reach9 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
326 runs0 likes4 downloads4 reach12 impact
1000000 instances - 16 features - 2 classes - 0 missing values
No data.
315 runs0 likes2 downloads2 reach12 impact
295245 instances - 11 features - 5 classes - 0 missing values
No data.
225 runs0 likes7 downloads7 reach12 impact
1000000 instances - 21 features - 2 classes - 0 missing values
No data.
73 runs0 likes5 downloads5 reach9 impact
1000000 instances - 30 features - 2 classes - 0 missing values
No data.
50 runs0 likes3 downloads3 reach9 impact
1000000 instances - 61 features - 2 classes - 0 missing values
No data.
90 runs0 likes5 downloads5 reach9 impact
137781 instances - 10 features - 7 classes - 0 missing values
No data.
314 runs1 likes8 downloads9 reach12 impact
1000000 instances - 36 features - 19 classes - 0 missing values
No data.
219 runs0 likes4 downloads4 reach12 impact
1000000 instances - 58 features - 2 classes - 0 missing values
No data.
66 runs0 likes2 downloads2 reach9 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
66 runs0 likes2 downloads2 reach9 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
334 runs0 likes4 downloads4 reach12 impact
1000000 instances - 33 features - 2 classes - 0 missing values
No data.
70 runs0 likes2 downloads2 reach12 impact
1000000 instances - 14 features - 2 classes - 0 missing values
No data.
66 runs0 likes2 downloads2 reach12 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
324 runs0 likes5 downloads5 reach12 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
71 runs0 likes5 downloads5 reach12 impact
1000000 instances - 17 features - 2 classes - 0 missing values
No data.
60 runs0 likes2 downloads2 reach12 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
63 runs0 likes3 downloads3 reach9 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
63 runs0 likes4 downloads4 reach12 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
68 runs0 likes11 downloads11 reach9 impact
1000000 instances - 10 features - 2 classes - 0 missing values
No data.
48 runs1 likes4 downloads5 reach12 impact
1000000 instances - 77 features - 10 classes - 0 missing values
No data.
50 runs0 likes1 downloads1 reach12 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
67 runs0 likes3 downloads3 reach9 impact
1000000 instances - 13 features - 6 classes - 0 missing values