Data
Filter results by:
Data from StatLib (ftp stat.cmu.edu/datasets) These data are those collected in a cloud-seeding experiment in Tasmania between mid-1964 and January 1971. Their analysis, using regression techniques…
66 runs0 likes2 downloads2 reach1 impact
108 instances - 7 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Survival treated as the class attribute As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
12 runs0 likes2 downloads2 reach1 impact
130 instances - 10 features - 0 classes - 97 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Tumor-size treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
0 runs0 likes3 downloads3 reach1 impact
286 instances - 10 features - 0 classes - 9 missing values
This is a family of datasets synthetically generated from a realistic simulation of the dynamics of a Unimation Puma 560 robot arm. There are eight datastets in this family . In this repository we…
2 runs0 likes5 downloads5 reach1 impact
8192 instances - 9 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Gasoline comnsumption is being treated as…
2 runs0 likes0 downloads0 reach1 impact
27 instances - 5 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Electicity usage is being treated as the…
4 runs0 likes0 downloads0 reach1 impact
55 instances - 3 features - 0 classes - 0 missing values
As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning with encoding length selection. In Progress in Connectionist-Based Information Systems.…
2 runs0 likes1 downloads1 reach1 impact
200 instances - 11 features - 0 classes - 0 missing values
The problem concerns Relative CPU Performance Data. More information can be obtained in the UCI Machine Learning repository (http://www.ics.uci.edu/~mlearn/MLSummary.html). The used attributes are :…
2 runs0 likes2 downloads2 reach1 impact
209 instances - 7 features - 0 classes - 0 missing values
No data.
307 runs0 likes3 downloads3 reach2 impact
1000000 instances - 41 features - 3 classes - 0 missing values
No data.
291 runs0 likes4 downloads4 reach1 impact
1000000 instances - 18 features - 7 classes - 0 missing values
No data.
353 runs0 likes16 downloads16 reach1 impact
120919 instances - 1002 features - 2 classes - 0 missing values
No data.
167 runs0 likes8 downloads8 reach2 impact
399940 instances - 1002 features - 2 classes - 0 missing values
No data.
293 runs0 likes2 downloads2 reach2 impact
1000000 instances - 17 features - 10 classes - 0 missing values
No data.
65 runs0 likes3 downloads3 reach1 impact
1000000 instances - 40 features - 2 classes - 0 missing values
No data.
309 runs0 likes6 downloads6 reach2 impact
1000000 instances - 35 features - 6 classes - 0 missing values
No data.
296 runs0 likes7 downloads7 reach1 impact
1000000 instances - 61 features - 2 classes - 0 missing values
No data.
75 runs0 likes2 downloads2 reach1 impact
137781 instances - 10 features - 7 classes - 0 missing values
No data.
310 runs0 likes2 downloads2 reach1 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
326 runs0 likes4 downloads4 reach2 impact
1000000 instances - 14 features - 2 classes - 0 missing values
No data.
304 runs0 likes3 downloads3 reach1 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
331 runs0 likes7 downloads7 reach1 impact
1000000 instances - 20 features - 2 classes - 0 missing values
No data.
863 runs0 likes11 downloads11 reach1 impact
39366 instances - 10 features - 2 classes - 0 missing values
No data.
52 runs0 likes2 downloads2 reach1 impact
No data.
306 runs0 likes3 downloads3 reach1 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
52 runs0 likes3 downloads3 reach2 impact
1000000 instances - 48 features - 10 classes - 0 missing values
No data.
960 runs0 likes8 downloads8 reach1 impact
55296 instances - 10 features - 3 classes - 0 missing values
No data.
163 runs0 likes5 downloads5 reach1 impact
1000000 instances - 28 features - 2 classes - 0 missing values
No data.
68 runs0 likes4 downloads4 reach1 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
326 runs0 likes4 downloads4 reach2 impact
1000000 instances - 16 features - 2 classes - 0 missing values
No data.
315 runs0 likes2 downloads2 reach2 impact
295245 instances - 11 features - 5 classes - 0 missing values
No data.
225 runs0 likes7 downloads7 reach2 impact
1000000 instances - 21 features - 2 classes - 0 missing values
This is the famous covertype dataset in its binary version, retrieved 2013-11-13 from the libSVM site (called covtype.binary there). Additional to the preprocessing done there (see LibSVM site for…
22 runs0 likes8 downloads8 reach7 impact
581012 instances - 55 features - 2 classes - 0 missing values
This data set addresses a control problem, namely flying a F16 aircraft. The attributes describe the status of the aeroplane, while the goal is to predict the control action on the ailerons of the…
0 runs0 likes5 downloads5 reach5 impact
13750 instances - 41 features - 0 classes - 0 missing values
Information about customers consists of 86 variables and includes product usage data and socio-demographic data derived from zip area codes. The data was supplied by the Dutch data mining company…
0 runs0 likes2 downloads2 reach5 impact
9822 instances - 86 features - 0 classes - 0 missing values
University of Sao Paulo, School of Art, Sciences and Humanities, Sao Paulo, SP, Brazil ### LIBRAS Movement Database LIBRAS, acronym of the Portuguese name "LIngua BRAsileira de Sinais", is the…
0 runs0 likes4 downloads4 reach7 impact
360 instances - 91 features - 0 classes - 0 missing values
1. Title: Part of the IRAS Low Resolution Spectrometer Database 2. Sources: (a) Originator: Infra-Red Astronomy Satellite Project Database (b) Donor: John Stutz (c) Date:…
1243 runs0 likes43 downloads43 reach7 impact
531 instances - 103 features - 48 classes - 0 missing values
Yeast dataset Past Usage: André Elisseeff and Jason Weston. A kernel method for multi-labelled classification. In Thomas G. Dietterich, Susan Becker, and Zoubin Ghahramani, editors, Advances in…
139 runs0 likes8 downloads8 reach6 impact
2417 instances - 117 features - 2 classes - 0 missing values
No data.
206 runs0 likes3 downloads3 reach2 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
67 runs0 likes2 downloads2 reach2 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
332 runs0 likes4 downloads4 reach2 impact
1000000 instances - 17 features - 2 classes - 0 missing values
No data.
311 runs0 likes3 downloads3 reach2 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
65 runs0 likes8 downloads8 reach1 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
310 runs0 likes4 downloads4 reach2 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
290 runs0 likes5 downloads5 reach2 impact
1000000 instances - 77 features - 10 classes - 0 missing values
No data.
874 runs0 likes6 downloads6 reach2 impact
71 instances - 63 features - 6 classes - 0 missing values
No data.
940 runs0 likes5 downloads5 reach2 impact
74 instances - 63 features - 4 classes - 0 missing values
No data.
941 runs0 likes4 downloads4 reach2 impact
74 instances - 63 features - 4 classes - 0 missing values
No data.
988 runs0 likes3 downloads3 reach2 impact
74 instances - 63 features - 4 classes - 0 missing values
No data.
400 runs0 likes6 downloads6 reach2 impact
45164 instances - 75 features - 11 classes - 0 missing values
ARFF version of UCI dataset 'flags'. Creators: Collected primarily from the "Collins Gem Guide to Flags": Collins Publishers (1986). Donor: Richard S. Forsyth. Date 5/15/1990 This data file contains…
103 runs0 likes8 downloads8 reach9 impact
194 instances - 30 features - 8 classes - 0 missing values
1. Title: Ozone Level Detection 2. Source: Kun Zhang zhang.kun05 '@' gmail.com Department of Computer Science, Xavier University of Lousiana Wei Fan wei.fan '@' gmail.com IBM T.J.Watson Research…
0 runs0 likes1 downloads1 reach5 impact
2536 instances - 73 features - 0 classes - 0 missing values
Speaker independent recognition of the eleven steady state vowels of British English using a specified training set of lpc derived log area ratios. Collected by David Deterding (data and…
24479 runs0 likes14 downloads14 reach35 impact
990 instances - 13 features - 11 classes - 0 missing values
This is one of a family of datasets synthetically generated from a realistic simulation of the dynamics of a Unimation Puma 560 robot arm. There are eight datastets in this family . In this repository…
0 runs0 likes5 downloads5 reach5 impact
8192 instances - 33 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) The infamous Longley data, "An appraisal of least-squares programs from the point of view of the user", JASA, 62(1967) p819-841. Variables are: Number of…
3 runs0 likes1 downloads1 reach1 impact
16 instances - 7 features - 0 classes - 0 missing values
This data set concerns the study of the factors affecting patterns of insulin-dependent diabetes mellitus in children. The objective is to investigate the dependence of the level of serum C-peptide on…
2 runs0 likes0 downloads0 reach1 impact
43 instances - 3 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Points scored per minute is being treated as…
2 runs0 likes0 downloads0 reach1 impact
96 instances - 5 features - 0 classes - 0 missing values
This data set is also obtained from the task of controlling a F16 aircraft, although the target variable and attributes are different from the ailerons domain. In this case the goal variable is…
2 runs0 likes5 downloads5 reach1 impact
16599 instances - 19 features - 0 classes - 0 missing values
The task consists of Learning Quantitative Structure Activity Relationships (QSARs). The Inhibition of Dihydrofolate Reductase by Pyrimidines.The data are described in: King, Ross .D., Muggleton,…
6 runs0 likes1 downloads1 reach1 impact
74 instances - 28 features - 0 classes - 0 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
10 runs0 likes0 downloads0 reach1 impact
294 instances - 14 features - 0 classes - 782 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Weight treated as the class attribute. Identifier deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric…
10 runs0 likes2 downloads2 reach1 impact
158 instances - 8 features - 0 classes - 87 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
354 runs0 likes7 downloads7 reach6 impact
7485 instances - 61 features - 7 classes - 52048 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
434 runs0 likes10 downloads10 reach6 impact
7019 instances - 61 features - 8 classes - 48089 missing values
No data.
414 runs0 likes8 downloads8 reach51 impact
690 instances - 8262 features - 10 classes - 0 missing values
No data.
220 runs0 likes7 downloads7 reach10 impact
336 instances - 7903 features - 6 classes - 0 missing values
No data.
108 runs0 likes4 downloads4 reach10 impact
927 instances - 10129 features - 7 classes - 0 missing values
No data.
377 runs0 likes10 downloads10 reach51 impact
913 instances - 3101 features - 10 classes - 0 missing values
No data.
219 runs0 likes5 downloads5 reach10 impact
414 instances - 6430 features - 9 classes - 0 missing values
No data.
215 runs0 likes7 downloads7 reach10 impact
204 instances - 5833 features - 6 classes - 0 missing values
No data.
426 runs0 likes15 downloads15 reach76 impact
2463 instances - 2001 features - 17 classes - 0 missing values
No data.
67 runs0 likes11 downloads11 reach11 impact
9558 instances - 26833 features - 44 classes - 0 missing values
No data.
222 runs0 likes10 downloads10 reach7 impact
1504 instances - 2887 features - 13 classes - 0 missing values
No data.
428 runs0 likes12 downloads12 reach52 impact
1003 instances - 3183 features - 10 classes - 0 missing values
No data.
268 runs0 likes9 downloads9 reach36 impact
3075 instances - 12433 features - 6 classes - 0 missing values
No data.
373 runs0 likes8 downloads8 reach51 impact
918 instances - 3013 features - 10 classes - 0 missing values
No data.
159 runs0 likes11 downloads11 reach11 impact
1657 instances - 3759 features - 25 classes - 0 missing values
No data.
264 runs0 likes11 downloads11 reach36 impact
3204 instances - 13196 features - 6 classes - 0 missing values
No data.
211 runs0 likes4 downloads4 reach10 impact
313 instances - 5805 features - 8 classes - 0 missing values
No data.
163 runs0 likes13 downloads13 reach11 impact
1560 instances - 8461 features - 20 classes - 0 missing values
No data.
216 runs0 likes12 downloads12 reach52 impact
11162 instances - 11466 features - 10 classes - 0 missing values
No data.
203 runs0 likes5 downloads5 reach10 impact
878 instances - 7455 features - 10 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
12 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
11 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
13 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes1 downloads1 reach5 impact
25 instances - 10 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
30 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
26 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
79 instances - 321 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
37 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
14 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
22 instances - 111 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
10 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
7 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
8 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
13 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
34 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
22 instances - 629 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach5 impact
32 instances - 1143 features - 0 classes - 0 missing values
This file contains 9 sets of sanitized user data drawn from the command histories of 8 UNIX computer users at Purdue over the course of up to 2 years (USER0 and USER1 were generated by the same…
11 runs0 likes8 downloads8 reach6 impact
9100 instances - 3 features - 9 classes - 0 missing values
This database contains the HTML source of web pages plus the ratings of a single user on these web pages. The web pages are on four separate subjects (Bands- recording artists; Goats; Sheep; and…
0 runs0 likes1 downloads1 reach10 impact
131 instances - 3 features - 3 classes - 0 missing values
This database contains the HTML source of web pages plus the ratings of a single user on these web pages. The web pages are on four separate subjects (Bands- recording artists; Goats; Sheep; and…
0 runs0 likes3 downloads3 reach10 impact
65 instances - 3 features - 2 classes - 0 missing values