Data
No data.
306 runs - 0 likes - 3 downloads - 3 reach - 1 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
309 runs - 0 likes - 6 downloads - 6 reach - 2 impact
1000000 instances - 35 features - 6 classes - 0 missing values
#modelage
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
224 instances - 20 features - 6 classes - 205 missing values
Human Activity Recognition (HAR) database built from the recordings of 30 subjects performing activities of daily living (ADL) while carrying a waist-mounted smartphone with embedded inertial sensors.…
21322 runs - 0 likes - 23 downloads - 23 reach - 34 impact
10299 instances - 562 features - 6 classes - 0 missing values
No data.
143 runs - 0 likes - 4 downloads - 4 reach - 2 impact
1000000 instances - 39 features - 6 classes - 0 missing values
Automated file upload of BNG(anneal)
100 runs - 0 likes - 3 downloads - 3 reach - 3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
wine-quality-red-pmlb
31 runs - 1 likes - 0 downloads - 1 reach - 12 impact
1599 instances - 12 features - 6 classes - 0 missing values
Relevant Information: -- The database contains 3 potential classes, one for the number of times a certain type of solar flare occurred in a 24-hour period. -- Each instance represents captured features…
31 runs - 0 likes - 0 downloads - 0 reach - 11 impact
1066 instances - 13 features - 6 classes - 0 missing values
* Source: JP Marques de Sá, INEB-Instituto de Engenharia Biomédica, Porto, Portugal; e-mail: jpmdesa '@' gmail.com J Jossinet, inserm, Lyon, France * Data Set Information: Impedance measurements…
280 runs - 0 likes - 5 downloads - 5 reach - 5 impact
106 instances - 10 features - 6 classes - 0 missing values
Source: James P Bridge, Sean B Holden and Lawrence C Paulson University of Cambridge Computer Laboratory William Gates Building 15 JJ Thomson Avenue Cambridge CB3 0FD UK +44 (0)1223 763500…
24399 runs - 1 likes - 20 downloads - 21 reach - 34 impact
6118 instances - 52 features - 6 classes - 0 missing values
Datasets for `Pattern Recognition and Neural Networks' by B.D. Ripley. Cambridge University Press (1996) ISBN 0-521-46086-7 The…
640 runs - 0 likes - 6 downloads - 6 reach - 6 impact
214 instances - 10 features - 6 classes - 0 missing values
This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics, (b) its assigned insurance risk rating, (c) its normalized losses in use as…
3252 runs - 2 likes - 25 downloads - 27 reach - 2 impact
205 instances - 26 features - 6 classes - 59 missing values
1. Title: Dermatology Database 2. Source Information: (a) Original owners: -- 1. Nilsel Ilter, M.D., Ph.D., Gazi University, School of Medicine 06510 Ankara, Turkey Phone: +90 (312) 214 1080 -- 2. H.…
1752 runs - 0 likes - 13 downloads - 13 reach - 2 impact
366 instances - 35 features - 6 classes - 8 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
490 runs - 0 likes - 4 downloads - 4 reach - 5 impact
364 instances - 33 features - 6 classes - 101 missing values
One of the datasets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff. It contains data on the DMFT Index (Decayed, Missing, and Filled Teeth) before and after different prevention…
26667 runs - 0 likes - 11 downloads - 11 reach - 34 impact
797 instances - 5 features - 6 classes - 0 missing values
CODING: ITEM 1 = BUSINESS CONDITIONS 6 MONTHS FROM NOW (CONFERENCE BOARD) ITEM 2 = JOBS 6 MONTHS FROM NOW (CONFERENCE BOARD) ITEM 3 = FAMILY INCOME 6 MONTHS FROM NOW (CONFERENCE BOARD) ITEM 4 =…
560 runs - 0 likes - 4 downloads - 4 reach - 6 impact
72 instances - 4 features - 6 classes - 0 missing values
The database consists of the multi-spectral values of pixels in 3x3 neighbourhoods in a satellite image, and the classification associated with the central pixel in each neighbourhood. The aim is to…
27180 runs - 2 likes - 23 downloads - 25 reach - 2 impact
6430 instances - 37 features - 6 classes - 0 missing values
1. Title: Glass Identification Database 2. Sources: (a) Creator: B. German -- Central Research Establishment Home Office Forensic Science Service Aldermaston, Reading, Berkshire RG7 4PN (b) Donor:…
1776 runs - 0 likes - 50 downloads - 50 reach - 1 impact
214 instances - 10 features - 6 classes - 0 missing values
No data.
874 runs - 0 likes - 6 downloads - 6 reach - 2 impact
71 instances - 63 features - 6 classes - 0 missing values
No data.
215 runs - 0 likes - 7 downloads - 7 reach - 10 impact
204 instances - 5833 features - 6 classes - 0 missing values
No data.
220 runs - 0 likes - 7 downloads - 7 reach - 10 impact
336 instances - 7903 features - 6 classes - 0 missing values
### Description Synthetic Control Chart Time Series. This is actually time series classification. ### Sources ``` * Original Owner and Donor Dr Robert Alcock rob@skyblue.csd.auth.gr ``` ### Dataset…
20354 runs - 0 likes - 10 downloads - 10 reach - 40 impact
600 instances - 62 features - 6 classes - 0 missing values
### Description Gas Sensor Array Drift Dataset Data Set ### Sources ``` (a) Creators: Alexander Vergara (vergara '@' ucsd.edu) BioCircuits Institute University of California San Diego San Diego,…
18352 runs - 1 likes - 20 downloads - 21 reach - 34 impact
13910 instances - 129 features - 6 classes - 0 missing values
A Vergara, S Vembu, T Ayhan, M Ryan, M Homer, R Huerta. "Chemical gas sensor drift compensation using classifier ensembles." Sensors and Actuators B: Chemical 166 (2012): 320-329. I Rodriguez-Lujan, J…
68 runs - 1 likes - 10 downloads - 11 reach - 5 impact
13910 instances - 130 features - 6 classes - 0 missing values
The experiments were carried out with a group of 30 volunteers within an age bracket of 19-48 years. They performed a protocol of activities composed of six basic activities: three static postures…
83 runs - 0 likes - 9 downloads - 9 reach - 4 impact
180 instances - 68 features - 6 classes - 0 missing values
Data from the RSCTC 2010 Discovery Challenge. All datasets contain between 100 and 400 samples, characterized by values of 20,000 - 65,000 attributes. Samples are assigned to several (2-10) classes.…
11 runs - 0 likes - 3 downloads - 3 reach - 6 impact
214 instances - 45102 features - 7 classes - 0 missing values
No data.
32 runs - 0 likes - 1 downloads - 1 reach - 2 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
33 runs - 0 likes - 4 downloads - 4 reach - 2 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
29 runs - 0 likes - 6 downloads - 6 reach - 2 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
29 runs - 0 likes - 4 downloads - 4 reach - 2 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
27 runs - 1 likes - 4 downloads - 5 reach - 2 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
27 runs - 0 likes - 5 downloads - 5 reach - 2 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
28 runs - 0 likes - 3 downloads - 3 reach - 2 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
27 runs - 0 likes - 2 downloads - 2 reach - 2 impact
1000000 instances - 26 features - 7 classes - 0 missing values
eating
9413 runs - 0 likes - 15 downloads - 15 reach - 43 impact
945 instances - 6374 features - 7 classes - 0 missing values
No data.
211 runs - 0 likes - 3 downloads - 3 reach - 2 impact
1000000 instances - 20 features - 7 classes - 0 missing values
No data.
63 runs - 0 likes - 3 downloads - 3 reach - 1 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
65 runs - 1 likes - 2 downloads - 3 reach - 1 impact
1000000 instances - 18 features - 7 classes - 0 missing values
Normalized version of the Forest Covertype dataset (see version 1), so that the numerical values are between 0 and 1. Contains the forest cover type for 30 x 30 meter cells obtained from US Forest…
342 runs - 1 likes - 39 downloads - 40 reach - 2 impact
581012 instances - 55 features - 7 classes - 0 missing values
No data.
90 runs - 0 likes - 4 downloads - 4 reach - 1 impact
137781 instances - 10 features - 7 classes - 0 missing values
No data.
108 runs - 0 likes - 4 downloads - 4 reach - 10 impact
927 instances - 10129 features - 7 classes - 0 missing values
No data.
65 runs - 0 likes - 8 downloads - 8 reach - 1 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
75 runs - 0 likes - 2 downloads - 2 reach - 1 impact
137781 instances - 10 features - 7 classes - 0 missing values
No data.
291 runs - 0 likes - 4 downloads - 4 reach - 1 impact
1000000 instances - 18 features - 7 classes - 0 missing values
No data.
27 runs - 1 likes - 3 downloads - 4 reach - 2 impact
1000000 instances - 26 features - 7 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
8237 instances - 801 features - 7 classes - 0 missing values
This is the original version of the famous covertype dataset in ARFF format. Predicting forest cover type from cartographic variables only (no remotely sensed data). The actual forest cover type for a…
2 runs - 1 likes - 14 downloads - 15 reach - 13 impact
581012 instances - 55 features - 7 classes - 0 missing values
shuttle-pmlb
6 runs - 0 likes - 2 downloads - 2 reach - 13 impact
58000 instances - 10 features - 7 classes - 0 missing values
The instances were drawn randomly from a database of 7 outdoor images. The images were hand-segmented to create a classification for every pixel. Each instance is a 3x3 region. __Major changes w.r.t.…
7680 runs - 0 likes - 2 downloads - 2 reach - 13 impact
2310 instances - 20 features - 7 classes - 0 missing values
Automated file upload of BNG(segment)
99 runs - 0 likes - 1 downloads - 1 reach - 2 impact
1000000 instances - 20 features - 7 classes - 0 missing values
Citation Request: This dataset is publicly available for research. The details are described in [Cortez et al., 2009]. Please include this citation if you plan to use this database: P. Cortez, A.…
64 runs - 1 likes - 5 downloads - 6 reach - 7 impact
4898 instances - 12 features - 7 classes - 0 missing values
Predicting forest cover type from cartographic variables only (no remotely sensed data). The actual forest cover type for a given observation (30 x 30 meter cell) was determined from US Forest Service…
216 runs - 0 likes - 11 downloads - 11 reach - 2 impact
110393 instances - 55 features - 7 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
537 runs - 0 likes - 4 downloads - 4 reach - 6 impact
285 instances - 8 features - 7 classes - 27 missing values
A simple database containing 17 Boolean-valued attributes describing animals. The "type" attribute appears to be the class attribute. Notes: * I find it unusual that there are 2 instances of "frog"…
168 runs - 2 likes - 16 downloads - 18 reach - 1 impact
101 instances - 17 features - 7 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1187 runs - 1 likes - 10 downloads - 11 reach - 1 impact
412 instances - 9 features - 7 classes - 96 missing values
The instances were drawn randomly from a database of 7 outdoor images. The images were hand-segmented to create a classification for every pixel. Each instance is a 3x3 region. ### Attribute…
23138 runs - 0 likes - 22 downloads - 22 reach - 2 impact
2310 instances - 20 features - 7 classes - 0 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
366 runs - 0 likes - 10 downloads - 10 reach - 6 impact
8844 instances - 61 features - 7 classes - 51515 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
354 runs - 0 likes - 7 downloads - 7 reach - 6 impact
7485 instances - 61 features - 7 classes - 52048 missing values
__Changes w.r.t. version 1: included one target factor with 7 levels as target variable for the classification. Also deleted the previous 7 binary target variables.__ A dataset of steel plates'…
7076 runs - 1 likes - 2 downloads - 3 reach - 7 impact
1941 instances - 28 features - 7 classes - 0 missing values
Re-upload of the dataset as it is present in the Penn ML Benchmark (https://github.com/EpistasisLab/penn-ml-benchmarks/tree/master/datasets/classification/fars). It's a dataset on traffic accidents,…
1 runs - 0 likes - 0 downloads - 0 reach - 13 impact
100968 instances - 30 features - 8 classes - 0 missing values
Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative learning. The…
6779 runs - 0 likes - 0 downloads - 0 reach - 11 impact
1080 instances - 82 features - 8 classes - 1396 missing values
The data contain information on 9144 samples from 220 spectral bands. The classes represent land-use types: alfalfa, corn, grass, hay, oats, soybeans, trees, and wheat.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9144 instances - 221 features - 8 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-250-drift-au6-cd1-500 * Abstract: AutoUniv is an advanced data generator for classification tasks. The aim is to reflect the nuances and…
11011 runs - 0 likes - 9 downloads - 9 reach - 39 impact
750 instances - 41 features - 8 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-cd1-400 * Abstract: AutoUniv is an advanced data generator for classification tasks. The aim is to reflect the nuances and heterogeneity…
144 runs - 0 likes - 3 downloads - 3 reach - 5 impact
400 instances - 41 features - 8 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-1000 * Abstract: AutoUniv is an advanced data generator for classification tasks. The aim is to reflect the nuances and heterogeneity of…
11010 runs - 0 likes - 16 downloads - 16 reach - 39 impact
1000 instances - 41 features - 8 classes - 0 missing values
1. Title: Protein Localization Sites 2. Creator and Maintainer: Kenta Nakai Institute of Molecular and Cellular Biology Osaka University 1-3 Yamada-oka, Suita 565 Japan nakai@imcb.osaka-u.ac.jp…
1803 runs - 0 likes - 12 downloads - 12 reach - 2 impact
336 instances - 8 features - 8 classes - 0 missing values
ARFF version of UCI dataset 'flags'. Creators: Collected primarily from the "Collins Gem Guide to Flags": Collins Publishers (1986). Donor: Richard S. Forsyth. Date 5/15/1990 This data file contains…
103 runs - 0 likes - 8 downloads - 8 reach - 9 impact
194 instances - 30 features - 8 classes - 0 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
434 runs - 0 likes - 10 downloads - 10 reach - 6 impact
7019 instances - 61 features - 8 classes - 48089 missing values
No data.
211 runs - 0 likes - 4 downloads - 4 reach - 10 impact
313 instances - 5805 features - 8 classes - 0 missing values
Data from the RSCTC 2010 Discovery Challenge. All datasets contain between 100 and 400 samples, characterized by values of 20,000 - 65,000 attributes. Samples are assigned to several (2-10) classes.…
1 runs - 0 likes - 2 downloads - 2 reach - 6 impact
383 instances - 54676 features - 9 classes - 0 missing values
No data.
288 runs - 0 likes - 2 downloads - 2 reach - 2 impact
1000000 instances - 15 features - 9 classes - 0 missing values
* Abstract: 9-class version of the poker-hand dataset, with the minority class removed.
1 runs - 0 likes - 2 downloads - 2 reach - 6 impact
1025000 instances - 11 features - 9 classes - 0 missing values
This file contains 9 sets of sanitized user data drawn from the command histories of 8 UNIX computer users at Purdue over the course of up to 2 years (USER0 and USER1 were generated by the same…
11 runs - 0 likes - 8 downloads - 8 reach - 6 impact
9100 instances - 3 features - 9 classes - 0 missing values
### Description This is a data set containing 1080 documents of free text business descriptions of Brazilian companies categorized into a subset of 9 categories. ### Source ``` Patrick Marques…
30032 runs - 0 likes - 15 downloads - 15 reach - 46 impact
1080 instances - 857 features - 9 classes - 0 missing values
DATA-SETS FROM DIGGLE, P.J. (1990). TIME SERIES : A BIOSTATISTICAL INTRODUCTION. Oxford University Press. Table: Table A2 Wool prices Information about the dataset CLASSTYPE: numeric CLASSINDEX: none…
626 runs - 0 likes - 6 downloads - 6 reach - 6 impact
310 instances - 9 features - 9 classes - 0 missing values
No data.
296 runs - 0 likes - 5 downloads - 5 reach - 13 impact
96 instances - 4027 features - 9 classes - 19667 missing values
No data.
219 runs - 0 likes - 5 downloads - 5 reach - 10 impact
414 instances - 6430 features - 9 classes - 0 missing values
This dataset records 640 time series of 12 LPC cepstrum coefficients taken from nine male speakers. The data was collected for examining our newly developed classifier for multidimensional curves…
23156 runs - 0 likes - 11 downloads - 11 reach - 46 impact
9961 instances - 15 features - 9 classes - 0 missing values
This database has been artificially generated. It describes the structure of the capital letters A, C, D, E, F, G, H, L, P, R, indicated by a number 1-10, in that order (A=1,C=2,...). Each letter's…
24305 runs - 0 likes - 10 downloads - 10 reach - 49 impact
10218 instances - 8 features - 10 classes - 0 missing values
* Title of Database: Spoken Arabic Digit * Abstract: This dataset contains time series of mel-frequency cepstrum coefficients (MFCCs) corresponding to spoken Arabic digits. Includes data from 44 males…
1 runs - 0 likes - 6 downloads - 6 reach - 6 impact
263256 instances - 15 features - 10 classes - 0 missing values
* Abstract: Purpose is to predict poker hands * Source - Creators: Robert Cattral (cattral '@' gmail.com) Franz Oppacher (oppacher '@' scs.carleton.ca) Carleton University, Department of Computer…
1 runs - 0 likes - 4 downloads - 4 reach - 6 impact
1025009 instances - 11 features - 10 classes - 0 missing values
No data.
194 runs - 0 likes - 3 downloads - 3 reach - 2 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
67 runs - 0 likes - 2 downloads - 2 reach - 2 impact
1000000 instances - 17 features - 10 classes - 0 missing values
No data.
48 runs - 1 likes - 4 downloads - 5 reach - 2 impact
1000000 instances - 77 features - 10 classes - 0 missing values
No data.
50 runs - 0 likes - 1 downloads - 1 reach - 2 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
373 runs - 0 likes - 8 downloads - 8 reach - 51 impact
918 instances - 3013 features - 10 classes - 0 missing values
No data.
216 runs - 0 likes - 12 downloads - 12 reach - 52 impact
11162 instances - 11466 features - 10 classes - 0 missing values
Dataset created to study concept drift in stream mining. It is constructed by combining the Covertype, Poker-Hand, and Electricity datasets. More details can be found in: Albert Bifet, Geoff Holmes,…
332 runs - 0 likes - 26 downloads - 26 reach - 3 impact
1455525 instances - 73 features - 10 classes - 0 missing values
No data.
304 runs - 0 likes - 6 downloads - 6 reach - 2 impact
1000000 instances - 25 features - 10 classes - 0 missing values
Normalized version of the pokerhand data set. Automated file upload of pokerhand-normalized.arff
314 runs - 0 likes - 11 downloads - 11 reach - 2 impact
829201 instances - 11 features - 10 classes - 0 missing values
No data.
377 runs - 0 likes - 10 downloads - 10 reach - 51 impact
913 instances - 3101 features - 10 classes - 0 missing values
No data.
290 runs - 0 likes - 5 downloads - 5 reach - 2 impact
1000000 instances - 77 features - 10 classes - 0 missing values
No data.
52 runs - 0 likes - 3 downloads - 3 reach - 2 impact
1000000 instances - 48 features - 10 classes - 0 missing values
No data.
52 runs - 0 likes - 2 downloads - 2 reach - 1 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
293 runs - 0 likes - 2 downloads - 2 reach - 2 impact
1000000 instances - 17 features - 10 classes - 0 missing values
No data.
51 runs - 1 likes - 4 downloads - 5 reach - 2 impact
1000000 instances - 48 features - 10 classes - 0 missing values
50% stratified subsample of the original SVHN data
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
49644 instances - 3073 features - 10 classes - 0 missing values
10% stratified subsample of the original SVHN data
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9927 instances - 3073 features - 10 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
5 runs - 0 likes - 0 downloads - 0 reach - 7 impact
10000 instances - 7201 features - 10 classes - 0 missing values
Fashion-MNIST is a dataset of Zalando's article images, consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a…
436 runs - 0 likes - 10 downloads - 10 reach - 14 impact
70000 instances - 785 features - 10 classes - 0 missing values