Data
Filter results by:
Multi-label dataset. The genbase dataset contains protein sequences that can be assigned to several classes of protein families.
0 runs0 likes0 downloads0 reach2 impact
662 instances - 1212 features - classes - 0 missing values
Data set shows information about participants of math conference. isPresent is target column for classification task.
0 runs0 likes0 downloads0 reach2 impact
246 instances - 7 features - 2 classes - 0 missing values
analysis of stocks
0 runs0 likes0 downloads0 reach1 impact
245 instances - 15 features - classes - 0 missing values
Data set of around 45 language and 25 Category. Consist of articles.
0 runs0 likes0 downloads0 reach1 impact
65428 instances - 3 features - classes - 0 missing values
Elegibilidade ecommerce
0 runs0 likes1 downloads1 reach1 impact
269177 instances - 2 features - 2 classes - 0 missing values
Citation Request: This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
66 runs0 likes4 downloads4 reach7 impact
277 instances - 10 features - 2 classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-subjects * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set…
40 runs0 likes2 downloads2 reach6 impact
64 instances - 243 features - 2 classes - 0 missing values
led24-pmlb
31 runs0 likes2 downloads2 reach14 impact
3200 instances - 25 features - 10 classes - 0 missing values
flare-pmlb
32 runs0 likes1 downloads1 reach14 impact
1066 instances - 11 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach4 impact
24 instances - 5 features - classes - 0 missing values
* Title: Nursery Database * Abstract: 4-class version of the original Nursery dataset
121 runs0 likes6 downloads6 reach7 impact
12958 instances - 9 features - 4 classes - 0 missing values
Automated file upload of 20_newsgroups.drift
124 runs0 likes2 downloads2 reach8 impact
399940 instances - 1001 features - 2 classes - 0 missing values
TEST STUDENTS DATA
0 runs0 likes0 downloads0 reach0 impact
100 instances - 11 features - classes - 0 missing values
No data.
50 runs0 likes2 downloads2 reach5 impact
1000000 instances - 18 features - 22 classes - 0 missing values
No data.
304 runs0 likes7 downloads7 reach4 impact
1000000 instances - 25 features - 10 classes - 0 missing values
Uploead test
0 runs0 likes0 downloads0 reach0 impact
958 instances - 10 features - classes - 0 missing values
Test
0 runs0 likes1 downloads1 reach0 impact
958 instances - 10 features - classes - 0 missing values
Test
0 runs0 likes0 downloads0 reach0 impact
2 instances - 1 features - classes - 0 missing values
ddfef fvdf
0 runs0 likes0 downloads0 reach0 impact
8 instances - 1 features - classes - 0 missing values
xscdc frfgrg
0 runs0 likes0 downloads0 reach0 impact
3 instances - 1 features - classes - 0 missing values
qsqs
0 runs0 likes0 downloads0 reach0 impact
2 instances - 1 features - classes - 0 missing values
wdwd
0 runs0 likes0 downloads0 reach0 impact
2 instances - 1 features - classes - 0 missing values
dedfef
0 runs0 likes0 downloads0 reach0 impact
2 instances - 1 features - classes - 0 missing values
* Abstract: Predict the Bankruptcy from Qualitative parameters from experts. * Source: Source Information -- Creator : Mr.A.Martin(jayamartin '@' yahoo.com) Mr.J.Uthayakumar (uthayakumar17691 '@'…
147 runs0 likes11 downloads11 reach7 impact
250 instances - 7 features - 2 classes - 0 missing values
dataset for bme
0 runs0 likes0 downloads0 reach0 impact
63 instances - 12 features - classes - 52 missing values
The midwest survey dataset contain individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs0 likes0 downloads0 reach0 impact
2778 instances - 28 features - 10 classes - 1744 missing values
The midwest survey dataset contain individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs0 likes0 downloads0 reach0 impact
2778 instances - 28 features - 10 classes - 1744 missing values
This database contains all legal 8-ply positions in the game of connect-4 in which neither player has won yet, and in which the next move is not forced. Attributes represent board positions on a 6x6…
9329 runs0 likes8 downloads8 reach19 impact
67557 instances - 43 features - 3 classes - 0 missing values
punch sound
0 runs0 likes1 downloads1 reach2 impact
221 instances - 1 features - classes - 0 missing values
dsd efe
0 runs0 likes0 downloads0 reach0 impact
601 instances - 7 features - classes - 0 missing values
Wikidata with top-474 most frequent types and ingoing/outgoing properties as features
0 runs0 likes15 downloads15 reach5 impact
19254100 instances - 2331 features - classes - 0 missing values
Dataset showing Data from matches played RB Leipzig prior to 14.06.2020
0 runs0 likes0 downloads0 reach0 impact
102 instances - 1 features - classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-subjects-stemmed * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set…
71 runs0 likes2 downloads2 reach7 impact
64 instances - 230 features - 2 classes - 0 missing values
Source: http://www.ijcaonline.org/archives/volume47/number18/7291-0509 Data Set Information: In this paper, we look for to recognize the causes of users tend to cyber space in Kohkiloye and Boyer…
373 runs0 likes7 downloads7 reach7 impact
100 instances - 6 features - 2 classes - 0 missing values
Originally from the StatLog project. The raw data is still available on [UCI](https://archive.ics.uci.edu/ml/datasets/Molecular+Biology+(Splice-junction+Gene+Sequences)). The data consists of 3,186…
7055 runs0 likes4 downloads4 reach18 impact
3186 instances - 181 features - 3 classes - 0 missing values
### Description The data consists of real historical data collected from 2010 & 2011. Employees are manually allowed or denied access to resources over time. The data is used to create an algorithm…
35323 runs0 likes18 downloads18 reach21 impact
32769 instances - 10 features - 2 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 42138 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 42138 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 42138 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 42138 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 128136 missing values
This database was derived from a simple hierarchical decision model originally developed for the demonstration of DEX (M. Bohanec, V. Rajkovic: Expert system for decision making. Sistemica 1(1), pp.…
7179 runs0 likes7 downloads7 reach16 impact
1728 instances - 7 features - 4 classes - 0 missing values
Pittsburgh bridges This version is derived from version 2 (the discretized version) by removing all instances with missing values in the last (target) attribute. The bridges dataset is originally not…
31 runs0 likes3 downloads3 reach12 impact
105 instances - 13 features - 6 classes - 61 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 128136 missing values
Salary Emp
0 runs0 likes0 downloads0 reach0 impact
31 instances - 2 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
8124 instances - 23 features - classes - 2480 missing values
Source: Rami Mustafa A Mohammad ( University of Huddersfield, rami.mohammad '@' hud.ac.uk, rami.mustafa.a '@' gmail.com) Lee McCluskey (University of Huddersfield,t.l.mccluskey '@' hud.ac.uk ) Fadi…
51512 runs1 likes23 downloads24 reach24 impact
11055 instances - 31 features - 2 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. The maps were scanned in 8 bit grey value at density of 400dpi,…
26228 runs0 likes17 downloads17 reach10 impact
2000 instances - 241 features - 10 classes - 0 missing values
1. Title: Nursery Database 2. Sources: (a) Creator: Vladislav Rajkovic et al. (13 experts) (b) Donors: Marko Bohanec (marko.bohanec@ijs.si) Blaz Zupan (blaz.zupan@ijs.si) (c) Date: June, 1997 3. Past…
2210 runs0 likes17 downloads17 reach9 impact
12960 instances - 9 features - 5 classes - 0 missing values
### Description This dataset describes mushrooms in terms of their physical characteristics. They are classified into: poisonous or edible. ### Source ``` (a) Origin: Mushroom records are drawn from…
16392 runs1 likes42 downloads43 reach10 impact
8124 instances - 23 features - 2 classes - 2480 missing values
1. Title: Postoperative Patient Data 2. Source Information: -- Creators: Sharon Summers, School of Nursing, University of Kansas Medical Center, Kansas City, KS 66160 Linda Woolery, School of Nursing,…
1758 runs0 likes10 downloads10 reach7 impact
90 instances - 9 features - 3 classes - 3 missing values
This is the large soybean database from the UCI repository, with its training and test database combined into a single file. There are 19 classes, only the first 15 of which have been used in prior…
40719 runs1 likes52 downloads53 reach10 impact
683 instances - 36 features - 19 classes - 2337 missing values
Primate splice-junction gene sequences (DNA) with associated imperfect domain theory. Splice junctions are points on a DNA sequence at which 'superfluous' DNA is removed during the process of protein…
24188 runs1 likes17 downloads18 reach7 impact
3190 instances - 61 features - 3 classes - 0 missing values
1. Title: 1984 United States Congressional Voting Records Database 2. Source Information: (a) Source: Congressional Quarterly Almanac, 98th Congress, 2nd session 1984, Volume XL: Congressional…
2262 runs0 likes17 downloads17 reach7 impact
435 instances - 17 features - 2 classes - 392 missing values
1. Title: INDUCE Trains Data set 2. Sources: - Donor: GMU, Center for AI, Software Librarian, Eric E. Bloedorn (bloedorn@aic.gmu.edu) - Original owners: Ryszard S. Michalski (michalski@aic.gmu.edu)…
1973 runs0 likes9 downloads9 reach12 impact
10 instances - 33 features - 2 classes - 51 missing values
This database encodes the complete set of possible board configurations at the end of tic-tac-toe games, where "x" is assumed to have played first. The target concept is "win for x" (i.e., true when…
386329 runs2 likes75 downloads77 reach8 impact
958 instances - 10 features - 2 classes - 0 missing values
No data.
7303 runs0 likes12 downloads12 reach9 impact
226 instances - 70 features - 24 classes - 317 missing values
Citation Request: This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
2009 runs1 likes35 downloads36 reach7 impact
286 instances - 10 features - 2 classes - 9 missing values
1. Title: Chess End-Game -- King+Rook versus King+Pawn on a7 (usually abbreviated KRKPA7). The pawn on a7 means it is one square away from queening. It is the King+Rook's side (white) to move. 2.…
273605 runs0 likes38 downloads38 reach11 impact
3196 instances - 37 features - 2 classes - 0 missing values
Citation Request: This primary tumor domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
1261 runs0 likes16 downloads16 reach9 impact
339 instances - 18 features - 21 classes - 225 missing values
1. Title: Space Shuttle Autolanding Domain 2. Sources: (a) Original source: unknown -- NASA: Mr. Roger Burke's autolander design team (b) Donor: Bojan Cestnik Jozef Stefan Institute Jamova 39 61000…
1466 runs0 likes9 downloads9 reach7 impact
15 instances - 7 features - 2 classes - 26 missing values
Compilation of promoters with known transcriptional start points for E. coli genes. The task is to recognize promoters in strings that represent nucleotides (one of A, G, T, or C). A promoter is a…
138 runs1 likes9 downloads10 reach9 impact
106 instances - 59 features - 2 classes - 0 missing values
Classify a chess game based on the position of the white king, the white rook and the black king.
1777 runs0 likes15 downloads15 reach7 impact
28056 instances - 7 features - 18 classes - 0 missing values
1. Title: Lung Cancer Data 2. Source Information: - Data was published in : Hong, Z.Q. and Yang, J.Y. "Optimal Discriminant Plane for a Small Number of Samples and Design Method of Classifier on the…
1238 runs0 likes18 downloads18 reach10 impact
32 instances - 57 features - 3 classes - 5 missing values
No data.
1038 runs0 likes9 downloads9 reach7 impact
55296 instances - 10 features - 3 classes - 0 missing values
No data.
1457 runs0 likes12 downloads12 reach7 impact
39366 instances - 10 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1030 runs0 likes8 downloads8 reach12 impact
132 instances - 4 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1116 runs0 likes9 downloads9 reach12 impact
120 instances - 4 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
394292 runs2 likes23 downloads25 reach35 impact
601 instances - 7 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
358449 runs2 likes18 downloads20 reach37 impact
556 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1252 runs0 likes9 downloads9 reach12 impact
130 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs0 likes6 downloads6 reach13 impact
364 instances - 33 features - 2 classes - 80 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
765 runs0 likes12 downloads12 reach13 impact
1728 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
728 runs0 likes7 downloads7 reach13 impact
2000 instances - 241 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
722 runs0 likes6 downloads6 reach13 impact
683 instances - 36 features - 2 classes - 2337 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
757 runs0 likes8 downloads8 reach13 impact
400 instances - 6 features - 2 classes - 0 missing values
SPECT heart data This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks. Sources: --…
1296 runs1 likes12 downloads13 reach14 impact
267 instances - 23 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
108666 runs1 likes14 downloads15 reach32 impact
554 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
709 runs0 likes9 downloads9 reach12 impact
48 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
135 runs0 likes9 downloads9 reach13 impact
3190 instances - 62 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
104 runs0 likes6 downloads6 reach13 impact
379 instances - 9 features - 2 classes - 1368 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
752 runs0 likes7 downloads7 reach13 impact
339 instances - 18 features - 2 classes - 225 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
622 runs0 likes6 downloads6 reach15 impact
10108 instances - 69 features - 2 classes - 2699 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
366 runs0 likes10 downloads10 reach12 impact
8844 instances - 61 features - 7 classes - 51515 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
173 runs0 likes6 downloads6 reach21 impact
106 instances - 59 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
652 runs0 likes16 downloads16 reach13 impact
12960 instances - 9 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
717 runs0 likes5 downloads5 reach12 impact
90 instances - 9 features - 2 classes - 3 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
354 runs0 likes7 downloads7 reach12 impact
7485 instances - 61 features - 7 classes - 52048 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach11 impact
379 instances - 9 features - 4 classes - 1418 missing values
One of the datasets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff. It contains data on the DMFT Index (Decayed, Missing, and Filled Teeth) before and after different prevention…
27710 runs0 likes11 downloads11 reach40 impact
797 instances - 5 features - 6 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
726 runs0 likes10 downloads10 reach13 impact
576 instances - 12 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1202 runs0 likes9 downloads9 reach12 impact
100 instances - 4 features - 2 classes - 0 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
434 runs0 likes10 downloads10 reach12 impact
7019 instances - 61 features - 8 classes - 48089 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1058 runs0 likes9 downloads9 reach13 impact
167 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
106 runs0 likes5 downloads5 reach12 impact
76 instances - 46 features - 2 classes - 22 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
721 runs0 likes5 downloads5 reach13 impact
226 instances - 70 features - 2 classes - 317 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
490 runs0 likes4 downloads4 reach11 impact
364 instances - 33 features - 6 classes - 101 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
774 runs0 likes9 downloads9 reach13 impact
797 instances - 5 features - 2 classes - 0 missing values
1. Title: Ozone Level Detection 2. Source: Kun Zhang zhang.kun05 '@' gmail.com Department of Computer Science, Xavier University of Lousiana Wei Fan wei.fan '@' gmail.com IBM T.J.Watson Research…
0 runs0 likes1 downloads1 reach5 impact
2536 instances - 73 features - 0 classes - 0 missing values
------------------------------------------------------------------------------- TIME SERIES USED IN LONG-MEMORY PROCESSES, THE ALLAN VARIANCE AND WAVELETS BY D. B. PERCIVAL AND P. GUTTORP, A CHAPTER…
0 runs0 likes1 downloads1 reach5 impact
6875 instances - 1 features - 0 classes - 0 missing values