OpenML
Filter results by:
These weekly averages are ultimately based on measurements of 4 air samples per hour taken atop intake lines on several towers during steady periods of CO2 concentration of not less than 6 hours per…
0 runs1 likes1 downloads2 reach0 impact
2225 instances - 7 features - 0 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
406 runs1 likes11 downloads12 reach7 impact
4229 instances - 1618 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
131 runs1 likes9 downloads10 reach6 impact
990 instances - 14 features - 2 classes - 0 missing values
The data is cleaned, regularized and encrypted global equity data. The first 21 columns (feature1 - feature21) are features, and target is the binary class you’re trying to predict.
284 runs1 likes1 downloads2 reach4 impact
96320 instances - 22 features - 2 classes - 0 missing values
Abstract: CART book's waveform domains Source: Original Owners: Breiman,L., Friedman,J.H., Olshen,R.A., & Stone,C.J. (1984). Classification and Regression Trees. Wadsworth International Group:…
0 runs1 likes3 downloads4 reach2 impact
5000 instances - 22 features - classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs1 likes5 downloads6 reach6 impact
452 instances - 280 features - 2 classes - 408 missing values
Citation Request: This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
2006 runs1 likes30 downloads31 reach0 impact
286 instances - 10 features - 2 classes - 9 missing values
A dataset of steel plates' faults, classified into 7 different types. The goal was to train machine learning for automatic pattern recognition. The dataset consists of 27 features describing each…
276158 runs1 likes32 downloads33 reach15 impact
1941 instances - 34 features - 2 classes - 0 missing values
QSAR biodegradation Data Set * Abstract: Data set containing values for 41 attributes (molecular descriptors) used to classify 1055 chemicals into 2 classes (ready and not ready biodegradable). *…
260170 runs1 likes16 downloads17 reach16 impact
1055 instances - 42 features - 2 classes - 0 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code ID's were removed. ! Note that there is also a related Breast Cancer Wisconsin (Original) Data Set with a different set of…
221394 runs1 likes32 downloads33 reach16 impact
569 instances - 31 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2865 runs1 likes16 downloads17 reach15 impact
1545 instances - 10937 features - 2 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/ More infos: https://archive.ics.uci.edu/ml/datasets/Musk+(Version+2)
82514 runs1 likes17 downloads18 reach21 impact
6598 instances - 170 features - 2 classes - 0 missing values
### Description This dataset describes mushrooms in terms of their physical characteristics. They are classified into: poisonous or edible. ### Source ``` (a) Origin: Mushroom records are drawn from…
15619 runs1 likes36 downloads37 reach0 impact
8124 instances - 23 features - 2 classes - 2480 missing values
Abstract: This data has been prepared to analyze factors related to readmission as well as other outcomes pertaining to patients with diabetes. Source: The data are submitted on behalf of the Center…
0 runs1 likes13 downloads14 reach3 impact
101766 instances - 50 features - 3 classes - 0 missing values
Multi-label dataset. The image benchmark dataset consists of 2000 natural scene images. Zhou and Zhang (2007) extracted 135 features for each image and made it publicly available as processed image…
0 runs1 likes8 downloads9 reach2 impact
2000 instances - 140 features - 2 classes - 0 missing values
A dataset relating characteristics of telephony account features and usage and whether or not the customer churned. Originally used in [Discovering Knowledge in Data: An Introduction to Data…
4854 runs1 likes3 downloads4 reach11 impact
5000 instances - 21 features - 2 classes - 0 missing values
This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect the confidentiality of the data. This dataset is interesting because…
22109 runs1 likes28 downloads29 reach0 impact
690 instances - 16 features - 2 classes - 67 missing values
1. Title of Database: Wine recognition data Updated Sept 21, 1998 by C.Blake : Added attribute information 2. Sources: (a) Forina, M. et al, PARVUS - An Extendible Package for Data Exploration,…
1180 runs1 likes15 downloads16 reach0 impact
178 instances - 14 features - 3 classes - 0 missing values
No data.
794 runs1 likes13 downloads14 reach5 impact
107 instances - 30 features - 2 classes - 0 missing values
General Description of Thyroid Disease Databases and Related Files This directory contains 6 databases, corresponding test set, and corresponding documentation. They were left at the University of…
31 runs1 likes8 downloads9 reach3 impact
2800 instances - 27 features - 5 classes - 0 missing values
General Description of Thyroid Disease Databases and Related Files This directory contains 6 databases, corresponding test set, and corresponding documentation. They were left at the University of…
31 runs1 likes8 downloads9 reach3 impact
2800 instances - 27 features - 5 classes - 0 missing values
1. TITLE: Letter Image Recognition Data The objective is to identify each of a large number of black-and-white rectangular pixel displays as one of the 26 capital letters in the English alphabet. The…
64269 runs1 likes68 downloads69 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
This database encodes the complete set of possible board configurations at the end of tic-tac-toe games, where "x" is assumed to have played first. The target concept is "win for x" (i.e., true when…
379940 runs1 likes47 downloads48 reach0 impact
958 instances - 10 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
357647 runs1 likes16 downloads17 reach26 impact
556 instances - 7 features - 2 classes - 0 missing values
No data.
68 runs0 likes3 downloads3 reach0 impact
1000000 instances - 21 features - 2 classes - 0 missing values
No data.
67 runs0 likes2 downloads2 reach0 impact
1000000 instances - 17 features - 10 classes - 0 missing values
No data.
65 runs0 likes4 downloads4 reach0 impact
1000000 instances - 40 features - 2 classes - 0 missing values
No data.
66 runs0 likes3 downloads3 reach0 impact
1000000 instances - 35 features - 6 classes - 0 missing values
No data.
211 runs0 likes3 downloads3 reach0 impact
1000000 instances - 20 features - 7 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26854 runs0 likes17 downloads17 reach0 impact
2000 instances - 217 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26672 runs0 likes10 downloads10 reach0 impact
2000 instances - 77 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26665 runs0 likes19 downloads19 reach0 impact
2000 instances - 65 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
27245 runs0 likes16 downloads16 reach0 impact
2000 instances - 7 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. The maps were scanned in 8 bit grey value at density of 400dpi,…
22666 runs0 likes17 downloads17 reach0 impact
2000 instances - 241 features - 10 classes - 0 missing values
No data.
66 runs0 likes2 downloads2 reach0 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
324 runs0 likes5 downloads5 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
71 runs0 likes5 downloads5 reach0 impact
1000000 instances - 17 features - 2 classes - 0 missing values
No data.
60 runs0 likes2 downloads2 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
63 runs0 likes3 downloads3 reach0 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
63 runs0 likes4 downloads4 reach0 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
68 runs0 likes10 downloads10 reach0 impact
1000000 instances - 10 features - 2 classes - 0 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1787 runs0 likes9 downloads9 reach0 impact
294 instances - 14 features - 2 classes - 782 missing values
1. Title: INDUCE Trains Data set 2. Sources: - Donor: GMU, Center for AI, Software Librarian, Eric E. Bloedorn (bloedorn@aic.gmu.edu) - Original owners: Ryszard S. Michalski (michalski@aic.gmu.edu)…
1973 runs0 likes9 downloads9 reach0 impact
10 instances - 33 features - 2 classes - 51 missing values
This database contains 13 attributes (which have been extracted from a larger set of 75) Attribute Information: ------------------------ -- 1. age -- 2. sex -- 3. chest pain type (4 values) -- 4.…
3208 runs0 likes17 downloads17 reach0 impact
270 instances - 14 features - 2 classes - 0 missing values
1. Title: Hepatitis Domain 2. Sources: (a) unknown (b) Donor: G.Gong (Carnegie-Mellon University) via Bojan Cestnik Jozef Stefan Institute Jamova 39 61000 Ljubljana Yugoslavia (tel.: (38)(+61) 214-399…
2090 runs0 likes10 downloads10 reach0 impact
155 instances - 20 features - 2 classes - 167 missing values
1. Title: 1984 United States Congressional Voting Records Database 2. Source Information: (a) Source: Congressional Quarterly Almanac, 98th Congress, 2nd session 1984, Volume XL: Congressional…
2250 runs0 likes16 downloads16 reach0 impact
435 instances - 17 features - 2 classes - 392 missing values
; ; Thyroid disease records supplied by the Garavan Institute and J. Ross ; Quinlan, New South Wales Institute, Syndney, Australia. ; ; 1987. ; hypothyroid, primary hypothyroid, compensated…
880 runs0 likes10 downloads10 reach0 impact
3772 instances - 30 features - 4 classes - 6064 missing values
No data.
143 runs0 likes3 downloads3 reach0 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
7302 runs0 likes10 downloads10 reach0 impact
226 instances - 70 features - 24 classes - 317 missing values
No data.
50 runs0 likes1 downloads1 reach0 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
67 runs0 likes3 downloads3 reach0 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
66 runs0 likes3 downloads3 reach0 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
1038 runs0 likes8 downloads8 reach0 impact
55296 instances - 10 features - 3 classes - 0 missing values
1. Title: Postoperative Patient Data 2. Source Information: -- Creators: Sharon Summers, School of Nursing, University of Kansas Medical Center, Kansas City, KS 66160 Linda Woolery, School of Nursing,…
1758 runs0 likes9 downloads9 reach0 impact
90 instances - 9 features - 3 classes - 3 missing values
1. Title: Dermatology Database 2. Source Information: (a) Original owners: -- 1. Nilsel Ilter, M.D., Ph.D., Gazi University, School of Medicine 06510 Ankara, Turkey Phone: +90 (312) 214 1080 -- 2. H.…
1752 runs0 likes13 downloads13 reach0 impact
366 instances - 35 features - 6 classes - 8 missing values
The instances were drawn randomly from a database of 7 outdoor images. The images were hand-segmented to create a classification for every pixel. Each instance is a 3x3 region. ### Attribute…
19234 runs0 likes22 downloads22 reach0 impact
2310 instances - 20 features - 7 classes - 0 missing values
1. Title: Protein Localization Sites 2. Creator and Maintainer: Kenta Nakai Institue of Molecular and Cellular Biology Osaka, University 1-3 Yamada-oka, Suita 565 Japan nakai@imcb.osaka-u.ac.jp…
1799 runs0 likes12 downloads12 reach0 impact
336 instances - 8 features - 8 classes - 0 missing values
1. Title: Glass Identification Database 2. Sources: (a) Creator: B. German -- Central Research Establishment Home Office Forensic Science Service Aldermaston, Reading, Berkshire RG7 4PN (b) Donor:…
1772 runs0 likes49 downloads49 reach0 impact
214 instances - 10 features - 6 classes - 0 missing values
This is the large soybean database from the UCI repository, with its training and test database combined into a single file. There are 19 classes, only the first 15 of which have been used in prior…
37282 runs0 likes50 downloads50 reach0 impact
683 instances - 36 features - 19 classes - 2337 missing values
Primate splice-junction gene sequences (DNA) with associated imperfect domain theory. Splice junctions are points on a DNA sequence at which 'superfluous' DNA is removed during the process of protein…
15852 runs0 likes14 downloads14 reach0 impact
3190 instances - 62 features - 3 classes - 0 missing values
1. Title: Teaching Assistant Evaluation 2. Sources: (a) Collector: Wei-Yin Loh (Department of Statistics, UW-Madison) (b) Donor: Tjen-Sien Lim (limt@stat.wisc.edu) (b) Date: June 7, 1997 3. Past…
2028 runs0 likes12 downloads12 reach0 impact
151 instances - 6 features - 3 classes - 0 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1761 runs0 likes10 downloads10 reach0 impact
303 instances - 14 features - 2 classes - 7 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) Database of surgeries on horses. Possible class attributes: 24 (whether lesion is surgical), others include: 23, 25, 26, and 27 Notes: * Hospital_Number…
233 runs0 likes8 downloads8 reach0 impact
368 instances - 28 features - 2 classes - 1927 missing values
1. Title: Nursery Database 2. Sources: (a) Creator: Vladislav Rajkovic et al. (13 experts) (b) Donors: Marko Bohanec (marko.bohanec@ijs.si) Blaz Zupan (blaz.zupan@ijs.si) (c) Date: June, 1997 3. Past…
2210 runs0 likes15 downloads15 reach0 impact
12960 instances - 9 features - 5 classes - 0 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) In this version (version 2), some features were removed. It is unclear why of how this was done.
1883 runs0 likes9 downloads9 reach0 impact
368 instances - 23 features - 2 classes - 1927 missing values
1. Title of Database: Blocks Classification 2. Sources: (a) Donato Malerba Dipartimento di Informatica University of Bari via Orabona 4 70126 Bari - Italy phone: +39 - 80 - 5443269 fax: +39 - 80 -…
2719 runs0 likes17 downloads17 reach0 impact
5473 instances - 11 features - 5 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26776 runs0 likes21 downloads21 reach0 impact
2000 instances - 48 features - 10 classes - 0 missing values
1. Title: Contraceptive Method Choice 2. Sources: (a) Origin: This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey (b) Creator: Tjen-Sien Lim (limt@stat.wisc.edu)…
18003 runs0 likes17 downloads17 reach0 impact
1473 instances - 10 features - 3 classes - 0 missing values
No data.
68 runs0 likes2 downloads2 reach0 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
69 runs0 likes4 downloads4 reach0 impact
1000000 instances - 20 features - 2 classes - 0 missing values
No data.
356 runs0 likes7 downloads7 reach0 impact
131072 instances - 17 features - 2 classes - 0 missing values
No data.
65 runs0 likes5 downloads5 reach0 impact
1000000 instances - 30 features - 4 classes - 0 missing values
No data.
230 runs0 likes4 downloads4 reach0 impact
1000000 instances - 35 features - 2 classes - 0 missing values
No data.
63 runs0 likes2 downloads2 reach0 impact
1000000 instances - 41 features - 3 classes - 0 missing values
Dataset created to study concept drift in stream mining. It is constructed by combining the Covertype, Poker-Hand, and Electricity datasets. More details can be found in: Albert Bifet, Geoff Holmes,…
332 runs0 likes26 downloads26 reach0 impact
1455525 instances - 73 features - 10 classes - 0 missing values
No data.
73 runs0 likes5 downloads5 reach0 impact
1000000 instances - 30 features - 2 classes - 0 missing values
No data.
50 runs0 likes3 downloads3 reach0 impact
1000000 instances - 61 features - 2 classes - 0 missing values
No data.
219 runs0 likes4 downloads4 reach0 impact
1000000 instances - 58 features - 2 classes - 0 missing values
No data.
66 runs0 likes2 downloads2 reach0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
1457 runs0 likes12 downloads12 reach0 impact
39366 instances - 10 features - 2 classes - 0 missing values
No data.
66 runs0 likes2 downloads2 reach0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
334 runs0 likes4 downloads4 reach0 impact
1000000 instances - 33 features - 2 classes - 0 missing values
No data.
70 runs0 likes2 downloads2 reach0 impact
1000000 instances - 14 features - 2 classes - 0 missing values
Citation Request: This primary tumor domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
1261 runs0 likes13 downloads13 reach0 impact
339 instances - 18 features - 21 classes - 225 missing values
1. Title: Space Shuttle Autolanding Domain 2. Sources: (a) Original source: unknown -- NASA: Mr. Roger Burke's autolander design team (b) Donor: Bojan Cestnik Jozef Stefan Institute Jamova 39 61000…
1466 runs0 likes9 downloads9 reach0 impact
15 instances - 7 features - 2 classes - 26 missing values
No data.
2193 runs0 likes15 downloads15 reach0 impact
1484 instances - 9 features - 10 classes - 0 missing values
1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and Fisheries, Tasmania…
34900 runs0 likes17 downloads17 reach0 impact
4177 instances - 9 features - 28 classes - 0 missing values
No data.
1777 runs0 likes15 downloads15 reach0 impact
28056 instances - 7 features - 18 classes - 0 missing values
The objective was to determine which seedlots in a species are best for soil conservation in seasonally dry hill country. Determination is found by measurement of height, diameter by height, survival,…
21607 runs0 likes9 downloads9 reach0 impact
736 instances - 20 features - 5 classes - 448 missing values
This is data set is concerned with the forward kinematics of an 8 link robot arm. Among the existing variants of this data set we have used the variant 8nm, which is known to be highly non-linear and…
19 runs0 likes7 downloads7 reach0 impact
8192 instances - 9 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
4 runs0 likes1 downloads1 reach0 impact
61 instances - 3 features - 0 classes - 0 missing values
1. Title: Wisconsin Prognostic Breast Cancer (WPBC) 2. Source Information a) Creators: Dr. William H. Wolberg, General Surgery Dept., University of Wisconsin, Clinical Sciences Center, Madison, WI…
5 runs0 likes4 downloads4 reach0 impact
194 instances - 33 features - 0 classes - 0 missing values
Donor: David W. Aha (aha@ics.uci.edu) This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one…
36 runs0 likes5 downloads5 reach0 impact
303 instances - 14 features - 0 classes - 6 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Identifier attribute deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
0 runs0 likes2 downloads2 reach0 impact
398 instances - 8 features - 0 classes - 6 missing values
This data set is also obtained from the task of controlling the ailerons of a F16 aircraft, although the target variable and attributes are different from the ailerons domain. The target variable here…
2 runs0 likes3 downloads3 reach0 impact
9517 instances - 7 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Identifier attribute deleted. !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! NAME: Sexual activity and the lifespan of male fruitflies TYPE: Designed (almost factorial)…
4 runs0 likes1 downloads1 reach0 impact
125 instances - 5 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Case number deleted. X treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric…
10 runs0 likes1 downloads1 reach0 impact
418 instances - 19 features - 0 classes - 1239 missing values
No data.
328 runs0 likes3 downloads3 reach0 impact
1000000 instances - 4 features - 2 classes - 0 missing values
1. Title: Lung Cancer Data 2. Source Information: - Data was published in : Hong, Z.Q. and Yang, J.Y. "Optimal Discriminant Plane for a Small Number of Samples and Design Method of Classifier on the…
1238 runs0 likes16 downloads16 reach0 impact
32 instances - 57 features - 3 classes - 5 missing values
No data.
310 runs0 likes4 downloads4 reach0 impact
1000000 instances - 11 features - 2 classes - 0 missing values