OpenML
Data Description: Historical price data for the FOREX pair EUR/GBP from Dukascopy. Each instance (row) is a one-hour candlestick. The dataset covers the date range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
43825 instances - 12 features - 2 classes - 0 missing values
Data Description: Historical price data for the FOREX pair AUD/NZD from Dukascopy. Each instance (row) is a one-hour candlestick. The dataset covers the date range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
43825 instances - 12 features - 2 classes - 0 missing values
Data Description: Historical price data for the FOREX pair EUR/NZD from Dukascopy. Each instance (row) is a one-day candlestick. The dataset covers the date range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
1832 instances - 12 features - 2 classes - 0 missing values
Data Description: Historical price data for the FOREX pair EUR/AUD from Dukascopy. Each instance (row) is a one-hour candlestick. The dataset covers the date range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
43825 instances - 12 features - 2 classes - 0 missing values
Data Description: Historical price data for the FOREX pair USD/CAD from Dukascopy. Each instance (row) is a one-hour candlestick. The dataset covers the date range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
43825 instances - 12 features - 2 classes - 0 missing values
Data Description: Historical price data for the FOREX pair NZD/USD from Dukascopy. Each instance (row) is a one-day candlestick. The dataset covers the date range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
1828 instances - 12 features - 2 classes - 0 missing values
Data Description: Historical price data for the FOREX pair NZD/USD from Dukascopy. Each instance (row) is a one-day candlestick. The dataset covers the date range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
1828 instances - 12 features - 2 classes - 0 missing values
Forest Fires Data Set: This is a difficult regression task, where the aim is to predict the burned area of forest fires in the northeast region of Portugal using meteorological and other data.…
0 runs0 likes0 downloads0 reach8 impact
517 instances - 13 features - 0 classes - 0 missing values
Bike sharing systems are a new generation of traditional bike rentals in which the whole process, from membership to rental and return, has become automatic. Through these systems, a user is able to easily rent…
0 runs0 likes0 downloads0 reach2 impact
17379 instances - 13 features - 0 classes - 0 missing values
This is an artificial data set described in Breiman et al. (1984,p.238) (with variance 1 instead of 2). Generate the values of the 10 attributes independently using the following probabilities: P(X_1…
2 runs1 likes4 downloads5 reach10 impact
40768 instances - 11 features - 0 classes - 0 missing values
Citation Request: This dataset is publicly available for research. The details are described in [Cortez et al., 2009]. Please include this citation if you plan to use this database: P. Cortez, A.…
64 runs2 likes6 downloads8 reach16 impact
4898 instances - 12 features - 7 classes - 0 missing values
1. Title: Social Workers Decisions (Ordinal SWD) 2. Source Information: Donor: Arie Ben David MIS, Dept. of Technology Management Holon Academic Inst. of Technology 52 Golomb St. Holon 58102 Israel…
0 runs0 likes1 downloads1 reach13 impact
1000 instances - 11 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
500 instances - 11 features - 0 classes - 0 missing values
No data.
0 runs0 likes1 downloads1 reach12 impact
177147 instances - 11 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
250 instances - 11 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
100 instances - 11 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
250 instances - 11 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
250 instances - 11 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach9 impact
177147 instances - 11 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
100 instances - 11 features - 0 classes - 0 missing values
Balanced version of click prediction data
36 runs0 likes14 downloads14 reach13 impact
1997410 instances - 12 features - 2 classes - 0 missing values
File README: chscase. A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach13 impact
27 instances - 11 features - 0 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
313 runs0 likes36 downloads36 reach15 impact
399482 instances - 12 features - 2 classes - 0 missing values
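The downsampling mentioned in this entry can be illustrated with a minimal sketch. It is not the procedure used to build the dataset: the function name `downsample_majority`, the `keep_fraction` parameter, the fixed seed, and the toy rows are all hypothetical, chosen only to show the general idea of keeping every minority row and a random subset of majority rows.

```python
import random

def downsample_majority(rows, label_key, majority_value, keep_fraction, seed=0):
    """Keep every minority row and a random fraction of the majority rows."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    majority = [r for r in rows if r[label_key] == majority_value]
    minority = [r for r in rows if r[label_key] != majority_value]
    n_keep = int(len(majority) * keep_fraction)
    # Sample majority rows without replacement.
    return minority + rng.sample(majority, n_keep)

# Toy data (hypothetical): 90 rows with click=0 and 10 rows with click=1.
rows = [{"click": 0} for _ in range(90)] + [{"click": 1} for _ in range(10)]
balanced = downsample_majority(rows, "click", 0, keep_fraction=0.5)
```

With these toy inputs the result keeps all 10 clicked rows and 45 of the 90 non-clicked rows.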
Even smaller sample of version 1
0 runs0 likes4 downloads4 reach12 impact
149639 instances - 12 features - 2 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
100 instances - 11 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach12 impact
1000000 instances - 19 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
500 instances - 11 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
107 runs0 likes4 downloads4 reach15 impact
66 instances - 12 features - 2 classes - 0 missing values
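The mean-based binarization described in this entry (and in the other "Binarized version" entries in this listing) can be sketched as follows. The description is truncated, so the direction of the comparison and the label names are assumptions: here values strictly below the mean get the hypothetical label "P" and all others get "N".

```python
from statistics import mean

def binarize_by_mean(targets, low_label="P", high_label="N"):
    """Turn a numeric target into two classes by comparing with its mean."""
    threshold = mean(targets)
    # Assumption: values strictly below the mean receive low_label;
    # the listing truncates this detail, so the real rule may differ.
    return [low_label if t < threshold else high_label for t in targets]

# Toy target values (hypothetical); their mean is 4.0.
labels = binarize_by_mean([1.0, 2.0, 3.0, 10.0])
```

A single large value pulls the mean upward, so most instances fall into the "below mean" class, which is one reason such binarized targets can be imbalanced.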
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes1 downloads1 reach13 impact
1000 instances - 11 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
100 instances - 11 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach13 impact
250 instances - 11 features - 0 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
153 runs0 likes8 downloads8 reach14 impact
81 instances - 12 features - 3 classes - 0 missing values
Donated by P. Savicky, Institute of Computer Science, AS of CR, Czech Republic Methods for multidimensional event classification: a case study using images from a Cherenkov gamma-ray telescope.…
64659 runs1 likes29 downloads30 reach25 impact
19020 instances - 12 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
131 runs1 likes9 downloads10 reach15 impact
990 instances - 14 features - 2 classes - 0 missing values
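For the majority-class binarization described in this entry, a minimal sketch might look like the following. The positive label 'P' comes from the description; the negative label "N" and the function name are assumptions, since the description is truncated before it states how the remaining classes are labeled.

```python
from collections import Counter

def binarize_by_majority(labels, positive="P", negative="N"):
    """Relabel the most frequent class as positive and all others as negative."""
    majority, _count = Counter(labels).most_common(1)[0]
    # Assumption: every non-majority class collapses into one negative label.
    return [positive if y == majority else negative for y in labels]

# Toy multi-class labels (hypothetical); "a" is the majority class.
binary = binarize_by_majority(["a", "b", "a", "c", "a"])
```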
* Abstract: Purpose is to predict poker hands * Source - Creators: Robert Cattral (cattral '@' gmail.com) Franz Oppacher (oppacher '@' scs.carleton.ca) Carleton University, Department of Computer…
1 runs0 likes5 downloads5 reach15 impact
1025009 instances - 11 features - 10 classes - 0 missing values
* Abstract: 9-class version of the poker-hand dataset, with the minority class removed.
1 runs0 likes3 downloads3 reach14 impact
1025000 instances - 11 features - 9 classes - 0 missing values
No data.
44 runs0 likes1 downloads1 reach9 impact
1000000 instances - 13 features - 11 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Energy Building dataset (Tsanas and Xifara 2012) concerns the prediction of the heating load…
0 runs0 likes0 downloads0 reach9 impact
768 instances - 10 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Concrete Slump dataset (Yeh 2007) concerns the prediction of three properties of concrete…
0 runs1 likes0 downloads1 reach9 impact
103 instances - 10 features - classes - 0 missing values
#study_1
0 runs0 likes0 downloads0 reach10 impact
944 instances - 17 features - classes - 0 missing values
The happiness scores and rankings use data from the Gallup World Poll. The scores are based on answers to the main life evaluation question asked in the poll. This question, known as the Cantril…
2 runs0 likes1 downloads1 reach12 impact
158 instances - 12 features - 0 classes - 0 missing values
The ILPD liver dataset from the OpenCC18 with the gender binary encoded so all features are numeric
1 runs0 likes0 downloads0 reach9 impact
583 instances - 11 features - 2 classes - 0 missing values
The proposed forecasting approach, which uses a deep learning model based on 1D convnets and bidirectional GRU, is tested on a database from the UCI Machine Learning Repository.
0 runs0 likes0 downloads0 reach9 impact
43800 instances - 10 features - classes - 0 missing values
Touch Signals
0 runs0 likes0 downloads0 reach6 impact
265 instances - 11 features - classes - 0 missing values
Touch samples 2
0 runs0 likes0 downloads0 reach8 impact
265 instances - 11 features - 8 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
15 instances - 10 features - 0 classes - 0 missing values
This dataset contains 3 more features compared to version 1 of the same dataset. Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by…
0 runs0 likes0 downloads0 reach13 impact
62 instances - 10 features - 0 classes - 38 missing values
Israeli lottery
0 runs0 likes0 downloads0 reach8 impact
1153 instances - 11 features - classes - 0 missing values
Normalized version of the Forest Covertype dataset (see version 1), so that the numerical values are between 0 and 1. Contains the forest cover type for 30 x 30 meter cells obtained from US Forest…
342 runs1 likes39 downloads40 reach11 impact
581012 instances - 55 features - 7 classes - 0 missing values
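The entry above only states that the numerical values lie between 0 and 1. One common way to achieve that is per-column min-max scaling, sketched below as an assumption; the listing does not confirm that this exact method was used, and the function name is hypothetical.

```python
def min_max_scale(column):
    """Rescale a list of numbers linearly so they span the range [0, 1]."""
    lo, hi = min(column), max(column)
    if hi == lo:
        # A constant column has no spread; map every value to 0.0.
        return [0.0 for _ in column]
    return [(x - lo) / (hi - lo) for x in column]

# Toy column (hypothetical values).
scaled = min_max_scale([10, 20, 40])
```

The smallest value maps to 0.0, the largest to 1.0, and intermediate values keep their relative spacing.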
No data.
308 runs0 likes2 downloads2 reach11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
309 runs0 likes3 downloads3 reach11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
This is the poker dataset, retrieved 2013-11-14 from the libSVM site. In addition to the preprocessing done there (see LibSVM site for details), this dataset was created as follows: -join test and…
23 runs0 likes18 downloads18 reach15 impact
1025010 instances - 11 features - 2 classes - 0 missing values
This file contains the data in "The MU284 Population" from Appendix B of the book "Model Assisted Survey Sampling" by Sarndal, Swensson and Wretman, published by Springer-Verlag, New York, 1992. The…
0 runs0 likes0 downloads0 reach13 impact
284 instances - 10 features - 0 classes - 0 missing values
No data.
305 runs0 likes2 downloads2 reach11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
298 runs0 likes3 downloads3 reach11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
This is a dataset obtained from the StatLib repository. Here is the included description: The data provided are daily stock prices from January 1988 through October 1991, for ten aerospace companies.…
5 runs1 likes9 downloads10 reach9 impact
950 instances - 10 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes1 downloads1 reach13 impact
25 instances - 10 features - 0 classes - 0 missing values
This file contains the weather information of Ankara from 01/01/1994 to 28/05/1998. From given features, the goal is to predict the mean temperature. 1. Max_temperature real [23.0, 100.0] 2.…
0 runs0 likes0 downloads0 reach8 impact
321 instances - 10 features - 0 classes - 0 missing values
This file contains the weather information of Izmir from 01/01/1994 to 31/12/1997. From given features, the goal is to predict the mean temperature. 1. Max_temperature real[36.7,105.0] 2.…
0 runs0 likes0 downloads0 reach8 impact
1461 instances - 10 features - 0 classes - 0 missing values
No data.
315 runs0 likes2 downloads2 reach11 impact
295245 instances - 11 features - 5 classes - 0 missing values
No data.
307 runs0 likes2 downloads2 reach11 impact
1000000 instances - 11 features - 5 classes - 0 missing values
No data.
310 runs0 likes4 downloads4 reach11 impact
1000000 instances - 11 features - 2 classes - 0 missing values
Synthetic dataset. Almost identical to [dataset 152](https://www.openml.org/d/153/edit)
319 runs0 likes4 downloads4 reach11 impact
1000000 instances - 11 features - 2 classes - 0 missing values
e fvr
0 runs0 likes0 downloads0 reach6 impact
2 instances - 11 features - classes - 0 missing values
Experiment data obtained by running random configurations of ranger through mlr on 119 different classification tasks from openml.
0 runs0 likes0 downloads0 reach7 impact
278863 instances - 16 features - classes - 138965 missing values
String datetime information extracted to numeric columns. Trip Record Data provided by the New York City Taxi and Limousine Commission (TLC)…
0 runs0 likes0 downloads0 reach0 impact
581835 instances - 19 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
19 instances - 10 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
15 instances - 10 features - 0 classes - 0 missing values
Data on the population density of tree pipits, Anthus trivialis, in Franconian oak forests including variables describing the forest ecosystem. This data is taken from R package coin. This study is…
0 runs0 likes2 downloads2 reach13 impact
86 instances - 10 features - 0 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Energy Building dataset (Tsanas and Xifara 2012) concerns the prediction of the heating load…
0 runs0 likes0 downloads0 reach9 impact
768 instances - 10 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Concrete Slump dataset (Yeh 2007) concerns the prediction of three properties of concrete…
0 runs1 likes0 downloads1 reach10 impact
103 instances - 10 features - classes - 0 missing values
File README: chscase. A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach13 impact
52 instances - 10 features - 0 classes - 0 missing values
No data.
0 runs0 likes3 downloads3 reach9 impact
59049 instances - 10 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes1 downloads1 reach13 impact
20 instances - 10 features - 0 classes - 0 missing values
This is the original version of the famous covertype dataset in ARFF format. Predicting forest cover type from cartographic variables only (no remotely sensed data). The actual forest cover type for a…
9 runs1 likes14 downloads15 reach23 impact
581012 instances - 55 features - 7 classes - 0 missing values
Speaker independent recognition of the eleven steady state vowels of British English using a specified training set of lpc derived log area ratios. Collected by David Deterding (data and…
26450 runs1 likes17 downloads18 reach43 impact
990 instances - 13 features - 11 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
984 runs0 likes8 downloads8 reach14 impact
100 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
773 runs0 likes8 downloads8 reach15 impact
250 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
791 runs0 likes6 downloads6 reach15 impact
250 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
988 runs0 likes9 downloads9 reach14 impact
100 instances - 11 features - 2 classes - 0 missing values
Dataset from "Pattern Recognition and Neural Networks" by B.D. Ripley, Cambridge University Press (1996), ISBN 0-521-46086-7. The background to the datasets is described in section 1.4; this file…
587 runs0 likes5 downloads5 reach14 impact
61 instances - 19 features - 4 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
567 runs0 likes14 downloads14 reach15 impact
40768 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1024 runs0 likes8 downloads8 reach14 impact
100 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
615 runs0 likes9 downloads9 reach15 impact
1000 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
753 runs0 likes8 downloads8 reach15 impact
508 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
747 runs0 likes6 downloads6 reach15 impact
200 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
575 runs0 likes9 downloads9 reach15 impact
1000 instances - 11 features - 2 classes - 0 missing values
1. Title of Database: Blocks Classification 2. Sources: (a) Donato Malerba Dipartimento di Informatica University of Bari via Orabona 4 70126 Bari - Italy phone: +39 - 80 - 5443269 fax: +39 - 80 -…
2719 runs0 likes18 downloads18 reach11 impact
5473 instances - 11 features - 5 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
785 runs0 likes7 downloads7 reach15 impact
500 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
773 runs0 likes6 downloads6 reach15 impact
250 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
985 runs0 likes8 downloads8 reach14 impact
100 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
797 runs0 likes7 downloads7 reach15 impact
500 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
814 runs0 likes7 downloads7 reach15 impact
500 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
791 runs0 likes6 downloads6 reach15 impact
250 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
723 runs0 likes5 downloads5 reach15 impact
418 instances - 19 features - 2 classes - 1239 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
625 runs0 likes10 downloads10 reach15 impact
1000 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
773 runs0 likes6 downloads6 reach15 impact
500 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
782 runs0 likes6 downloads6 reach15 impact
250 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
618 runs0 likes12 downloads12 reach15 impact
40768 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
970 runs0 likes8 downloads8 reach14 impact
100 instances - 11 features - 2 classes - 0 missing values