Data
Filter results by:
# Data Description This is the historical price data of the FOREX EUR/CAD from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach9 impact
1834 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX AUD/USD from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach9 impact
1834 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX USD/CAD from Dukascopy. One instance (row) is one candlestick of one hour. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach9 impact
43825 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX USD/DKK from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach9 impact
1832 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/CAD from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach9 impact
1834 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/JPY from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach9 impact
1832 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX AUD/CHF from Dukascopy. One instance (row) is one candlestick of one hour. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach9 impact
43825 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/USD from Dukascopy. One instance (row) is one candlestick of one minute. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach9 impact
375840 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/CAD from Dukascopy. One instance (row) is one candlestick of one hour. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach9 impact
43825 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/NZD from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach9 impact
1832 instances - 12 features - 2 classes - 0 missing values
GAMETES_Epistasis_2-Way_20atts_0.1H_EDM-1_1-pmlb
31 runs0 likes1 downloads1 reach22 impact
1600 instances - 21 features - 2 classes - 0 missing values
GAMETES_Epistasis_2-Way_20atts_0.4H_EDM-1_1-pmlb
31 runs0 likes1 downloads1 reach22 impact
1600 instances - 21 features - 2 classes - 0 missing values
GAMETES_Epistasis_3-Way_20atts_0.2H_EDM-1_1-pmlb
31 runs0 likes1 downloads1 reach22 impact
1600 instances - 21 features - 2 classes - 0 missing values
GAMETES_Heterogeneity_20atts_1600_Het_0.4_0.2_50_EDM-2_001-pmlb
0 runs0 likes1 downloads1 reach22 impact
1600 instances - 21 features - 2 classes - 0 missing values
GAMETES_Heterogeneity_20atts_1600_Het_0.4_0.2_75_EDM-2_001-pmlb
31 runs0 likes1 downloads1 reach22 impact
1600 instances - 21 features - 2 classes - 0 missing values
calendarDOW-pmlb
31 runs0 likes1 downloads1 reach21 impact
399 instances - 33 features - 5 classes - 0 missing values
Derived from the Musk dataset: https://www.openml.org/d/1116
31 runs0 likes1 downloads1 reach22 impact
476 instances - 169 features - 2 classes - 0 missing values
Derived from the Musk dataset: https://www.openml.org/d/1116
31 runs0 likes1 downloads1 reach22 impact
6598 instances - 169 features - 2 classes - 0 missing values
corral-pmlb
31 runs0 likes1 downloads1 reach22 impact
160 instances - 7 features - 2 classes - 0 missing values
flare-pmlb
32 runs0 likes1 downloads1 reach22 impact
1066 instances - 11 features - 2 classes - 0 missing values
PMLB version of the Titanic dataset, which only uses 3 features. See version 1 for the complete version: https://www.openml.org/d/40945
35 runs0 likes1 downloads1 reach23 impact
2201 instances - 4 features - 2 classes - 0 missing values
)), [PMLB](https://github.com/EpistasisLab/penn-ml-benchmarks/tree/master/datasets/classification/tokyo1) This is Performance co-pilot (PCP) data for the Tokyo server at Silicon Graphics International…
37 runs0 likes1 downloads1 reach22 impact
959 instances - 45 features - 2 classes - 0 missing values
allrep-pmlb
31 runs0 likes1 downloads1 reach21 impact
3772 instances - 30 features - 4 classes - 0 missing values
cleve-pmlb
32 runs0 likes1 downloads1 reach21 impact
303 instances - 14 features - 2 classes - 0 missing values
ecoli-pmlb
31 runs0 likes1 downloads1 reach21 impact
327 instances - 8 features - 5 classes - 0 missing values
cleveland-nominal-pmlb
31 runs0 likes1 downloads1 reach21 impact
303 instances - 8 features - 5 classes - 0 missing values
dis-pmlb
31 runs0 likes1 downloads1 reach22 impact
3772 instances - 30 features - 2 classes - 0 missing values
mux6-pmlb
31 runs0 likes1 downloads1 reach21 impact
128 instances - 7 features - 2 classes - 0 missing values
postoperative-patient-data-pmlb
26 runs0 likes1 downloads1 reach21 impact
88 instances - 9 features - 2 classes - 0 missing values
Relevant Information: -- The database contains 3 potential classes, one for the number of times a certain type of solar flare occured in a 24 hour period. -- Each instance represents captured features…
31 runs0 likes1 downloads1 reach21 impact
315 instances - 13 features - 5 classes - 0 missing values
Relevant Information: -- The database contains 3 potential classes, one for the number of times a certain type of solar flare occured in a 24 hour period. -- Each instance represents captured features…
31 runs0 likes1 downloads1 reach21 impact
1066 instances - 13 features - 6 classes - 0 missing values
__Major change w.r.t. version 1: updated data type of binary variables to factor type.__ Dataset from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch), which…
0 runs0 likes1 downloads1 reach10 impact
4562 instances - 49 features - classes - 0 missing values
The database covers all the international short track games in the last 5 years. Currently it contains only men's 500m. Detailed lap data including personal time and ranking in each game from seasons…
0 runs0 likes1 downloads1 reach10 impact
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
11 runs0 likes1 downloads1 reach13 impact
44819 instances - 47 features - 3 classes - 10584 missing values
EMNIST Balanced https://www.nist.gov/itl/iad/image-group/emnist-dataset
73 runs0 likes1 downloads1 reach18 impact
131600 instances - 785 features - 47 classes - 0 missing values
Much of machine learning research focuses on producing models which perform well on benchmark tasks, in turn improving our understanding of the challenges associated with those tasks. From the…
0 runs0 likes1 downloads1 reach11 impact
70000 instances - 785 features - 10 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
0 runs0 likes1 downloads1 reach11 impact
51839 instances - 1569 features - 43 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
0 runs0 likes1 downloads1 reach11 impact
51839 instances - 1569 features - 43 classes - 0 missing values
This collection includes 21 data sets of one-dimensional ultrasound raw RF data (A-Scans) acquired from the calf muscles of 8 healthy volunteers. The subjects were asked to manually annotate the data…
0 runs0 likes1 downloads1 reach8 impact
212872 instances - 4 features - classes - 0 missing values
source: http://www.cs.ubc.ca/labs/beta/Projects/SATzilla/ authors: L. Xu, F. Hutter, H. Hoos, K. Leyton-Brown translator in coseal format: M. Lindauer with the help of Alexandre Frechette the data do…
0 runs0 likes1 downloads1 reach9 impact
4440 instances - 117 features - 0 classes - 27150 missing values
test
0 runs0 likes1 downloads1 reach9 impact
1994 instances - 127 features - 0 classes - 39202 missing values
Elegibilidade ecommerce
0 runs0 likes1 downloads1 reach8 impact
269177 instances - 2 features - 2 classes - 0 missing values
source: An Algorithm Selection Benchmark for the Container Pre-Marshalling Problem (CPMP) authors: K. Tierney and Y. Malitsky (features) / K. Tierney and D. Pacino and S. Voss (algorithms) translator…
0 runs0 likes1 downloads1 reach8 impact
2108 instances - 24 features - 0 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Concrete Slump dataset (Yeh 2007) concerns the prediction of three properties of concrete…
0 runs1 likes0 downloads1 reach9 impact
103 instances - 10 features - classes - 0 missing values
The YouTube personality dataset consists of a collection of behavorial features, speech transcriptions, and personality impression scores for a set of 404 YouTube vloggers that explicitly show…
0 runs0 likes1 downloads1 reach9 impact
404 instances - 31 features - classes - 0 missing values
BitcoinHeist Ransomware Dataset Akcora, C.G., Li, Y., Gel, Y.R. and Kantarcioglu, M., 2019. BitcoinHeist. Topological Data Analysis for Ransomware Detection on the Bitcoin Blockchain. IJCAI-PRICAI…
0 runs1 likes0 downloads1 reach6 impact
2916697 instances - 10 features - 29 classes - 0 missing values
ede wey
0 runs0 likes1 downloads1 reach6 impact
589 instances - 2909 features - classes - 0 missing values
dd efrg
15 runs0 likes1 downloads1 reach15 impact
1556 instances - 5629 features - classes - 0 missing values
Data Set Information: This research aimed at the case of customers’ default payments in Taiwan and compares the predictive accuracy of probability of default among six data mining methods. From…
0 runs0 likes1 downloads1 reach7 impact
30000 instances - 24 features - 2 classes - 0 missing values
Data used in an analysis of the Brown and Frown corpora for my doctoral dissertation titled ``Variations in Written English: Characterizing Authors' Rhetorical Language Choices Across Corpora of…
2048 runs0 likes1 downloads1 reach12 impact
1000 instances - 24 features - 30 classes - 0 missing values
This file holds global land temperatures by country
0 runs0 likes1 downloads1 reach10 impact
577462 instances - 4 features - classes - 64563 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
10 runs0 likes1 downloads1 reach20 impact
20000 instances - 4297 features - 2 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
13 runs0 likes1 downloads1 reach19 impact
58310 instances - 181 features - 10 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
12 runs0 likes1 downloads1 reach19 impact
83733 instances - 55 features - 4 classes - 0 missing values
Sensor data measurements of one Boiler, containing WaterInput/SteamOutput (flow, temperature, pressure) for one month, which is measured every minute.
0 runs0 likes1 downloads1 reach9 impact
44643 instances - 8 features - classes - 44643 missing values
Klaverjas is an example of the Jack-Nine card games, which are characterized as trick-taking games where the the Jack and nine of the trump suit are the highest-ranking trumps, and the tens and aces…
0 runs0 likes1 downloads1 reach10 impact
981541 instances - 33 features - 2 classes - 0 missing values
student performance analysis 1
0 runs0 likes1 downloads1 reach6 impact
3892 instances - 36 features - classes - 0 missing values
Rows with NaN and inf values removed. Converted file format from CSV to ARFF.
0 runs0 likes1 downloads1 reach6 impact
18982 instances - 80 features - 5 classes - 0 missing values
test
0 runs0 likes1 downloads1 reach3 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes1 downloads1 reach6 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes1 downloads1 reach6 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes1 downloads1 reach6 impact
1000 instances - 21 features - classes - 0 missing values
Automated file upload of BNG(segment)
99 runs0 likes1 downloads1 reach12 impact
1000000 instances - 20 features - 7 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
0 runs0 likes1 downloads1 reach17 impact
425240 instances - 79 features - 2 classes - 2734000 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
1 runs0 likes1 downloads1 reach17 impact
4147 instances - 49 features - 2 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
0 runs0 likes1 downloads1 reach16 impact
100 instances - 10001 features - 2 classes - 0 missing values
SVHN is a real-world image dataset for developing machine learning and object recognition algorithms with minimal requirement on data preprocessing and formatting. It can be seen as similar in flavor…
52 runs0 likes1 downloads1 reach16 impact
99289 instances - 3073 features - 10 classes - 0 missing values
The dataset and this description is made available on http://www-stat.stanford.edu/~tibs/ElemStatLearn/data.html. Normalized handwritten digits, automatically scanned from envelopes by the U.S. Postal…
57 runs0 likes1 downloads1 reach11 impact
9298 instances - 257 features - 10 classes - 0 missing values
The Sheffield (previously UMIST) Face Database consists of 564 images of 20 individuals (mixed race/gender/appearance). Each individual is shown in a range of poses from profile to frontal views -…
53 runs0 likes1 downloads1 reach16 impact
575 instances - 10305 features - 20 classes - 0 missing values
This dataset contains traffic violation information from all electronic traffic violations issued in the County. Any information that can be used to uniquely identify the vehicle, the vehicle owner or…
0 runs0 likes1 downloads1 reach8 impact
1578154 instances - 43 features - 4 classes - 8006541 missing values
10% stratified subsample of the original SVHN data
0 runs0 likes1 downloads1 reach11 impact
9927 instances - 3073 features - 10 classes - 0 missing values
classification
0 runs0 likes1 downloads1 reach8 impact
150 instances - 5 features - classes - 0 missing values
Public procurement data for the European Economic Area, Switzerland, and the Macedonia. 2015
0 runs0 likes1 downloads1 reach8 impact
Israeli lottery
0 runs0 likes1 downloads1 reach8 impact
1153 instances - 11 features - classes - 0 missing values
Public procurement data for the European Economic Area, Switzerland, and the Macedonia. 2015
0 runs0 likes1 downloads1 reach8 impact
565163 instances - 75 features - 0 classes - 15247061 missing values
Chocolate Bar Ratings. Expert ratings of over 1,700 chocolate bars. Each chocolate is evaluated from a combination of both objective qualities and subjective interpretation. A rating here only…
0 runs0 likes1 downloads1 reach8 impact
1794 instances - 9 features - 41 classes - 0 missing values
50% stratified subsample of the original SVHN data
0 runs0 likes1 downloads1 reach11 impact
49644 instances - 3073 features - 10 classes - 0 missing values
Estimated article influence scores in 2015
0 runs0 likes1 downloads1 reach8 impact
3615 instances - 7 features - 3169 classes - 48 missing values
Regroups information for about 7800 different US colleges. Including geographical information, stats about the population attending and post graduation career earnings.
0 runs0 likes1 downloads1 reach9 impact
7063 instances - 50 features - 0 classes - 125494 missing values
Dataset KDD98 challenge: https://kdd.ics.uci.edu/databases/kddcup98/kddcup98.html The goal is to estimate the return from a direct mailing in order to maximize donation profits. This dataset…
0 runs0 likes1 downloads1 reach9 impact
82318 instances - 478 features - 2 classes - 2399311 missing values
Incident reports from the San Franciso Police Department between January 2003 and May 2018, provided by the City and County of San Francisco. The dataset was downloaded on 05.11.2018. from…
0 runs0 likes1 downloads1 reach7 impact
538638 instances - 7 features - 2 classes - 0 missing values
Global soil saturated hydraulic conductivity measurements for geoscience applications. Total of 1,832 sites with 13,072 Ksat measurements were assembled from published literature and other sources and…
0 runs0 likes1 downloads1 reach7 impact
Global soil hydraulic properties (Ksat, Water Content 33 kPa <2mm, Water Content 1500 kPa <2mm) for geoscience applications. Total of 155,649 measurements were assembled from published…
0 runs0 likes1 downloads1 reach7 impact
Is some hand drawn digits with labels that are 1 or 0
0 runs1 likes0 downloads1 reach8 impact
This data approach student achievement in secondary education of two Portuguese schools. The data attributes include student grades, demographic, social and school related features) and it was…
0 runs0 likes1 downloads1 reach8 impact
395 instances - 33 features - 0 classes - 0 missing values
Multi-label dataset. The UC Berkeley enron4 dataset represents a subset of the original enron5 dataset and consists of 1684 cases of emails with 21 labels and 1001 predictor variables.
0 runs0 likes1 downloads1 reach9 impact
1702 instances - 1054 features - classes - 0 missing values
Multi-label dataset. The genbase dataset contains protein sequences that can be assigned to several classes of protein families.
0 runs0 likes1 downloads1 reach9 impact
662 instances - 1212 features - classes - 0 missing values
The langLog dataset includes 1004 textual predictors and was originally compiled in the doctorial thesis of Read (2010). It consists of 956 text samples that can be assigned to one or more topics such…
0 runs0 likes1 downloads1 reach9 impact
1460 instances - 1079 features - classes - 0 missing values
Multi-label dataset. Audio dataset (emotions) consists of 593 musical files with 6 clustered emotional labels and 72 predictors. Each song can be labeled with one or more of the labels…
0 runs0 likes1 downloads1 reach10 impact
593 instances - 78 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Concrete Slump dataset (Yeh 2007) concerns the prediction of three properties of concrete…
0 runs1 likes0 downloads1 reach10 impact
103 instances - 10 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes1 downloads1 reach9 impact
150 instances - 5 features - 3 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes1 downloads1 reach9 impact
14 instances - 5 features - 2 classes - 0 missing values
Test dataset
0 runs0 likes1 downloads1 reach13 impact
15547 instances - 61 features - 0 classes - 280 missing values
Test dataset
0 runs0 likes1 downloads1 reach14 impact
15547 instances - 61 features - 0 classes - 280 missing values
Daily air quality measurements in New York, May to September 1973. This data is taken from R.
0 runs0 likes1 downloads1 reach8 impact
Daily air quality measurements in New York, May to September 1973. This data is taken from R.
0 runs0 likes1 downloads1 reach8 impact
Daily air quality measurements in New York, May to September 1973. This data is taken from R.
0 runs0 likes1 downloads1 reach8 impact
punch sound
0 runs0 likes1 downloads1 reach8 impact
221 instances - 1 features - classes - 0 missing values
nominal features and target for COMPAS
0 runs0 likes1 downloads1 reach9 impact
5278 instances - 14 features - 2 classes - 0 missing values
Original data from https://github.com/propublica/compas-analysis/ by ProPublica. The data was subsequently preprocessed and reduced to relevant features for classification. The target variable is…
0 runs0 likes1 downloads1 reach10 impact
5278 instances - 14 features - 2 classes - 0 missing values