Data
Data has been taken from various sources, such as data.gov and other websites, and has been preprocessed for analysis.
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
204 instances - 5 features - classes - 0 missing values
Conventional and Social Media Movies (CSM) - Dataset 2014 and 2015 Data Set. 12 features categorized as conventional and social media features. Both conventional features, collected from movies…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
231 instances - 13 features - classes - 46 missing values
Auto MPG (6 variables) dataset. The data concerns city-cycle fuel consumption in miles per gallon (Mpg), to be predicted in terms of 1 multivalued discrete and 5 continuous attributes (two multivalued…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
392 instances - 6 features - 0 classes - 0 missing values
No data.
50 runs - 0 likes - 1 downloads - 1 reach - 11 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
66 runs - 0 likes - 2 downloads - 2 reach - 9 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
331 runs - 0 likes - 7 downloads - 7 reach - 9 impact
1000000 instances - 20 features - 2 classes - 0 missing values
No data.
315 runs - 0 likes - 2 downloads - 2 reach - 11 impact
295245 instances - 11 features - 5 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
9 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
10 instances - 1143 features - 0 classes - 0 missing values
No data.
143 runs - 0 likes - 4 downloads - 4 reach - 11 impact
1000000 instances - 39 features - 6 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets). Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
2 runs - 0 likes - 2 downloads - 2 reach - 9 impact
2178 instances - 4 features - 0 classes - 0 missing values
No data.
67 runs - 0 likes - 3 downloads - 3 reach - 9 impact
1000000 instances - 13 features - 6 classes - 0 missing values
The task consists of learning Quantitative Structure Activity Relationships (QSARs): the inhibition of dihydrofolate reductase by pyrimidines. The data are described in: King, Ross D., Muggleton,…
6 runs - 0 likes - 2 downloads - 2 reach - 9 impact
74 instances - 28 features - 0 classes - 0 missing values
Information about the dataset: CLASSTYPE: numeric; CLASSINDEX: last
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
559 instances - 5 features - 0 classes - 0 missing values
No data.
225 runs - 0 likes - 7 downloads - 7 reach - 11 impact
1000000 instances - 21 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2841 runs - 0 likes - 4 downloads - 4 reach - 24 impact
630 instances - 10936 features - 2 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
80 instances - 113 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
34 instances - 1143 features - 0 classes - 0 missing values
No data.
50 runs - 0 likes - 2 downloads - 2 reach - 12 impact
1000000 instances - 18 features - 22 classes - 0 missing values
Datasets of the Data And Story Library, a project illustrating the use of basic statistical methods, converted to ARFF format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
2 runs - 0 likes - 0 downloads - 0 reach - 14 impact
59 instances - 16 features - 0 classes - 0 missing values
source: http://www.cs.ubc.ca/labs/beta/Projects/SATzilla/ authors: L. Xu, F. Hutter, H. Hoos, K. Leyton-Brown translator in coseal format: M. Lindauer with the help of Alexandre Frechette the data do…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
296 instances - 116 features - 14 classes - 1810 missing values
Much of machine learning research focuses on producing models which perform well on benchmark tasks, in turn improving our understanding of the challenges associated with those tasks. From the…
0 runs - 0 likes - 1 downloads - 1 reach - 10 impact
70000 instances - 785 features - 10 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
0 runs - 0 likes - 0 downloads - 0 reach - 10 impact
51839 instances - 1569 features - 43 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
0 runs - 0 likes - 0 downloads - 0 reach - 10 impact
51839 instances - 1569 features - 43 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
1 runs - 0 likes - 0 downloads - 0 reach - 12 impact
51839 instances - 2917 features - 43 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
1 runs - 0 likes - 0 downloads - 0 reach - 10 impact
51839 instances - 257 features - 43 classes - 0 missing values
sdwd dede
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
44 instances - 2 features - classes - 0 missing values
At Santander our mission is to help people and businesses prosper. We are always looking for ways to help our customers understand their financial health and identify which products and services might…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
200000 instances - 202 features - 2 classes - 0 missing values
Titanic survival prediction
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
891 instances - 8 features - classes - 0 missing values
swd dced
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
589 instances - 3 features - classes - 0 missing values
sdsw frfr
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
1556 instances - 3 features - classes - 0 missing values
efe rgrg
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
Data Set Information: This research aimed at the case of customers’ default payments in Taiwan and compares the predictive accuracy of probability of default among six data mining methods. From…
0 runs - 0 likes - 1 downloads - 1 reach - 7 impact
30000 instances - 24 features - 2 classes - 0 missing values
e fvr
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 11 features - classes - 0 missing values
dd fgrfg
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 3 features - classes - 0 missing values
efef ffrf
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
9 instances - 3 features - classes - 0 missing values
Identify jets of particles from the LHC, created for the study of ultra low latency inference with hls4ml. Use 16 high level features to identify the 5 jet classes: quark (q), gluon (g), W boson (w),…
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
830000 instances - 17 features - 5 classes - 0 missing values
ssc vdv
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
1556 instances - 2 features - classes - 0 missing values
Experiment data obtained by running random configurations of the hnsw kNN through mlr on 116 different classification tasks from openml.
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
111753 instances - 13 features - classes - 0 missing values
Experiment data obtained by running random configurations of glmnet through mlr on 114 different classification tasks from openml.
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
104820 instances - 10 features - classes - 0 missing values
Experiment data obtained by running random configurations of an SVM through mlr on 106 different classification tasks from openml.
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
540576 instances - 15 features - classes - 658962 missing values
Experiment data obtained by running random configurations of rpart through mlr on 115 different classification tasks from openml.
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
92067 instances - 12 features - classes - 0 missing values
Experiment data obtained by running random configurations of ranger through mlr on 119 different classification tasks from openml.
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
278863 instances - 16 features - classes - 138965 missing values
Experiment data obtained by running random configurations of xgboost through mlr on 118 different classification tasks from openml. Parameter descriptions:…
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2955210 instances - 21 features - classes - 7051006 missing values
dataset for bme
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
63 instances - 12 features - classes - 52 missing values
Titanic survival prediction
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
891 instances - 8 features - classes - 0 missing values
Titanic survival prediction
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
891 instances - 8 features - classes - 0 missing values
Titanic survival prediction
6 runs - 0 likes - 0 downloads - 0 reach - 8 impact
891 instances - 8 features - classes - 0 missing values
Titanic survival prediction
0 runs - 0 likes - 1 downloads - 1 reach - 7 impact
891 instances - 8 features - 0 classes - 0 missing values
r fgtgt
0 runs - 0 likes - 1 downloads - 1 reach - 7 impact
2 instances - 8 features - classes - 0 missing values
swd cdef
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
sxd cde
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
xscdc frfgrg
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 1 features - classes - 0 missing values
wded def
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
sds dcdcc
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
Download test
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
xsxs cdf
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
6 instances - 2 features - classes - 0 missing values
Upload test
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
958 instances - 10 features - classes - 0 missing values
ddfef fvdf
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
8 instances - 1 features - classes - 0 missing values
wdwd cd
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
efef fdfef
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
zaxa xcdc
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
dedfef
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 1 features - classes - 0 missing values
scs
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
wdede
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
wdwd
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 1 features - classes - 0 missing values
qsqs
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 1 features - classes - 0 missing values
swdw
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
werr
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
ssf
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
swd
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
ddef
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
frf r
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 3 features - classes - 0 missing values
e eded
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 4 features - classes - 0 missing values
e3r4vr t4r
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 5 features - classes - 0 missing values
f fr
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 5 features - classes - 0 missing values
testing temperature and ph
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
26 instances - 8 features - classes - 0 missing values
This data is used to test water contamination
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
26 instances - 8 features - classes - 0 missing values
AutoML challenge 2014. Original task: regression. Test and validation sets can be obtained on the ChaLearn website: https://automl.chalearn.org/data
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
99 instances - 200001 features - 0 classes - 0 missing values
Title: Flora. Source: https://automl.chalearn.org/data. Dataset from the first ChaLearn AutoML challenge (2014). Only the training data is included, as there were no labels for validation and…
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
15000 instances - 200001 features - 0 classes - 0 missing values
A subset of the 3D dataset from Princeton's COS 429 Computer Vision course. The dataset consists of 40 models organised into 4 classes of 10 objects each.
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
16000 instances - 4 features - classes - 0 missing values
Version with corrected feature types. 'PrivacySuppressed' values are converted to None. Gathers information on about 7800 different US colleges, including geographical information, stats about the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7063 instances - 47 features - 0 classes - 104305 missing values
User profile data for San Francisco OkCupid users published in [Kim, A. Y., & Escobedo-Land, A. (2015). OKCupid data for introductory statistics and data science courses. Journal of Statistics…
0 runs - 0 likes - 0 downloads - 0 reach - 10 impact
50789 instances - 20 features - 3 classes - 154107 missing values
Bike sharing systems are a new generation of traditional bike rentals in which the whole process, from membership to rental and return, has become automatic. Through these systems, a user is able to easily rent…
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
17379 instances - 13 features - 0 classes - 0 missing values
Bike sharing systems are a new generation of traditional bike rentals in which the whole process, from membership to rental and return, has become automatic. Through these systems, a user is able to easily rent…
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
17379 instances - 13 features - 0 classes - 0 missing values
This is a preprocessed version of the anneal dataset (version 1). All missing values are treated as a nominal value with label '?'. (Quotes for clarity). Because this is not good…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
898 instances - 39 features - 5 classes - 0 missing values
The Inpatient Utilization and Payment Public Use File (Inpatient PUF) provides information on inpatient discharges for Medicare fee-for-service beneficiaries. The Inpatient PUF includes information on…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
163065 instances - 12 features - 0 classes - 0 missing values
sample
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
14 instances - 5 features - classes - 0 missing values
this is test data
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
5 instances - 5 features - classes - 0 missing values
newtest3
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
2 instances - 6 features - classes - 0 missing values
test3
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
2 instances - 8 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
150 instances - 5 features - 3 classes - 0 missing values
This is the same data as version 5 (OpenML ID = 1220) with '_id' features coded as nominal factor variables.
0 runs - 0 likes - 0 downloads - 0 reach - 10 impact
39948 instances - 12 features - 2 classes - 0 missing values
No data.
50 runs - 0 likes - 3 downloads - 3 reach - 9 impact
1000000 instances - 61 features - 2 classes - 0 missing values
No data.
65 runs - 0 likes - 5 downloads - 5 reach - 9 impact
1000000 instances - 30 features - 4 classes - 0 missing values
No data.
230 runs - 0 likes - 4 downloads - 4 reach - 11 impact
1000000 instances - 35 features - 2 classes - 0 missing values
No data.
293 runs - 0 likes - 2 downloads - 2 reach - 11 impact
1000000 instances - 17 features - 10 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
32 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 1 downloads - 1 reach - 13 impact
22 instances - 111 features - 0 classes - 0 missing values