Data
Filter results by:
Data set of around 45 language and 25 Category. Consist of articles.
0 runs0 likes0 downloads0 reach8 impact
65428 instances - 3 features - classes - 0 missing values
The ILPD dataset from the OpenCC18 with all categorical variables label encoded
0 runs0 likes0 downloads0 reach8 impact
583 instances - 11 features - 0 classes - 0 missing values
The sick dataset from the OpenCC18 with all categorical data label encoded so all data is numeric
0 runs0 likes0 downloads0 reach8 impact
3772 instances - 30 features - classes - 0 missing values
The ILPD liver dataset from the OpenCC18 with the gender binary encoded so all features are numeric
1 runs0 likes0 downloads0 reach9 impact
583 instances - 11 features - 2 classes - 0 missing values
Elegibilidade ecommerce
0 runs0 likes1 downloads1 reach8 impact
269177 instances - 2 features - 2 classes - 0 missing values
test openml upload
0 runs0 likes0 downloads0 reach10 impact
150 instances - 5 features - 3 classes - 0 missing values
source: An Algorithm Selection Benchmark for the Container Pre-Marshalling Problem (CPMP) authors: K. Tierney and Y. Malitsky (features) / K. Tierney and D. Pacino and S. Voss (algorithms) translator…
0 runs0 likes1 downloads1 reach8 impact
2108 instances - 24 features - 0 classes - 0 missing values
Asteroid Dataset
0 runs0 likes1 downloads1 reach8 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes1 downloads1 reach11 impact
126131 instances - 34 features - 2 classes - 99 missing values
Test
0 runs0 likes0 downloads0 reach7 impact
6330 instances - 8 features - classes - 0 missing values
good
0 runs0 likes0 downloads0 reach8 impact
10 instances - 4 features - classes - 2 missing values
Fixed dataset for autoHorse.csv I suggest...
0 runs0 likes0 downloads0 reach11 impact
201 instances - 69 features - 186 classes - 0 missing values
price col is int now. autoHorse dataset
15 runs0 likes0 downloads0 reach12 impact
201 instances - 69 features - 0 classes - 0 missing values
testing
0 runs0 likes0 downloads0 reach7 impact
366 instances - 3 features - classes - 0 missing values
iris-example
0 runs0 likes0 downloads0 reach7 impact
150 instances - 5 features - 3 classes - 0 missing values
hydraulic
0 runs0 likes0 downloads0 reach7 impact
2205 instances - 22 features - classes - 0 missing values
sde c
5 runs0 likes0 downloads0 reach7 impact
1556 instances - 5629 features - classes - 0 missing values
as cscs
3 runs0 likes0 downloads0 reach7 impact
1557 instances - 5629 features - classes - 0 missing values
sd vfv
0 runs0 likes0 downloads0 reach7 impact
4 instances - 50 features - 2 classes - 0 missing values
r rg
0 runs0 likes0 downloads0 reach8 impact
4 instances - 50 features - classes - 0 missing values
as dwd
1 runs0 likes0 downloads0 reach7 impact
1557 instances - 5629 features - classes - 0 missing values
ef r
2 runs0 likes0 downloads0 reach7 impact
1557 instances - 5629 features - classes - 0 missing values
dd ref
0 runs0 likes0 downloads0 reach7 impact
4 instances - 50 features - classes - 0 missing values
The Inpatient Utilization and Payment Public Use File (Inpatient PUF) provides information on inpatient discharges for Medicare fee-for-service beneficiaries. The Inpatient PUF includes…
0 runs0 likes0 downloads0 reach6 impact
163065 instances - 12 features - 0 classes - 0 missing values
Touch Signals
0 runs0 likes0 downloads0 reach6 impact
265 instances - 11 features - classes - 0 missing values
Touch samples 2
0 runs0 likes0 downloads0 reach8 impact
265 instances - 11 features - 8 classes - 0 missing values
valores de saida de fardamento com temperaturas, admissões e demissões
0 runs0 likes0 downloads0 reach8 impact
6277 instances - 7 features - 0 classes - 0 missing values
ef f
0 runs0 likes0 downloads0 reach7 impact
4 instances - 49 features - classes - 0 missing values
dsd efe
1 runs0 likes0 downloads0 reach7 impact
601 instances - 7 features - classes - 0 missing values
rrvrf 4rr
0 runs0 likes0 downloads0 reach7 impact
4 instances - 49 features - classes - 0 missing values
de d
3 runs0 likes0 downloads0 reach7 impact
1556 instances - 5628 features - classes - 0 missing values
fr frf
2 runs0 likes0 downloads0 reach7 impact
1556 instances - 5629 features - classes - 0 missing values
This data represents crime reported to the Seattle Police Department (SPD). Each row contains the record of a unique event where at least one criminal offense was reported by a member of the community…
0 runs0 likes0 downloads0 reach6 impact
52358 instances - 8 features - 0 classes - 650 missing values
ede wey
0 runs0 likes0 downloads0 reach6 impact
589 instances - 2909 features - classes - 0 missing values
eevrr der
0 runs0 likes0 downloads0 reach9 impact
1557 instances - 5629 features - classes - 0 missing values
The midwest survey dataset contain individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs0 likes0 downloads0 reach6 impact
2778 instances - 28 features - 10 classes - 1744 missing values
dd efrg
15 runs0 likes0 downloads0 reach15 impact
1556 instances - 5629 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach6 impact
150 instances - 5 features - classes - 0 missing values
efe def
0 runs0 likes0 downloads0 reach8 impact
4 instances - 49 features - classes - 0 missing values
Dataset showing Data from matches played RB Leipzig prior to 14.06.2020
0 runs0 likes0 downloads0 reach6 impact
102 instances - 1 features - classes - 0 missing values
student performance 1
0 runs0 likes1 downloads1 reach6 impact
3892 instances - 36 features - classes - 0 missing values
mydata
0 runs0 likes0 downloads0 reach6 impact
3892 instances - 36 features - classes - 0 missing values
sqs efrf
0 runs0 likes0 downloads0 reach7 impact
4 instances - 5 features - classes - 0 missing values
b gtrg
0 runs0 likes0 downloads0 reach7 impact
4 instances - 7 features - classes - 0 missing values
Zurich public transport delay data 2016-10-30 03:30:00 CET - 2016-11-27 01:20:00 CET cleaned and prepared at Open Data Day 2017. For this version, the task was downsampled to 0.5 percent. Some…
0 runs0 likes0 downloads0 reach6 impact
27327 instances - 18 features - 0 classes - 657 missing values
![palmerpenguins](https://github.com/allisonhorst/palmerpenguins/raw/master/man/figures/logo.png) ## Description The goal of palmerpenguins is to provide a great dataset for data exploration &…
0 runs0 likes0 downloads0 reach6 impact
344 instances - 7 features - 3 classes - 18 missing values
The midwest survey dataset contain individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs0 likes0 downloads0 reach6 impact
2778 instances - 28 features - 10 classes - 1744 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
13 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
13 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes1 downloads1 reach13 impact
5 instances - 1143 features - 0 classes - 0 missing values
No data.
206 runs0 likes3 downloads3 reach11 impact
1000000 instances - 39 features - 6 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
10 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
6 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
31 instances - 54 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes2 downloads2 reach13 impact
195 instances - 33 features - 0 classes - 0 missing values
This is the hip measurement data from Table B.13 in Chatfield's Problem Solving (1995, 2nd edn, Chapman and Hall). It is given in 8 columns. First 4 columns are for Control Group. Last 4 columns are…
0 runs0 likes0 downloads0 reach11 impact
54 instances - 8 features - classes - 120 missing values
No data.
73 runs0 likes5 downloads5 reach11 impact
1000000 instances - 16 features - 2 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) The infamous Longley data, "An appraisal of least-squares programs from the point of view of the user", JASA, 62(1967) p819-841. Variables are: Number of…
3 runs0 likes1 downloads1 reach9 impact
16 instances - 7 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
16 instances - 24 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
15 instances - 10 features - 0 classes - 0 missing values
No data.
66 runs0 likes3 downloads3 reach12 impact
1000000 instances - 35 features - 6 classes - 0 missing values
This is one of a family of datasets synthetically generated from a realistic simulation of the dynamics of a Unimation Puma 560 robot arm. There are eight datastets in this family . In this repository…
0 runs0 likes6 downloads6 reach13 impact
8192 instances - 33 features - 0 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistic methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs0 likes1 downloads1 reach13 impact
39 instances - 4 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
82 runs0 likes5 downloads5 reach15 impact
405 instances - 10936 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
65 runs0 likes3 downloads3 reach15 impact
412 instances - 10936 features - 2 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistic methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
593 runs0 likes7 downloads7 reach14 impact
478 instances - 11 features - 3 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
65 runs0 likes4 downloads4 reach15 impact
470 instances - 10936 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
77 runs0 likes5 downloads5 reach15 impact
275 instances - 10936 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach9 impact
1000000 instances - 41 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach12 impact
1000000 instances - 16 features - 0 classes - 0 missing values
This collection includes 21 data sets of one-dimensional ultrasound raw RF data (A-Scans) acquired from the calf muscles of 8 healthy volunteers. The subjects were asked to manually annotate the data…
0 runs0 likes1 downloads1 reach8 impact
212872 instances - 4 features - classes - 0 missing values
Data contains the information of 9144 samples form 220 spectral bands. The classes represent land-use types: alfalfa, corn, grass, hay, oats, soybeans, trees, and wheat.
0 runs0 likes2 downloads2 reach10 impact
9144 instances - 221 features - 8 classes - 0 missing values
Binarized version of the semeion dataset (see version 1). Only instances with class labels 1 and 2 from the original dataset are considered.
0 runs0 likes0 downloads0 reach9 impact
319 instances - 257 features - 2 classes - 0 missing values
This is a meta-dataset which describes the SVM hyperparameter tuning problem. The target attribute indicates whether tuning is required or default hyperparameter values are enough to each dataset…
0 runs0 likes0 downloads0 reach8 impact
156 instances - 81 features - 2 classes - 0 missing values
This is a meta-dataset which describes the SVM hyperparameter tuning problem. The target attribute indicates whether tuning is required or default hyperparameter values are enough to each dataset…
0 runs0 likes0 downloads0 reach8 impact
156 instances - 91 features - 2 classes - 0 missing values
This is a meta-dataset which describes the SVM hyperparameter tuning problem. The target attribute indicates whether tuning is required or default hyperparameter values are enough to each dataset…
0 runs0 likes0 downloads0 reach8 impact
156 instances - 81 features - 2 classes - 0 missing values
uci adult partitioned
0 runs0 likes0 downloads0 reach8 impact
48844 instances - 17 features - classes - 6495 missing values
uci
0 runs0 likes0 downloads0 reach8 impact
30000 instances - 27 features - classes - 0 missing values
uci
0 runs0 likes0 downloads0 reach8 impact
101766 instances - 52 features - classes - 192849 missing values
hmeq_p,BAD,binary
0 runs0 likes0 downloads0 reach8 impact
5960 instances - 15 features - classes - 5271 missing values
kaggle_santander_p
0 runs0 likes0 downloads0 reach8 impact
200000 instances - 203 features - classes - 0 missing values
Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap P. Fränti and O. Virmajoki, "Iterative shrinking method for clustering…
0 runs0 likes0 downloads0 reach8 impact
5000 instances - 3 features - 0 classes - 0 missing values
classification
0 runs0 likes1 downloads1 reach8 impact
150 instances - 5 features - classes - 0 missing values
This dataset contains house sale prices for King County, which includes Seattle. It includes homes sold between May 2014 and May 2015. It contains 19 house features plus the price and the id columns,…
0 runs0 likes2 downloads2 reach9 impact
21613 instances - 20 features - 0 classes - 0 missing values
Public procurement data for the European Economic Area, Switzerland, and the Macedonia. 2015
0 runs0 likes1 downloads1 reach8 impact
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach8 impact
150 instances - 5 features - 3 classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach8 impact
150 instances - 5 features - 3 classes - 0 missing values
Wine data gathered by https://www.kaggle.com/zynicideThe data was scraped from WineEnthusiast during the week of June 15th, 2017. The code for the scraper can be found at…
0 runs0 likes0 downloads0 reach8 impact
150930 instances - 10 features - classes - 174477 missing values
Data are collected from Kickstarter Platform You'll find most useful data for project analysis. Columns are self explanatory except: usd_pledged: conversion in US dollars of the pledged column…
0 runs0 likes0 downloads0 reach8 impact
331675 instances - 14 features - classes - 210 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach8 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
General Description 2015-current: greater than $200.00. The Commission categorizes contributions from individuals using the calendar year-to-date amount for political action committee (PAC) and party…
0 runs0 likes2 downloads2 reach8 impact
3348209 instances - 21 features - 0 classes - 10786577 missing values
Regroups information for about 7800 different US colleges. Including geographical information, stats about the population attending and post graduation career earnings.
0 runs0 likes0 downloads0 reach8 impact
Estimated article influence scores in 2015
0 runs0 likes0 downloads0 reach8 impact
3615 instances - 7 features - 3169 classes - 48 missing values
Annual salary information including gross pay and overtime pay for all active, permanent employees of Montgomery County, MD paid in calendar year 2016. This information will be published annually each…
0 runs0 likes3 downloads3 reach8 impact
9228 instances - 13 features - 0 classes - 11169 missing values
Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap P. Fränti and O. Virmajoki, "Iterative shrinking method for clustering…
0 runs0 likes0 downloads0 reach8 impact
5000 instances - 3 features - 0 classes - 0 missing values
Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap P. Fränti and O. Virmajoki, "Iterative shrinking method for clustering…
0 runs0 likes0 downloads0 reach8 impact
5000 instances - 3 features - 0 classes - 0 missing values
Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap P. Fränti and O. Virmajoki, "Iterative shrinking method for clustering…
0 runs0 likes0 downloads0 reach8 impact
5000 instances - 3 features - 0 classes - 0 missing values
Public procurement data for the European Economic Area, Switzerland, and the Macedonia. 2015
0 runs0 likes0 downloads0 reach8 impact
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach8 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach8 impact
1586614 instances - 13 features - 104 classes - 68148 missing values