OpenML
Filter results by:
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20151, and it has 1427 rows and 1026 features…
1 runs0 likes1 downloads1 reach5 impact
1427 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17075, and it has 15 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
15 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12863, and it has 30 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
30 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100479, and it has 11 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
11 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12238, and it has 30 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
30 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10929, and it has 154 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
154 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 250, and it has 2446 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
2446 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10280, and it has 3134 rows and 1026 features…
1 runs0 likes1 downloads1 reach5 impact
3134 instances - 1026 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository.
0 runs0 likes0 downloads0 reach10 impact
49749 instances - 301 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: The original Adult data set has 14 features, among which six are continuous and eight are categorical. In this data set,…
0 runs0 likes1 downloads1 reach10 impact
32561 instances - 124 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103561, and it has 47 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
47 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11169, and it has 52 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
52 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10444, and it has 44 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
44 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 104499, and it has 24 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
24 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11036, and it has 396 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
396 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 226, and it has 1431 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
1431 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11755, and it has 1089 rows and 1026 features…
1 runs0 likes1 downloads1 reach5 impact
1089 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12252, and it has 2998 rows and 1026 features…
1 runs0 likes1 downloads1 reach5 impact
2998 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30008, and it has 837 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
837 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10979, and it has 1671 rows and 1026 features…
1 runs0 likes1 downloads1 reach5 impact
1671 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11414, and it has 61 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
61 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12391, and it has 17 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
17 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17061, and it has 152 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
152 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101496, and it has 79 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
79 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20025, and it has 89 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach5 impact
89 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11969, and it has 1246 rows and 1026 features…
1 runs0 likes1 downloads1 reach5 impact
1246 instances - 1026 features - 0 classes - 0 missing values
Ask a home buyer to describe their dream house, and they probably won't begin with the height of the basement ceiling or the proximity to an east-west railroad. But this playground competition's…
0 runs0 likes2 downloads2 reach2 impact
1460 instances - 81 features - 0 classes - 6965 missing values
The goal is to predict the Fare. Variable description: pclass: A proxy for socio-economic status (SES) 1st = Upper 2nd = Middle 3rd = Lower age: Age is fractional if less than 1. If the age is…
0 runs0 likes3 downloads3 reach4 impact
1307 instances - 8 features - 0 classes - 0 missing values
titanic surviual prediction
0 runs0 likes1 downloads1 reach1 impact
891 instances - 8 features - 0 classes - 0 missing values
titanic surviual prediction
0 runs0 likes1 downloads1 reach1 impact
891 instances - 8 features - 0 classes - 0 missing values
titanic surviual prediction
0 runs0 likes1 downloads1 reach1 impact
891 instances - 8 features - 0 classes - 0 missing values
This data set concerns the study of the factors affecting patterns of insulin-dependent diabetes mellitus in children. The objective is to investigate the dependence of the level of serum C-peptide on…
2 runs0 likes1 downloads1 reach3 impact
43 instances - 3 features - 0 classes - 0 missing values
This dataset contains house sale prices for King County, which includes Seattle. It includes homes sold between May 2014 and May 2015. It contains 19 house features plus the price and the id columns,…
0 runs0 likes2 downloads2 reach2 impact
21613 instances - 20 features - 0 classes - 0 missing values
Anonymized data of dating profiles from OkCupid
0 runs0 likes2 downloads2 reach2 impact
59946 instances - 31 features - 0 classes - 273249 missing values
this is titanic survival prediction
0 runs0 likes1 downloads1 reach1 impact
891 instances - 8 features - 0 classes - 0 missing values
https://www.kaggle.com/harlfoxem/ This dataset contains house sale prices for King County, which includes Seattle. It includes homes sold between May 2014 and May 2015. It contains 19 house features…
0 runs0 likes0 downloads0 reach0 impact
21613 instances - 21 features - 0 classes - 0 missing values
Data for an stock long position
0 runs0 likes0 downloads0 reach0 impact
4477 instances - 20 features - 0 classes - 0 missing values
this is titanic survival prediction
0 runs0 likes2 downloads2 reach1 impact
891 instances - 8 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
2 runs0 likes2 downloads2 reach6 impact
2178 instances - 4 features - 0 classes - 0 missing values
This is a dataset obtained from the StatLib repository. Here is the included description: The data provided are daily stock prices from January 1988 through October 1991, for ten aerospace companies.…
5 runs1 likes8 downloads9 reach6 impact
950 instances - 10 features - 0 classes - 0 missing values
Context It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged for items that they did not purchase. Content The…
0 runs0 likes6 downloads6 reach6 impact
284807 instances - 31 features - 0 classes - 0 missing values
URL dataset
0 runs0 likes0 downloads0 reach0 impact
121001 instances - 501 features - 0 classes - 0 missing values
Email dataset 1a
0 runs0 likes0 downloads0 reach0 impact
4585 instances - 4 features - 0 classes - 0 missing values
Email dataset 1c
0 runs0 likes0 downloads0 reach0 impact
4585 instances - 792 features - 0 classes - 0 missing values
Email dataset 1b
0 runs0 likes0 downloads0 reach0 impact
4585 instances - 24 features - 0 classes - 161 missing values
Phishing website 1
0 runs0 likes0 downloads0 reach0 impact
11055 instances - 31 features - 0 classes - 0 missing values
URL dataset 2
0 runs0 likes0 downloads0 reach0 impact
95911 instances - 13 features - 0 classes - 0 missing values
Email dataset 1d
0 runs0 likes0 downloads0 reach0 impact
4585 instances - 11 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Tumor-size treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
0 runs0 likes3 downloads3 reach10 impact
286 instances - 10 features - 0 classes - 9 missing values
Email dataset 1e
0 runs0 likes0 downloads0 reach0 impact
4585 instances - 580 features - 0 classes - 0 missing values
Email dataset 2
0 runs0 likes0 downloads0 reach0 impact
11507 instances - 4 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
77 runs0 likes3 downloads3 reach7 impact
337 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
67 runs0 likes1 downloads1 reach7 impact
458 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2855 runs0 likes4 downloads4 reach16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2862 runs0 likes5 downloads5 reach16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2849 runs0 likes5 downloads5 reach16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
65 runs0 likes2 downloads2 reach7 impact
324 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
80 runs0 likes5 downloads5 reach7 impact
405 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2865 runs1 likes18 downloads19 reach16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
77 runs0 likes2 downloads2 reach7 impact
384 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2853 runs1 likes7 downloads8 reach16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) KDD Cup 2009 http://www.kddcup-orange.com Converted to ARFF format by TunedIT Customer Relationship Management (CRM) is a key element…
11301 runs0 likes12 downloads12 reach17 impact
50000 instances - 231 features - 2 classes - 8024152 missing values
knugget chase 3
0 runs0 likes2 downloads2 reach4 impact
194 instances - 40 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2834 runs0 likes4 downloads4 reach16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2855 runs0 likes8 downloads8 reach16 impact
542 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
65 runs0 likes1 downloads1 reach7 impact
321 instances - 10937 features - 2 classes - 0 missing values
No data.
43 runs0 likes2 downloads2 reach1 impact
1000000 instances - 45 features - 2 classes - 0 missing values
No data.
44 runs0 likes3 downloads3 reach2 impact
1000000 instances - 15 features - 2 classes - 0 missing values
No data.
90 runs2 likes3 downloads5 reach2 impact
663552 instances - 13 features - 2 classes - 0 missing values
No data.
45 runs0 likes2 downloads2 reach1 impact
1000000 instances - 23 features - 2 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
406 runs1 likes11 downloads12 reach8 impact
4229 instances - 1618 features - 2 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes2 downloads2 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes1 downloads1 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes1 downloads1 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes2 downloads2 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
Normalized form of codrna (351) Andrew V Uzilov, Joshua M Keegan, and David H Mathews. Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change. BMC…
309 runs0 likes5 downloads5 reach1 impact
488565 instances - 9 features - 2 classes - 0 missing values
No data.
311 runs0 likes5 downloads5 reach2 impact
1000000 instances - 10 features - 2 classes - 0 missing values
No data.
306 runs0 likes4 downloads4 reach2 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
305 runs0 likes3 downloads3 reach2 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
313 runs0 likes3 downloads3 reach1 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
47 runs0 likes1 downloads1 reach1 impact
1000000 instances - 45 features - 2 classes - 0 missing values
No data.
51 runs0 likes2 downloads2 reach1 impact
1000000 instances - 15 features - 2 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
548 runs0 likes9 downloads9 reach8 impact
3468 instances - 785 features - 2 classes - 0 missing values
DEXTER is a text classification problem in a bag-of-word representation. This is a two-class classification problem with sparse continuous input variables. This dataset is one of five datasets of the…
0 runs0 likes5 downloads5 reach11 impact
600 instances - 20001 features - 2 classes - 0 missing values
Om algos te testen
74 runs0 likes5 downloads5 reach7 impact
14240 instances - 31 features - 2 classes - 0 missing values
No data.
70 runs0 likes3 downloads3 reach1 impact
1000000 instances - 28 features - 2 classes - 0 missing values
No data.
72 runs0 likes3 downloads3 reach1 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
73 runs0 likes5 downloads5 reach2 impact
1000000 instances - 16 features - 2 classes - 0 missing values
No data.
68 runs0 likes4 downloads4 reach2 impact
1000000 instances - 21 features - 2 classes - 0 missing values
No data.
65 runs0 likes4 downloads4 reach1 impact
1000000 instances - 40 features - 2 classes - 0 missing values
No data.
324 runs0 likes5 downloads5 reach2 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
68 runs0 likes11 downloads11 reach1 impact
1000000 instances - 10 features - 2 classes - 0 missing values
No data.
230 runs0 likes4 downloads4 reach2 impact
1000000 instances - 35 features - 2 classes - 0 missing values
No data.
310 runs0 likes4 downloads4 reach2 impact
1000000 instances - 11 features - 2 classes - 0 missing values
Synthetic dataset. Almost identical to [dataset 152](https://www.openml.org/d/153/edit)
319 runs0 likes4 downloads4 reach2 impact
1000000 instances - 11 features - 2 classes - 0 missing values
No data.
73 runs0 likes5 downloads5 reach1 impact
1000000 instances - 30 features - 2 classes - 0 missing values