OpenML
Data Description: This is the historical price data of the FOREX CAD/CHF from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
1831 instances - 12 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10495, and it has 1829 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1829 instances - 1026 features - 0 classes - 0 missing values
Data Description: This is the historical price data of the FOREX NZD/USD from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
1828 instances - 12 features - 2 classes - 0 missing values
Data Description: This is the historical price data of the FOREX NZD/USD from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
1828 instances - 12 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10526, and it has 1814 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1814 instances - 1026 features - 0 classes - 0 missing values
Chocolate Bar Ratings. Expert ratings of over 1,700 chocolate bars. Each chocolate is evaluated from a combination of both objective qualities and subjective interpretation. A rating here only…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
1795 instances - 9 features - 42 classes - 1 missing values
Chocolate Bar Ratings. Expert ratings of over 1,700 chocolate bars. Each chocolate is evaluated from a combination of both objective qualities and subjective interpretation. A rating here only…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
1794 instances - 9 features - 41 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10498, and it has 1748 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1748 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 124, and it has 1745 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1745 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 8, and it has 1739 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1739 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 36, and it has 1731 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1731 instances - 1026 features - 0 classes - 0 missing values
car-evaluation-pmlb
31 runs - 0 likes - 1 downloads - 1 reach - 11 impact
1728 instances - 22 features - 4 classes - 0 missing values
This database was derived from a simple hierarchical decision model originally developed for the demonstration of DEX (M. Bohanec, V. Rajkovic: Expert system for decision making. Sistemica 1(1), pp.…
6166 runs - 0 likes - 7 downloads - 7 reach - 13 impact
1728 instances - 7 features - 4 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
765 runs - 0 likes - 12 downloads - 12 reach - 8 impact
1728 instances - 7 features - 2 classes - 0 missing values
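The binarization described in the entry above can be sketched as follows. The description is cut off after the positive label, so this is only an illustration under stated assumptions: the function name `binarize_majority` is hypothetical, and mapping every non-majority class to a negative label `'N'` is an assumption that the truncated text does not confirm.

```python
from collections import Counter

def binarize_majority(labels, positive="P", negative="N"):
    """Relabel the most frequent class as `positive` and every other class as `negative`.

    The `negative` label and its default value 'N' are assumptions for illustration;
    the source description is truncated before it names the negative label.
    """
    # most_common(1) returns the class with the highest count
    # (on ties, the class that was encountered first).
    majority_class, _ = Counter(labels).most_common(1)[0]
    return [positive if label == majority_class else negative for label in labels]

# Toy multi-class target with four classes; "unacc" is the majority class.
toy_labels = ["unacc", "acc", "unacc", "good", "unacc", "vgood"]
binary_labels = binarize_majority(toy_labels)
print(binary_labels)
```

The same idea would apply to any multi-class nominal target; only the class counts change.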
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11534, and it has 1703 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1703 instances - 1026 features - 0 classes - 0 missing values
Multi-label dataset. The UC Berkeley enron4 dataset represents a subset of the original enron5 dataset and consists of 1684 cases of emails with 21 labels and 1001 predictor variables.
1 runs - 0 likes - 3 downloads - 3 reach - 7 impact
1702 instances - 1054 features - 2 classes - 0 missing values
Multi-label dataset. The UC Berkeley enron4 dataset represents a subset of the original enron5 dataset and consists of 1684 cases of emails with 21 labels and 1001 predictor variables.
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
1702 instances - 1054 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11082, and it has 1695 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1695 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10979, and it has 1671 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1671 instances - 1026 features - 0 classes - 0 missing values
No data.
159 runs - 0 likes - 11 downloads - 11 reach - 11 impact
1657 instances - 3759 features - 25 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11736, and it has 1649 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1649 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12687, and it has 1648 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1648 instances - 1026 features - 0 classes - 0 missing values
Domain dataset
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
1637 instances - 9839 features - 3 classes - 13231887 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 56, and it has 1631 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1631 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10502, and it has 1627 rows and 1026 features…
1 runs - 0 likes - 2 downloads - 2 reach - 4 impact
1627 instances - 1026 features - 0 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: A2 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
119 runs - 0 likes - 4 downloads - 4 reach - 6 impact
1623 instances - 4 features - 5 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11300, and it has 1616 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1616 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12840, and it has 1608 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1608 instances - 1026 features - 0 classes - 0 missing values
GAMETES_Epistasis_2-Way_1000atts_0.4H_EDM-1_EDM-1_1-pmlb
0 runs - 0 likes - 1 downloads - 1 reach - 12 impact
1600 instances - 1001 features - 2 classes - 0 missing values
GAMETES_Epistasis_2-Way_20atts_0.1H_EDM-1_1-pmlb
31 runs - 0 likes - 0 downloads - 0 reach - 12 impact
1600 instances - 21 features - 2 classes - 0 missing values
GAMETES_Epistasis_2-Way_20atts_0.4H_EDM-1_1-pmlb
31 runs - 0 likes - 0 downloads - 0 reach - 12 impact
1600 instances - 21 features - 2 classes - 0 missing values
GAMETES_Epistasis_3-Way_20atts_0.2H_EDM-1_1-pmlb
31 runs - 0 likes - 0 downloads - 0 reach - 12 impact
1600 instances - 21 features - 2 classes - 0 missing values
GAMETES_Heterogeneity_20atts_1600_Het_0.4_0.2_50_EDM-2_001-pmlb
0 runs - 0 likes - 0 downloads - 0 reach - 12 impact
1600 instances - 21 features - 2 classes - 0 missing values
GAMETES_Heterogeneity_20atts_1600_Het_0.4_0.2_75_EDM-2_001-pmlb
31 runs - 0 likes - 0 downloads - 0 reach - 12 impact
1600 instances - 21 features - 2 classes - 0 missing values
Description: One-hundred plant species leaves dataset (Class = Shape). Sources: (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The…
143287 runs - 1 likes - 35 downloads - 36 reach - 408 impact
1600 instances - 65 features - 100 classes - 0 missing values
Description: One-hundred plant species leaves dataset (Class = Margin). Sources: (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The…
143049 runs - 1 likes - 15 downloads - 16 reach - 410 impact
1600 instances - 65 features - 100 classes - 0 missing values
wine-quality-red-pmlb
31 runs - 1 likes - 0 downloads - 1 reach - 12 impact
1599 instances - 12 features - 6 classes - 0 missing values
Test file for ML training
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
1599 instances - 12 features - classes - 0 missing values
Description: One-hundred plant species leaves dataset (Class = Texture). Sources: (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The…
143077 runs - 2 likes - 61 downloads - 63 reach - 411 impact
1599 instances - 65 features - 100 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 35, and it has 1597 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1597 instances - 1026 features - 0 classes - 0 missing values
Tattile Via Gaetano Donizetti, 1-3-5, 25030 Mairano (Brescia), Italy. Dataset Description: Semeion Handwritten Digit Data Set, where 1593 handwritten digits from around 80 persons were scanned and…
31160 runs - 0 likes - 22 downloads - 22 reach - 51 impact
1593 instances - 257 features - 10 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11522, and it has 1585 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1585 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 219, and it has 1584 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1584 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11280, and it has 1584 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1584 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10849, and it has 1580 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1580 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 43, and it has 1577 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1577 instances - 1026 features - 0 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from flight software for earth orbiting satellite. Data comes from McCabe and Halstead features extractors of source code. These features…
144599 runs - 1 likes - 16 downloads - 17 reach - 19 impact
1563 instances - 38 features - 2 classes - 0 missing values
No data.
163 runs - 0 likes - 13 downloads - 13 reach - 11 impact
1560 instances - 8461 features - 20 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 235, and it has 1560 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1560 instances - 1026 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
59 runs - 0 likes - 6 downloads - 6 reach - 8 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2855 runs - 0 likes - 4 downloads - 4 reach - 16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2862 runs - 0 likes - 5 downloads - 5 reach - 16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2849 runs - 0 likes - 5 downloads - 5 reach - 16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2865 runs - 1 likes - 18 downloads - 19 reach - 16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2853 runs - 1 likes - 7 downloads - 8 reach - 16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2834 runs - 0 likes - 4 downloads - 4 reach - 16 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
72 runs - 1 likes - 7 downloads - 8 reach - 9 impact
1545 instances - 10937 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2862 runs - 0 likes - 8 downloads - 8 reach - 17 impact
1545 instances - 10937 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100044, and it has 1541 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1541 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11473, and it has 1529 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1529 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 127, and it has 1524 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1524 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11149, and it has 1521 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1521 instances - 1026 features - 0 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: A3 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
133 runs - 0 likes - 7 downloads - 7 reach - 7 impact
1521 instances - 4 features - 5 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 65, and it has 1515 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1515 instances - 1026 features - 0 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: A4 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
136 runs - 0 likes - 5 downloads - 5 reach - 7 impact
1515 instances - 4 features - 5 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12227, and it has 1510 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1510 instances - 1026 features - 0 classes - 0 missing values
No data.
222 runs - 0 likes - 10 downloads - 10 reach - 8 impact
1504 instances - 2887 features - 13 classes - 0 missing values
Dataset creator and donator: Zhi Liu, e-mail: liuzhi8673 '@' gmail.com, institution: National Engineering Research Center for E-Learning, Hubei Wuhan, China Data Set Information: dataset are derived…
65168 runs - 2 likes - 42 downloads - 44 reach - 209 impact
1500 instances - 10001 features - 50 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11575, and it has 1490 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1490 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11407, and it has 1488 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1488 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 249, and it has 1487 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1487 instances - 1026 features - 0 classes - 0 missing values
No data.
2193 runs - 1 likes - 16 downloads - 17 reach - 2 impact
1484 instances - 9 features - 10 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10811, and it has 1480 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1480 instances - 1026 features - 0 classes - 0 missing values
1. Title: Contraceptive Method Choice 2. Sources: (a) Origin: This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey (b) Creator: Tjen-Sien Lim (limt@stat.wisc.edu)…
23427 runs - 0 likes - 19 downloads - 19 reach - 2 impact
1473 instances - 10 features - 3 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs - 0 likes - 7 downloads - 7 reach - 8 impact
1473 instances - 10 features - 2 classes - 0 missing values
The langLog dataset includes 1004 textual predictors and was originally compiled in the doctoral thesis of Read (2010). It consists of 956 text samples that can be assigned to one or more topics such…
0 runs - 0 likes - 4 downloads - 4 reach - 3 impact
1460 instances - 1079 features - 2 classes - 0 missing values
The langLog dataset includes 1004 textual predictors and was originally compiled in the doctoral thesis of Read (2010). It consists of 956 text samples that can be assigned to one or more topics such…
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
1460 instances - 1079 features - classes - 0 missing values
Ask a home buyer to describe their dream house, and they probably won't begin with the height of the basement ceiling or the proximity to an east-west railroad. But this playground competition's…
0 runs - 0 likes - 1 downloads - 1 reach - 1 impact
1460 instances - 81 features - 0 classes - 6965 missing values
One of the NASA Metrics Data Program defect data sets. Data from flight software for earth orbiting satellite. Data comes from McCabe and Halstead features extractors of source code. These features…
113540 runs - 0 likes - 15 downloads - 15 reach - 19 impact
1458 instances - 38 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10907, and it has 1443 rows and 1026 features…
1 runs - 0 likes - 4 downloads - 4 reach - 3 impact
1443 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 226, and it has 1431 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1431 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20151, and it has 1427 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1427 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 138, and it has 1426 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1426 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 125, and it has 1425 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1425 instances - 1026 features - 0 classes - 0 missing values
Binarized version of the USPS dataset (see version 2). Only instances with class labels 6 and 9 from the original dataset are considered and encoded as 0 (original class 6) and 1 (original class 9).
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
1424 instances - 257 features - 2 classes - 0 missing values
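The two-class filtering and encoding described in the entry above can be sketched as follows. This is a minimal illustration, not the procedure actually used to build the dataset: the function name `binarize_two_classes` is hypothetical, and it assumes the original labels are stored as the integers 6 and 9.

```python
def binarize_two_classes(rows, labels, negative_label=6, positive_label=9):
    """Keep only rows labelled `negative_label` or `positive_label`.

    Rows with `negative_label` are encoded as 0 and rows with
    `positive_label` as 1; all other rows are dropped.
    Integer labels 6 and 9 are an assumption about how the data is stored.
    """
    kept_rows, kept_labels = [], []
    for row, label in zip(rows, labels):
        if label == negative_label:
            kept_rows.append(row)
            kept_labels.append(0)
        elif label == positive_label:
            kept_rows.append(row)
            kept_labels.append(1)
    return kept_rows, kept_labels

# Toy data: five rows of two features each, with mixed digit labels.
toy_rows = [[0, 1], [2, 3], [4, 5], [6, 7], [8, 9]]
toy_labels = [6, 3, 9, 6, 1]
bin_rows, bin_labels = binarize_two_classes(toy_rows, toy_labels)
print(len(bin_rows), bin_labels)
```

Rows with any other label are discarded, which is why such a binarized dataset has fewer instances than the one it is derived from.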
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 50, and it has 1414 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1414 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12247, and it has 1403 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1403 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 128, and it has 1386 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1386 instances - 1026 features - 0 classes - 0 missing values
Author: Volker Lohweg (University of Applied Sciences, Ostwestfalen-Lippe) Source: [UCI](https://archive.ics.uci.edu/ml/datasets/banknote+authentication) - 2012 Please cite:…
135489 runs - 3 likes - 25 downloads - 28 reach - 22 impact
1372 instances - 5 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 118, and it has 1362 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1362 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 238, and it has 1358 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1358 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12268, and it has 1352 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1352 instances - 1026 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
41 runs - 0 likes - 2 downloads - 2 reach - 8 impact
1340 instances - 18 features - 3 classes - 20 missing values
Database of baseball players and play statistics, including 'Games_played', 'At_bats', 'Runs', 'Hits', 'Doubles', 'Triples', 'Home_runs', 'RBIs', 'Walks', 'Strikeouts', 'Batting_average',…
795 runs - 0 likes - 10 downloads - 10 reach - 3 impact
1340 instances - 18 features - 3 classes - 20 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
131 runs - 0 likes - 6 downloads - 6 reach - 8 impact
1340 instances - 18 features - 2 classes - 20 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10378, and it has 1330 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1330 instances - 1026 features - 0 classes - 0 missing values
The origin is not clear, but presumably this is an artificial problem representing M-of-N rules. The target is 1 if a certain M 'bits' are '1'? (Joaquin Vanschoren)
31 runs - 0 likes - 0 downloads - 0 reach - 12 impact
1324 instances - 11 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12090, and it has 1312 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1312 instances - 1026 features - 0 classes - 0 missing values
The original Titanic dataset, describing the survival status of individual passengers on the Titanic. The titanic data does not contain information from the crew, but it does contain actual ages of…
0 runs - 0 likes - 10 downloads - 10 reach - 4 impact
1309 instances - 14 features - 2 classes - 3855 missing values
The goal is to predict the Fare. Variable description: pclass: A proxy for socio-economic status (SES) 1st = Upper 2nd = Middle 3rd = Lower age: Age is fractional if less than 1. If the age is…
0 runs - 0 likes - 1 downloads - 1 reach - 2 impact
1307 instances - 8 features - 0 classes - 0 missing values