OpenML
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30003, and it has 778 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
778 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 214, and it has 770 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
770 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10443, and it has 769 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
769 instances - 1026 features - 0 classes - 0 missing values
Source: The dataset was created by Angeliki Xifara (angxifara @ gmail.com, Civil/Structural Engineer) and was processed by Athanasios Tsanas (tsanasthanasis @ gmail.com, Oxford Centre for Industrial…
103 runs - 1 likes - 4 downloads - 5 reach - 5 impact
768 instances - 10 features - 37 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Energy Building dataset (Tsanas and Xifara 2012) concerns the prediction of the heating load…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
768 instances - 10 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Energy Building dataset (Tsanas and Xifara 2012) concerns the prediction of the heating load…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
768 instances - 10 features - classes - 0 missing values
1. Title: Pima Indians Diabetes Database 2. Sources: (a) Original owners: National Institute of Diabetes and Digestive and Kidney Diseases (b) Donor of database: Vincent Sigillito…
200304 runs - 4 likes - 75 downloads - 79 reach - 3 impact
768 instances - 9 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10970, and it has 763 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
763 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11926, and it has 756 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
756 instances - 1026 features - 0 classes - 0 missing values
Source: C. Okan Sakar a, Gorkem Serbes b, Aysegul Gunduz c, Hunkar C. Tunc a, Hatice Nizam d, Betul Erdogdu Sakar e, Melih Tutuncu c, Tarkan Aydin a, M. Erdem Isenkul d, Hulya Apaydin c a Department…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
756 instances - 754 features - 0 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-250-drift-au6-cd1-500 * Abstract: AutoUniv is an advanced data generator for classification tasks. The aim is to reflect the nuances and…
11011 runs - 0 likes - 9 downloads - 9 reach - 39 impact
750 instances - 41 features - 8 classes - 0 missing values
Data taken from the Blood Transfusion Service Center in Hsin-Chu City in Taiwan -- this is a classification problem. To demonstrate the RFMTC marketing model (a modified version of RFM), this study…
464417 runs - 3 likes - 60 downloads - 63 reach - 28 impact
748 instances - 5 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11261, and it has 748 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
748 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100852, and it has 747 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
747 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100126, and it has 747 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
747 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100834, and it has 747 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
747 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101034, and it has 746 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
746 instances - 1026 features - 0 classes - 0 missing values
pie chart 2
101 runs - 0 likes - 5 downloads - 5 reach - 5 impact
745 instances - 37 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11624, and it has 742 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
742 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11265, and it has 740 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
740 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12742, and it has 737 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
737 instances - 1026 features - 0 classes - 0 missing values
The objective was to determine which seedlots in a species are best for soil conservation in seasonally dry hill country. Determination is found by measurement of height, diameter by height, survival,…
26402 runs - 0 likes - 10 downloads - 10 reach - 1 impact
736 instances - 20 features - 5 classes - 448 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
701 runs - 0 likes - 3 downloads - 3 reach - 7 impact
736 instances - 20 features - 2 classes - 448 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30049, and it has 733 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
733 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101557, and it has 732 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
732 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17120, and it has 731 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
731 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20113, and it has 728 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
728 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 215, and it has 726 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
726 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30001, and it has 717 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
717 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11691, and it has 715 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
715 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10009, and it has 714 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
714 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11085, and it has 712 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
712 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30037, and it has 711 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
711 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101220, and it has 709 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
709 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10183, and it has 706 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
706 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11869, and it has 705 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
705 instances - 1026 features - 0 classes - 0 missing values
pie chart 1
102 runs - 0 likes - 5 downloads - 5 reach - 5 impact
705 instances - 38 features - 2 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-700 * Abstract: AutoUniv is an advanced data generator for classification tasks. The aim is to reflect the nuances and heterogeneity of…
4537 runs - 0 likes - 7 downloads - 7 reach - 19 impact
700 instances - 13 features - 3 classes - 0 missing values
The current dataset was adapted to ARFF format from the UCI version. Sample code IDs were removed. Note that there is also a related Breast Cancer Wisconsin (Diagnosis) Data Set with a different set of…
25224 runs - 1 likes - 18 downloads - 19 reach - 1 impact
699 instances - 10 features - 2 classes - 16 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20007, and it has 694 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
694 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 13004, and it has 692 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
692 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11006, and it has 692 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
692 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12514, and it has 692 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
692 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 112, and it has 692 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
692 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10056, and it has 690 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
690 instances - 1026 features - 0 classes - 0 missing values
This dataset was retrieved 2014-11-14 from the UCI site and converted to the ARFF format. Major changes w.r.t. version 3: dataset from UCI that matches description and data types. Feature…
4196 runs - 0 likes - 4 downloads - 4 reach - 5 impact
690 instances - 15 features - 2 classes - 0 missing values
No data.
414 runs - 0 likes - 8 downloads - 8 reach - 51 impact
690 instances - 8262 features - 10 classes - 0 missing values
This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect the confidentiality of the data. This dataset is interesting because…
24246 runs - 1 likes - 28 downloads - 29 reach - 2 impact
690 instances - 16 features - 2 classes - 67 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11154, and it has 688 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
688 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11629, and it has 688 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
688 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 146, and it has 683 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
683 instances - 1026 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
722 runs - 0 likes - 6 downloads - 6 reach - 7 impact
683 instances - 36 features - 2 classes - 2337 missing values
This is the large soybean database from the UCI repository, with its training and test database combined into a single file. There are 19 classes, only the first 15 of which have been used in prior…
40718 runs - 0 likes - 50 downloads - 50 reach - 2 impact
683 instances - 36 features - 19 classes - 2337 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10477, and it has 682 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
682 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30035, and it has 678 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
678 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100789, and it has 676 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
676 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101269, and it has 675 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
675 instances - 1026 features - 0 classes - 0 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
15927 runs - 0 likes - 19 downloads - 19 reach - 17 impact
672 instances - 10 features - 2 classes - 1200 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17074, and it has 671 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
671 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101281, and it has 670 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
670 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10329, and it has 669 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
669 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12968, and it has 668 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
668 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11288, and it has 665 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
665 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12226, and it has 665 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
665 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103106, and it has 664 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
664 instances - 1026 features - 0 classes - 0 missing values
Data Used in "A BAYESIAN APPROACH TO DATA DISCLOSURE: OPTIMAL INTRUDER BEHAVIOR FOR CONTINUOUS DATA" by Stephen E. Fienberg, Udi E. Makov, and Ashish P. Sanil. Background: In this paper we…
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
662 instances - 4 features - 0 classes - 0 missing values
Data Used in "A BAYESIAN APPROACH TO DATA DISCLOSURE: OPTIMAL INTRUDER BEHAVIOR FOR CONTINUOUS DATA" by Stephen E. Fienberg, Udi E. Makov, and Ashish P. Sanil. Background: In this paper we…
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
662 instances - 4 features - 0 classes - 0 missing values
Data Used in "A BAYESIAN APPROACH TO DATA DISCLOSURE: OPTIMAL INTRUDER BEHAVIOR FOR CONTINUOUS DATA" by Stephen E. Fienberg, Udi E. Makov, and Ashish P. Sanil. Background: In this paper we…
0 runs - 0 likes - 1 downloads - 1 reach - 5 impact
662 instances - 4 features - 0 classes - 0 missing values
Data Used in "A BAYESIAN APPROACH TO DATA DISCLOSURE: OPTIMAL INTRUDER BEHAVIOR FOR CONTINUOUS DATA" by Stephen E. Fienberg, Udi E. Makov, and Ashish P. Sanil. Background: In this paper we…
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
662 instances - 4 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10396, and it has 662 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
662 instances - 1026 features - 0 classes - 0 missing values
Multi-label dataset. The genbase dataset contains protein sequences that can be assigned to several classes of protein families.
0 runs - 0 likes - 1 downloads - 1 reach - 3 impact
662 instances - 1213 features - 2 classes - 0 missing values
Multi-label dataset. The genbase dataset contains protein sequences that can be assigned to several classes of protein families.
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
662 instances - 1212 features - classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
813 runs - 0 likes - 7 downloads - 7 reach - 7 impact
662 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
792 runs - 0 likes - 7 downloads - 7 reach - 7 impact
662 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
802 runs - 0 likes - 8 downloads - 8 reach - 7 impact
662 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
739 runs - 0 likes - 6 downloads - 6 reach - 7 impact
662 instances - 4 features - 2 classes - 0 missing values
Pizza cutter
197 runs - 0 likes - 8 downloads - 8 reach - 6 impact
661 instances - 38 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 19623, and it has 656 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
656 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30045, and it has 655 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
655 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101349, and it has 652 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
652 instances - 1026 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs - 0 likes - 0 downloads - 0 reach - 5 impact
649 instances - 3 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12735, and it has 646 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
646 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30002, and it has 646 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
646 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10653, and it has 645 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
645 instances - 1026 features - 0 classes - 0 missing values
Multi-label dataset. The birds dataset consists of 327 audio recordings of 12 different vocalizing bird species. Each sound can be assigned to various bird species.
0 runs - 0 likes - 4 downloads - 4 reach - 3 impact
645 instances - 279 features - 2 classes - 0 missing values
Multi-label dataset. The birds dataset consists of 327 audio recordings of 12 different vocalizing bird species. Each sound can be assigned to various bird species.
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
645 instances - 279 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10262, and it has 639 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
639 instances - 1026 features - 0 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : This is a pre-processed version of the dataset used in Kaggle's Online Product Sales competition…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
639 instances - 413 features - classes - 10012 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12265, and it has 636 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
636 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11105, and it has 631 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
631 instances - 1026 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2841 runs - 0 likes - 3 downloads - 3 reach - 16 impact
630 instances - 10937 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100854, and it has 625 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
625 instances - 1026 features - 0 classes - 0 missing values
The data consist of annual observations on the level of strike volume (days lost due to industrial disputes per 1000 wage salary earners), and their covariates in 18 OECD countries from 1951-1985. The…
0 runs - 0 likes - 2 downloads - 2 reach - 5 impact
625 instances - 7 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100410, and it has 625 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
625 instances - 1026 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
777 runs - 0 likes - 8 downloads - 8 reach - 7 impact
625 instances - 5 features - 2 classes - 0 missing values
This data set was generated to model psychological experimental results. Each example is classified as having the balance scale tip to the right, tip to the left, or be balanced. The attributes are…
26795 runs - 2 likes - 15 downloads - 17 reach - 3 impact
625 instances - 5 features - 3 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
780 runs - 0 likes - 10 downloads - 10 reach - 7 impact
625 instances - 7 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12786, and it has 624 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
624 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10547, and it has 622 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
622 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10850, and it has 622 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 3 impact
622 instances - 1026 features - 0 classes - 0 missing values