OpenML
Filter results by:
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
726 runs0 likes9 downloads9 reach7 impact
576 instances - 12 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12659, and it has 577 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
577 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11018, and it has 579 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
579 instances - 1026 features - 0 classes - 0 missing values
This data set contains 416 liver patient records and 167 non liver patient records.The data set was collected from north east of Andhra Pradesh, India. The class label divides the patients into 2…
154027 runs0 likes21 downloads21 reach18 impact
583 instances - 11 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101408, and it has 583 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
583 instances - 1026 features - 0 classes - 0 missing values
The ILPD dataset from the OpenCC18 with all categorical variables label encoded
0 runs0 likes0 downloads0 reach0 impact
583 instances - 11 features - 0 classes - 0 missing values
The ILPD liver dataset from the OpenCC18 with the gender binary encoded so all features are numeric
0 runs0 likes0 downloads0 reach1 impact
583 instances - 11 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 19904, and it has 584 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
584 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11019, and it has 592 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
592 instances - 1026 features - 0 classes - 0 missing values
Multi-label dataset. Audio dataset (emotions) consists of 593 musical files with 6 clustered emotional labels and 72 predictors. Each song can be labeled with one or more of the labels…
0 runs2 likes5 downloads7 reach3 impact
593 instances - 78 features - 2 classes - 0 missing values
Multi-label dataset. Audio dataset (emotions) consists of 593 musical files with 6 clustered emotional labels and 72 predictors. Each song can be labeled with one or more of the labels…
0 runs0 likes0 downloads0 reach1 impact
593 instances - 78 features - classes - 0 missing values
Multi-label dataset. Audio dataset (emotions) consists of 593 musical files with 6 clustered emotional labels and 72 predictors. Each song can be labeled with one or more of the labels…
0 runs0 likes0 downloads0 reach1 impact
593 instances - 78 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101058, and it has 594 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
594 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12738, and it has 596 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
596 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10982, and it has 600 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
600 instances - 1026 features - 0 classes - 0 missing values
DEXTER is a text classification problem in a bag-of-word representation. This is a two-class classification problem with sparse continuous input variables. This dataset is one of five datasets of the…
0 runs0 likes5 downloads5 reach11 impact
600 instances - 20001 features - 2 classes - 0 missing values
Binarized version of the isolet dataset (see version 1). Only instances with class labels 1 and 2 from the original dataset are considered.
0 runs0 likes0 downloads0 reach2 impact
600 instances - 618 features - 2 classes - 0 missing values
### Description Synthetic Control Chart Time Series. This is actually time series classification. ### Sources ``` * Original Owner and Donor Dr Robert Alcock rob@skyblue.csd.auth.gr ``` ### Dataset…
20354 runs0 likes10 downloads10 reach40 impact
600 instances - 62 features - 6 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
169 runs0 likes8 downloads8 reach8 impact
600 instances - 62 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
394289 runs0 likes20 downloads20 reach26 impact
601 instances - 7 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2860 runs0 likes5 downloads5 reach16 impact
604 instances - 10937 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11524, and it has 605 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
605 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12627, and it has 612 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
612 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 218, and it has 613 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
613 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11430, and it has 614 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
614 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11269, and it has 614 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
614 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10026, and it has 621 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
621 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10547, and it has 622 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
622 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10850, and it has 622 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
622 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12786, and it has 624 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
624 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100854, and it has 625 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
625 instances - 1026 features - 0 classes - 0 missing values
The data consist of annual observations on the level of strike volume (days lost due to industrial disputes per 1000 wage salary earners), and their covariates in 18 OECD countries from 1951-1985. The…
0 runs0 likes2 downloads2 reach5 impact
625 instances - 7 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100410, and it has 625 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
625 instances - 1026 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
777 runs0 likes8 downloads8 reach7 impact
625 instances - 5 features - 2 classes - 0 missing values
This data set was generated to model psychological experimental results. Each example is classified as having the balance scale tip to the right, tip to the left, or be balanced. The attributes are…
26795 runs2 likes15 downloads17 reach3 impact
625 instances - 5 features - 3 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
780 runs0 likes10 downloads10 reach7 impact
625 instances - 7 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
2841 runs0 likes3 downloads3 reach16 impact
630 instances - 10937 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11105, and it has 631 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
631 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12265, and it has 636 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
636 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10262, and it has 639 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
639 instances - 1026 features - 0 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : This is a pre-processed version of the dataset used in Kaggles Online Product Sales competition…
0 runs0 likes0 downloads0 reach1 impact
639 instances - 413 features - classes - 10012 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10653, and it has 645 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
645 instances - 1026 features - 0 classes - 0 missing values
Multi-label dataset. The birds dataset consists of 327 audio recordings of 12 different vocalizing bird species. Each sound can be assigned to various bird species.
0 runs0 likes4 downloads4 reach3 impact
645 instances - 279 features - 2 classes - 0 missing values
Multi-label dataset. The birds dataset consists of 327 audio recordings of 12 different vocalizing bird species. Each sound can be assigned to various bird species.
0 runs0 likes0 downloads0 reach1 impact
645 instances - 279 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12735, and it has 646 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
646 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30002, and it has 646 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
646 instances - 1026 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach5 impact
649 instances - 3 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101349, and it has 652 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
652 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30045, and it has 655 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
655 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 19623, and it has 656 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
656 instances - 1026 features - 0 classes - 0 missing values
Pizza cutter
197 runs0 likes8 downloads8 reach6 impact
661 instances - 38 features - 2 classes - 0 missing values
Data Used in "A BAYESIAN APPROACH TO DATA DISCLOSURE: OPTIMAL INTRUDER BEHAVIOR FOR CONTINUOUS DATA" by Stephen E. Fienberg, Udi E. Makov, and Ashish P. Sanil Background: ========== In this paper we…
0 runs0 likes0 downloads0 reach5 impact
662 instances - 4 features - 0 classes - 0 missing values
Data Used in "A BAYESIAN APPROACH TO DATA DISCLOSURE: OPTIMAL INTRUDER BEHAVIOR FOR CONTINUOUS DATA" by Stephen E. Fienberg, Udi E. Makov, and Ashish P. Sanil Background: ========== In this paper we…
0 runs0 likes0 downloads0 reach5 impact
662 instances - 4 features - 0 classes - 0 missing values
Data Used in "A BAYESIAN APPROACH TO DATA DISCLOSURE: OPTIMAL INTRUDER BEHAVIOR FOR CONTINUOUS DATA" by Stephen E. Fienberg, Udi E. Makov, and Ashish P. Sanil Background: ========== In this paper we…
0 runs0 likes1 downloads1 reach5 impact
662 instances - 4 features - 0 classes - 0 missing values
Data Used in "A BAYESIAN APPROACH TO DATA DISCLOSURE: OPTIMAL INTRUDER BEHAVIOR FOR CONTINUOUS DATA" by Stephen E. Fienberg, Udi E. Makov, and Ashish P. Sanil Background: ========== In this paper we…
0 runs0 likes0 downloads0 reach5 impact
662 instances - 4 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10396, and it has 662 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
662 instances - 1026 features - 0 classes - 0 missing values
Multi-label dataset. The genbase dataset contains protein sequences that can be assigned to several classes of protein families.
0 runs0 likes1 downloads1 reach3 impact
662 instances - 1213 features - 2 classes - 0 missing values
Multi-label dataset. The genbase dataset contains protein sequences that can be assigned to several classes of protein families.
0 runs0 likes0 downloads0 reach1 impact
662 instances - 1212 features - classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
802 runs0 likes8 downloads8 reach7 impact
662 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
792 runs0 likes7 downloads7 reach7 impact
662 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
739 runs0 likes6 downloads6 reach7 impact
662 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
813 runs0 likes7 downloads7 reach7 impact
662 instances - 4 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103106, and it has 664 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
664 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11288, and it has 665 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
665 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12226, and it has 665 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
665 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12968, and it has 668 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
668 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10329, and it has 669 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
669 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101281, and it has 670 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
670 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17074, and it has 671 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
671 instances - 1026 features - 0 classes - 0 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
15927 runs0 likes19 downloads19 reach17 impact
672 instances - 10 features - 2 classes - 1200 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101269, and it has 675 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
675 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100789, and it has 676 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
676 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30035, and it has 678 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
678 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10477, and it has 682 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
682 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 146, and it has 683 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
683 instances - 1026 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
722 runs0 likes6 downloads6 reach7 impact
683 instances - 36 features - 2 classes - 2337 missing values
This is the large soybean database from the UCI repository, with its training and test database combined into a single file. There are 19 classes, only the first 15 of which have been used in prior…
40716 runs0 likes50 downloads50 reach2 impact
683 instances - 36 features - 19 classes - 2337 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11154, and it has 688 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
688 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11629, and it has 688 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
688 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10056, and it has 690 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
690 instances - 1026 features - 0 classes - 0 missing values
This dataset was retrieved 2014-11-14 from the UCI site and converted to the ARFF format. __Major changes w.r.t. version 3: dataset from UCI that matches description and data types__ ### Feature…
4196 runs0 likes4 downloads4 reach5 impact
690 instances - 15 features - 2 classes - 0 missing values
No data.
414 runs0 likes8 downloads8 reach51 impact
690 instances - 8262 features - 10 classes - 0 missing values
This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect the confidentiality of the data. This dataset is interesting because…
24244 runs1 likes28 downloads29 reach2 impact
690 instances - 16 features - 2 classes - 67 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 13004, and it has 692 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
692 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11006, and it has 692 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
692 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12514, and it has 692 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
692 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 112, and it has 692 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
692 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20007, and it has 694 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
694 instances - 1026 features - 0 classes - 0 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code ID's were removed. ! Note that there is also a related Breast Cancer Wisconsin (Diagnosis) Data Set with a different set of…
25223 runs1 likes18 downloads19 reach1 impact
699 instances - 10 features - 2 classes - 16 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-700 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of…
4537 runs0 likes7 downloads7 reach19 impact
700 instances - 13 features - 3 classes - 0 missing values
pie chart 1
102 runs0 likes5 downloads5 reach5 impact
705 instances - 38 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11869, and it has 705 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
705 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10183, and it has 706 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
706 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101220, and it has 709 rows and 1026 features…
1 runs0 likes1 downloads1 reach3 impact
709 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30037, and it has 711 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
711 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11085, and it has 712 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
712 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10009, and it has 714 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
714 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11691, and it has 715 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
715 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30001, and it has 717 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
717 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 215, and it has 726 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach3 impact
726 instances - 1026 features - 0 classes - 0 missing values