Data
Filter results by:
This directory contains Thyroid datasets. "ann-train.data" contains 3772 learning examples and "ann-test.data" contains 3428 testing examples. I have obtained this data from…
31 runs0 likes2 downloads2 reach4 impact
3772 instances - 22 features - 3 classes - 0 missing values
Attribute information: ``` sick, negative. | classes age: continuous. sex: M, F. on thyroxine: f, t. query on thyroxine: f, t. on antithyroid medication: f, t. sick: f, t. pregnant: f, t. thyroid…
17794 runs0 likes28 downloads28 reach0 impact
3772 instances - 30 features - 2 classes - 6064 missing values
Predict a biological response of molecules from their chemical properties. Each row in this data set represents a molecule. The first column contains experimental data describing an actual biological…
40845 runs1 likes32 downloads33 reach19 impact
3751 instances - 1777 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 137, and it has 3689 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
3689 instances - 1026 features - 0 classes - 0 missing values
"The speech dataset was also provided by (see citation request) and contains real world data from recorded English language. The normal class contains data from persons having an American accent…
1599 runs0 likes4 downloads4 reach8 impact
3686 instances - 401 features - 2 classes - 0 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
10 runs0 likes0 downloads0 reach0 impact
3660 instances - 47 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10434, and it has 3650 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
3650 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 114, and it has 3490 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
3490 instances - 1026 features - 0 classes - 0 missing values
Dataset from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch), which consisted of 5 different datasets (SYLVA, GINA, NOVA, HIVA, ADA). The purpose of the challenge…
65060 runs0 likes19 downloads19 reach17 impact
3468 instances - 971 features - 2 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
396 runs0 likes15 downloads15 reach6 impact
3468 instances - 785 features - 10 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
548 runs0 likes9 downloads9 reach7 impact
3468 instances - 785 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 13000, and it has 3459 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
3459 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 280, and it has 3438 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
3438 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11140, and it has 3429 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
3429 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 51, and it has 3356 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
3356 instances - 1026 features - 0 classes - 0 missing values
### Description __Changes to version 1:__ all categorical features transformed as such. This dataset represents a set of possible advertisements on Internet pages. ### Sources (a) Creator and donor:…
4 runs0 likes2 downloads2 reach6 impact
3279 instances - 1559 features - 2 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: A1 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
262 runs0 likes4 downloads4 reach4 impact
3252 instances - 4 features - 5 classes - 0 missing values
No data.
264 runs0 likes11 downloads11 reach35 impact
3204 instances - 13196 features - 6 classes - 0 missing values
led24-pmlb
31 runs0 likes1 downloads1 reach8 impact
3200 instances - 25 features - 10 classes - 0 missing values
led7-pmlb
31 runs0 likes0 downloads0 reach8 impact
3200 instances - 8 features - 10 classes - 0 missing values
1. Title: Chess End-Game -- King+Rook versus King+Pawn on a7 (usually abbreviated KRKPA7). The pawn on a7 means it is one square away from queening. It is the King+Rook's side (white) to move. 2.…
255589 runs0 likes33 downloads33 reach2 impact
3196 instances - 37 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
135 runs0 likes9 downloads9 reach6 impact
3190 instances - 62 features - 2 classes - 0 missing values
Primate splice-junction gene sequences (DNA) with associated imperfect domain theory. Splice junctions are points on a DNA sequence at which 'superfluous' DNA is removed during the process of protein…
18786 runs1 likes14 downloads15 reach0 impact
3190 instances - 62 features - 3 classes - 0 missing values
Originally from the StatLog project. The raw data is still available on [UCI](https://archive.ics.uci.edu/ml/datasets/Molecular+Biology+(Splice-junction+Gene+Sequences)). The data consists of 3,186…
4282 runs0 likes2 downloads2 reach10 impact
3186 instances - 181 features - 3 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: scaled to [-1,1]
0 runs0 likes0 downloads0 reach4 impact
3175 instances - 61 features - 0 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
0 runs0 likes0 downloads0 reach1 impact
3153 instances - 971 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 133, and it has 3151 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
3151 instances - 1026 features - 0 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
0 runs0 likes0 downloads0 reach1 impact
3140 instances - 260 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10280, and it has 3134 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
3134 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 130, and it has 3133 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
3133 instances - 1026 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
745 runs0 likes10 downloads10 reach6 impact
3107 instances - 7 features - 2 classes - 0 missing values
Geographical Analysis Spatial Data This georeferenced data set was used in: Pace, R. Kelley, and Ronald Barry, Quick Computation of Regressions with a Spatially Autoregressive Dependent Variable,…
0 runs1 likes1 downloads2 reach4 impact
3107 instances - 7 features - 0 classes - 0 missing values
No data.
268 runs0 likes9 downloads9 reach35 impact
3075 instances - 12433 features - 6 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10197, and it has 3058 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
3058 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 19905, and it has 3048 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
3048 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12252, and it has 2998 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2998 instances - 1026 features - 0 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
0 runs0 likes1 downloads1 reach2 impact
2984 instances - 145 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10142, and it has 2872 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2872 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 108, and it has 2869 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
2869 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12512, and it has 2866 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2866 instances - 1026 features - 0 classes - 0 missing values
General Description of Thyroid Disease Databases and Related Files This directory contains 6 databases, corresponding test set, and corresponding documentation. They were left at the University of…
31 runs1 likes8 downloads9 reach3 impact
2800 instances - 27 features - 5 classes - 0 missing values
General Description of Thyroid Disease Databases and Related Files This directory contains 6 databases, corresponding test set, and corresponding documentation. They were left at the University of…
92 runs0 likes5 downloads5 reach4 impact
2800 instances - 27 features - 5 classes - 0 missing values
General Description of Thyroid Disease Databases and Related Files This directory contains 6 databases, corresponding test set, and corresponding documentation. They were left at the University of…
32 runs0 likes7 downloads7 reach3 impact
2800 instances - 27 features - 5 classes - 0 missing values
General Description of Thyroid Disease Databases and Related Files This directory contains 6 databases, corresponding test set, and corresponding documentation. They were left at the University of…
31 runs1 likes8 downloads9 reach3 impact
2800 instances - 27 features - 5 classes - 0 missing values
UCI Thyroid allbp dataset.
97 runs0 likes7 downloads7 reach4 impact
2800 instances - 27 features - 5 classes - 0 missing values
Data on tree growth used in the Case Study published in the September, 1995 issue of the Canadian Journal of Statistics. This data set was been provided by Dr. Fernando Camacho, Ontario Hydro…
14764 runs1 likes14 downloads15 reach30 impact
2796 instances - 35 features - 6 classes - 68100 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 107, and it has 2778 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
2778 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12967, and it has 2756 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2756 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20014, and it has 2625 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2625 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12592, and it has 2615 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2615 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10140, and it has 2615 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2615 instances - 1026 features - 0 classes - 0 missing values
#### Abstract: MADELON is an artificial dataset, which was part of the NIPS 2003 feature selection challenge. This is a two-class classification problem with continuous input variables. The difficulty…
93305 runs0 likes16 downloads16 reach16 impact
2600 instances - 501 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11225, and it has 2585 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2585 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10529, and it has 2578 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2578 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11678, and it has 2577 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2577 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 163, and it has 2570 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
2570 instances - 1026 features - 0 classes - 0 missing values
1. Title: Ozone Level Detection 2. Source: Kun Zhang zhang.kun05 '@' gmail.com Department of Computer Science, Xavier University of Lousiana Wei Fan wei.fan '@' gmail.com IBM T.J.Watson Research…
0 runs0 likes1 downloads1 reach4 impact
2536 instances - 73 features - 0 classes - 0 missing values
Forecasting skewed biased stochastic ozone days: analyses, solutions and beyond, Knowledge and Information Systems, Vol. 14, No. 3, 2008. 1 . Abstract: Two ground ozone level data sets are included in…
178593 runs0 likes12 downloads12 reach16 impact
2534 instances - 73 features - 2 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au4-2500 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of…
4222 runs0 likes7 downloads7 reach18 impact
2500 instances - 101 features - 3 classes - 0 missing values
No data.
426 runs0 likes15 downloads15 reach75 impact
2463 instances - 2001 features - 17 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 250, and it has 2446 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
2446 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11451, and it has 2442 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2442 instances - 1026 features - 0 classes - 0 missing values
Yeast dataset Past Usage: André Elisseeff and Jason Weston. A kernel method for multi-labelled classification. In Thomas G. Dietterich, Susan Becker, and Zoubin Ghahramani, editors, Advances in…
139 runs0 likes8 downloads8 reach5 impact
2417 instances - 117 features - 2 classes - 0 missing values
Multi-label dataset. The yeast dataset (Elisseeff and Weston, 2002) consists of micro-array expression data, as well as phylogenetic profiles of yeast, and includes 2417 genes and 103 predictors. In…
0 runs0 likes2 downloads2 reach2 impact
2417 instances - 117 features - 2 classes - 0 missing values
Multi-label dataset. The yeast dataset (Elisseeff and Weston, 2002) consists of micro-array expression data, as well as phylogenetic profiles of yeast, and includes 2417 genes and 103 predictors. In…
0 runs0 likes0 downloads0 reach0 impact
2417 instances - 117 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10627, and it has 2408 rows and 1026 features…
1 runs0 likes2 downloads2 reach2 impact
2408 instances - 1026 features - 0 classes - 0 missing values
### Description Scene recognition dataset - It contains characteristics about images and their classes. The original dataset is a multi-label classification problem with 6 different labels: {Beach,…
86252 runs0 likes21 downloads21 reach16 impact
2407 instances - 300 features - 2 classes - 0 missing values
Multi-label dataset. The scene dataset is an image classification task where labels like Beach, Mountain, Field, Urban are assigned to each image.
0 runs0 likes12 downloads12 reach2 impact
2407 instances - 300 features - 2 classes - 0 missing values
Multi-label dataset. The scene dataset is an image classification task where labels like Beach, Mountain, Field, Urban are assigned to each image.
0 runs0 likes0 downloads0 reach0 impact
2407 instances - 300 features - classes - 0 missing values
Multi-label dataset. The scene dataset is an image classification task where labels like Beach, Mountain, Field, Urban are assigned to each image.
0 runs0 likes0 downloads0 reach0 impact
2407 instances - 300 features - classes - 0 missing values
"The debutanizer column is part of a desulfuring and naphtha splitter plant." u1 Top temperature u2 Top pressure u3 Reflux flow u4 Flow to next process u5 6th tray temperature u6 Bottom…
0 runs0 likes1 downloads1 reach2 impact
2394 instances - 8 features - 0 classes - 0 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
10 runs0 likes0 downloads0 reach0 impact
2352 instances - 47 features - 2 classes - 0 missing values
### Description ### This dataset is part of a collection datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
12 runs0 likes0 downloads0 reach0 impact
2351 instances - 47 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11024, and it has 2329 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2329 instances - 1026 features - 0 classes - 0 missing values
The instances were drawn randomly from a database of 7 outdoor images. The images were hand-segmented to create a classification for every pixel. Each instance is a 3x3 region. ### Attribute…
19234 runs0 likes22 downloads22 reach0 impact
2310 instances - 20 features - 7 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
772 runs0 likes14 downloads14 reach6 impact
2310 instances - 20 features - 2 classes - 0 missing values
The instances were drawn randomly from a database of 7 outdoor images. The images were hand-segmented to create a classification for every pixel. Each instance is a 3x3 region. __Major changes w.r.t.…
4250 runs0 likes2 downloads2 reach7 impact
2310 instances - 20 features - 7 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 278, and it has 2256 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
2256 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12666, and it has 2243 rows and 1026 features…
1 runs0 likes1 downloads1 reach2 impact
2243 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 188, and it has 2230 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
2230 instances - 1026 features - 0 classes - 0 missing values
These weekly averages are ultimately based on measurements of 4 air samples per hour taken atop intake lines on several towers during steady periods of CO2 concentration of not less than 6 hours per…
0 runs1 likes1 downloads2 reach0 impact
2225 instances - 7 features - 0 classes - 0 missing values
PMLB version of the Titanic dataset, which only uses 3 features. See version 1 for the complete version: https://www.openml.org/d/40945
31 runs0 likes0 downloads0 reach9 impact
2201 instances - 4 features - 2 classes - 0 missing values
File README ----------- smoothmeth A collection of the data sets used in the book "Smoothing Methods in Statistics," by Jeffrey S. Simonoff, Springer-Verlag, New York, 1996. Submitted by Jeff Simonoff…
0 runs0 likes0 downloads0 reach4 impact
2178 instances - 4 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
765 runs0 likes11 downloads11 reach6 impact
2178 instances - 4 features - 2 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
2 runs0 likes1 downloads1 reach0 impact
2178 instances - 4 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 234, and it has 2145 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
2145 instances - 1026 features - 0 classes - 0 missing values
A 3-class version of Cardiotocography dataset.
134 runs0 likes13 downloads13 reach5 impact
2126 instances - 36 features - 3 classes - 0 missing values
2126 fetal cardiotocograms (CTGs) were automatically processed and the respective diagnostic features measured. The CTGs were also classified by three expert obstetricians and a consensus…
23009 runs3 likes25 downloads28 reach47 impact
2126 instances - 36 features - 10 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from software for storage management for receiving and processing ground data. Data comes from McCabe and Halstead features extractors of…
154298 runs2 likes21 downloads23 reach19 impact
2109 instances - 22 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 61, and it has 2076 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
2076 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 90, and it has 2055 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
2055 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10781, and it has 2044 rows and 1026 features…
1 runs0 likes2 downloads2 reach2 impact
2044 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 174, and it has 2027 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach2 impact
2027 instances - 1026 features - 0 classes - 0 missing values
The data consist of 2001 observations taken from a balloon about 30 kilometres above the surface of the earth. In the section of the flight shown here the balloon increases in height. As radiation…
0 runs1 likes2 downloads3 reach4 impact
2001 instances - 3 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
154 runs0 likes9 downloads9 reach6 impact
2001 instances - 3 features - 2 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26672 runs0 likes10 downloads10 reach0 impact
2000 instances - 77 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26665 runs0 likes19 downloads19 reach0 impact
2000 instances - 65 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. The maps were scanned in 8 bit grey value at density of 400dpi,…
22666 runs0 likes17 downloads17 reach0 impact
2000 instances - 241 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
26776 runs0 likes21 downloads21 reach0 impact
2000 instances - 48 features - 10 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
728 runs0 likes7 downloads7 reach6 impact
2000 instances - 241 features - 2 classes - 0 missing values