OpenML
Filter results by:
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100098, and it has 472 rows and 1026 features…
1 runs0 likes1 downloads1 reach11 impact
472 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100856, and it has 471 rows and 1026 features…
1 runs0 likes1 downloads1 reach11 impact
471 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100069, and it has 471 rows and 1026 features…
1 runs0 likes1 downloads1 reach11 impact
471 instances - 1026 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
65 runs0 likes4 downloads4 reach15 impact
470 instances - 10936 features - 2 classes - 0 missing values
* Title: Thoracic Surgery Data Data Set * Abstract: The data is dedicated to classification problem related to the post-operative life expectancy in the lung cancer patients: class 1 - death within…
145 runs0 likes8 downloads8 reach14 impact
470 instances - 17 features - 2 classes - 0 missing values
The data was collected retrospectively at Wroclaw Thoracic Surgery Centre for patients who underwent major lung resections for primary lung cancer in the years 2007 - 2011. The Centre is associated…
31 runs0 likes5 downloads5 reach12 impact
470 instances - 17 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12708, and it has 469 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
469 instances - 1026 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach13 impact
468 instances - 3 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
65 runs0 likes4 downloads4 reach15 impact
468 instances - 10937 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes1 downloads1 reach13 impact
468 instances - 4 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100431, and it has 468 rows and 1026 features…
1 runs0 likes1 downloads1 reach11 impact
468 instances - 1026 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
771 runs0 likes9 downloads9 reach15 impact
468 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
903 runs0 likes8 downloads8 reach15 impact
468 instances - 3 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10624, and it has 466 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
466 instances - 1026 features - 0 classes - 0 missing values
* Title: South Africa Heart Disease Dataset * Description A retrospective sample of males in a heart-disease high-risk region of the Western Cape, South Africa. There are roughly two controls per case…
155 runs0 likes14 downloads14 reach14 impact
462 instances - 10 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11871, and it has 462 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
462 instances - 1026 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
67 runs0 likes2 downloads2 reach15 impact
458 instances - 10936 features - 2 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. The specific type of software is unknown. Data comes from McCabe and Halstead features extractors of source code. These features were defined in…
777 runs0 likes9 downloads9 reach15 impact
458 instances - 40 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10472, and it has 455 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
455 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10702, and it has 455 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
455 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 76, and it has 454 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
454 instances - 1026 features - 0 classes - 0 missing values
The aim is to determine the type of arrhythmia from the ECG recordings. This database contains 279 attributes, 206 of which are linear valued and the rest are nominal. Concerning the study of H. Altay…
4430 runs0 likes50 downloads50 reach15 impact
452 instances - 280 features - 13 classes - 408 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs1 likes5 downloads6 reach15 impact
452 instances - 280 features - 2 classes - 408 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach13 impact
450 instances - 4 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
768 runs0 likes7 downloads7 reach15 impact
450 instances - 4 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10273, and it has 449 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
449 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101014, and it has 448 rows and 1026 features…
1 runs0 likes1 downloads1 reach11 impact
448 instances - 1026 features - 0 classes - 0 missing values
It is a public set of comments collected for spam research. It has five datasets composed by 1,956 real messages extracted from five videos that were among the 10 most viewed on the collection period.…
0 runs0 likes0 downloads0 reach0 impact
448 instances - 5 features - classes - 245 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10927, and it has 446 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
446 instances - 1026 features - 0 classes - 0 missing values
Michel Lang fRMA-normalized. Only "Kratz-genes"*. \* (see: A practical molecular assay to predict survival in resected non-squamous, non-small-cell lung cancer: development and international…
3 runs0 likes3 downloads3 reach12 impact
442 instances - 24 features - 0 classes - 0 missing values
.. _diabetes_dataset: Diabetes dataset ---------------- Ten baseline variables, age, sex, body mass index, average blood pressure, and six blood serum measurements were obtained for each of n = 442…
0 runs0 likes0 downloads0 reach13 impact
442 instances - 11 features - 0 classes - 0 missing values
.. _diabetes_dataset: Diabetes dataset ---------------- Ten baseline variables, age, sex, body mass index, average blood pressure, and six blood serum measurements were obtained for each of n = 442…
0 runs0 likes0 downloads0 reach13 impact
442 instances - 11 features - 0 classes - 0 missing values
.. _diabetes_dataset: Diabetes dataset ---------------- Ten baseline variables, age, sex, body mass index, average blood pressure, and six blood serum measurements were obtained for each of n = 442…
0 runs0 likes0 downloads0 reach13 impact
442 instances - 11 features - 0 classes - 0 missing values
.. _diabetes_dataset: Diabetes dataset ---------------- Ten baseline variables, age, sex, body mass index, average blood pressure, and six blood serum measurements were obtained for each of n = 442…
0 runs0 likes0 downloads0 reach13 impact
442 instances - 11 features - 0 classes - 0 missing values
.. _diabetes_dataset: Diabetes dataset ---------------- Ten baseline variables, age, sex, body mass index, average blood pressure, and six blood serum measurements were obtained for each of n = 442…
0 runs0 likes0 downloads0 reach13 impact
442 instances - 11 features - 0 classes - 0 missing values
.. _diabetes_dataset: Diabetes dataset ---------------- Ten baseline variables, age, sex, body mass index, average blood pressure, and six blood serum measurements were obtained for each of n = 442…
0 runs0 likes1 downloads1 reach13 impact
442 instances - 11 features - 0 classes - 0 missing values
* Title: Wholesale customers Data Set * Abstract: The data set refers to clients of a wholesale distributor. It includes the annual spending in monetary units (m.u.) on diverse product categories *…
161 runs0 likes11 downloads11 reach14 impact
440 instances - 9 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17081, and it has 440 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
440 instances - 1026 features - 0 classes - 0 missing values
It is a public set of comments collected for spam research. It has five datasets composed by 1,956 real messages extracted from five videos that were among the 10 most viewed on the collection period.…
0 runs0 likes0 downloads0 reach0 impact
438 instances - 5 features - classes - 0 missing values
1. Title: 1984 United States Congressional Voting Records Database 2. Source Information: (a) Source: Congressional Quarterly Almanac, 98th Congress, 2nd session 1984, Volume XL: Congressional…
2262 runs0 likes17 downloads17 reach9 impact
435 instances - 17 features - 2 classes - 392 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11379, and it has 429 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
429 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 262, and it has 429 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
429 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10878, and it has 427 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
427 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 251, and it has 426 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
426 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10574, and it has 422 rows and 1026 features (including…
1 runs0 likes2 downloads2 reach11 impact
422 instances - 1026 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
78 runs0 likes4 downloads4 reach15 impact
421 instances - 10936 features - 2 classes - 0 missing values
Arbres urbains
0 runs0 likes0 downloads0 reach0 impact
421 instances - 3 features - 1 classes - 0 missing values
------------------------------------------------------------------------ Primary Biliary Cirrhosis The data set found in appendix D of Fleming and Harrington, Counting Processes and Survival Analysis,…
18 runs1 likes3 downloads4 reach14 impact
418 instances - 20 features - 0 classes - 1033 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Case number deleted. X treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric…
10 runs0 likes1 downloads1 reach12 impact
418 instances - 19 features - 0 classes - 1239 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
723 runs0 likes5 downloads5 reach15 impact
418 instances - 19 features - 2 classes - 1239 missing values
No data.
219 runs0 likes5 downloads5 reach21 impact
414 instances - 6430 features - 9 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
77 runs0 likes3 downloads3 reach15 impact
413 instances - 10936 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11402, and it has 413 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
413 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11785, and it has 413 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
413 instances - 1026 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
65 runs0 likes3 downloads3 reach15 impact
412 instances - 10936 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 168, and it has 412 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
412 instances - 1026 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1187 runs1 likes10 downloads11 reach9 impact
412 instances - 9 features - 7 classes - 96 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
721 runs0 likes5 downloads5 reach15 impact
412 instances - 9 features - 2 classes - 96 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11636, and it has 411 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
411 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 236, and it has 411 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
411 instances - 1026 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
65 runs0 likes2 downloads2 reach15 impact
410 instances - 10936 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12128, and it has 407 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
407 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12471, and it has 406 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
406 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12067, and it has 406 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
406 instances - 1026 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
718 runs0 likes6 downloads6 reach15 impact
406 instances - 9 features - 2 classes - 14 missing values
The Committee on Statistical Graphics of the American Statistical Association (ASA) invites you to participate in its Second (1983) Exposition of Statistical Graphics Technology. The purposes of the…
164 runs0 likes4 downloads4 reach14 impact
406 instances - 8 features - 3 classes - 14 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
82 runs0 likes5 downloads5 reach15 impact
405 instances - 10936 features - 2 classes - 0 missing values
The YouTube personality dataset consists of a collection of behavorial features, speech transcriptions, and personality impression scores for a set of 404 YouTube vloggers that explicitly show…
0 runs0 likes1 downloads1 reach9 impact
404 instances - 31 features - classes - 0 missing values
The YouTube personality dataset consists of a collection of behavorial features, speech transcriptions, and personality impression scores for a set of 404 YouTube vloggers that explicitly show…
0 runs0 likes0 downloads0 reach9 impact
404 instances - 31 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 148, and it has 404 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
404 instances - 1026 features - 0 classes - 0 missing values
* Title: User Knowledge Modeling Data Set * Abstract: It is the real dataset about the students' knowledge status about the subject of Electrical DC Machines. The dataset had been obtained from Ph.D.…
153 runs1 likes8 downloads9 reach13 impact
403 instances - 6 features - 5 classes - 0 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable,…
765 runs0 likes10 downloads10 reach15 impact
403 instances - 38 features - 2 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach13 impact
400 instances - 8 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
22 runs0 likes2 downloads2 reach14 impact
400 instances - 8 features - 0 classes - 0 missing values
This dataset contains a set of face images taken between April 1992 and April 1994 at AT&T Laboratories Cambridge. As described on the original website: There are ten different images of each of 40…
53 runs0 likes0 downloads0 reach15 impact
400 instances - 4097 features - 40 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach13 impact
400 instances - 7 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach13 impact
400 instances - 8 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach13 impact
400 instances - 8 features - 0 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-cd1-400 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity…
144 runs0 likes3 downloads3 reach13 impact
400 instances - 41 features - 8 classes - 0 missing values
This dataset can be used to predict the chronic kidney disease and it can be collected from the hospital nearly 2 months of period. ### Attribute information We use 24 + class = 25 ( 11 numeric ,14…
0 runs0 likes0 downloads0 reach0 impact
400 instances - 26 features - classes - 1009 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
581 runs0 likes5 downloads5 reach14 impact
400 instances - 6 features - 4 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
791 runs0 likes7 downloads7 reach15 impact
400 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
764 runs0 likes6 downloads6 reach15 impact
400 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
779 runs0 likes7 downloads7 reach15 impact
400 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
788 runs0 likes7 downloads7 reach15 impact
400 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
817 runs0 likes8 downloads8 reach15 impact
400 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
757 runs0 likes8 downloads8 reach15 impact
400 instances - 6 features - 2 classes - 0 missing values
calendarDOW-pmlb
31 runs0 likes1 downloads1 reach21 impact
399 instances - 33 features - 5 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30028, and it has 399 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
399 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101598, and it has 399 rows and 1026 features…
1 runs0 likes1 downloads1 reach11 impact
399 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10116, and it has 399 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
399 instances - 1026 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Identifier attribute deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
2 runs0 likes2 downloads2 reach12 impact
398 instances - 8 features - 0 classes - 6 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
705 runs0 likes5 downloads5 reach15 impact
398 instances - 8 features - 2 classes - 6 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10577, and it has 397 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
397 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10244, and it has 396 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
396 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11036, and it has 396 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
396 instances - 1026 features - 0 classes - 0 missing values
This data approach student achievement in secondary education of two Portuguese schools. The data attributes include student grades, demographic, social and school related features) and it was…
0 runs0 likes1 downloads1 reach8 impact
395 instances - 33 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12862, and it has 395 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
395 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11680, and it has 395 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
395 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12415, and it has 395 rows and 1026 features (including…
1 runs0 likes1 downloads1 reach11 impact
395 instances - 1026 features - 0 classes - 0 missing values