OpenML
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30010, and it has 82 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
82 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11036, and it has 396 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
396 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17084, and it has 1863 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
1863 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 227, and it has 1238 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
1238 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11017, and it has 1211 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
1211 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20157, and it has 63 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
63 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12238, and it has 30 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
30 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100426, and it has 123 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
123 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100590, and it has 11 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
11 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101055, and it has 81 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
81 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 275, and it has 477 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
477 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12000, and it has 366 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
366 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12894, and it has 483 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
483 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101464, and it has 1044 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
1044 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12641, and it has 37 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
37 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20122, and it has 101 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
101 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103451, and it has 27 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
27 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30022, and it has 81 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
81 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17073, and it has 391 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
391 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12587, and it has 157 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
157 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12163, and it has 285 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
285 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10909, and it has 111 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
111 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11869, and it has 705 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
705 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 13024, and it has 13 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
13 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100427, and it has 77 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
77 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12788, and it has 50 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
50 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12703, and it has 226 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
226 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100992, and it has 83 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
83 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100918, and it has 88 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
88 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11269, and it has 614 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
614 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20154, and it has 1024 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
1024 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11105, and it has 631 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
631 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12131, and it has 111 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
111 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20023, and it has 60 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
60 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100962, and it has 35 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
35 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11199, and it has 104 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
104 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12787, and it has 10 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
10 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12227, and it has 1510 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
1510 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11942, and it has 1002 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
1002 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11868, and it has 519 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
519 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12090, and it has 1312 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
1312 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20067, and it has 29 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
29 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10960, and it has 70 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
70 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10958, and it has 107 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
107 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100895, and it has 21 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
21 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12971, and it has 188 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
188 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103446, and it has 73 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
73 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 262, and it has 429 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
429 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100579, and it has 564 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
564 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10919, and it has 386 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
386 instances - 1026 features - 0 classes - 0 missing values
SVHN is a real-world image dataset for developing machine learning and object recognition algorithms with minimal requirement on data preprocessing and formatting. It can be seen as similar in flavor…
52 runs - 1 likes - 2 downloads - 3 reach - 16 impact
99289 instances - 3073 features - 10 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11780, and it has 50 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
50 instances - 1026 features - 0 classes - 0 missing values
The data was collected retrospectively at Wroclaw Thoracic Surgery Centre for patients who underwent major lung resections for primary lung cancer in the years 2007 - 2011. The Centre is associated…
31 runs - 0 likes - 5 downloads - 5 reach - 12 impact
470 instances - 17 features - 2 classes - 0 missing values
Abstract: CART book's waveform domains. Source: Original Owners: Breiman, L., Friedman, J.H., Olshen, R.A., & Stone, C.J. (1984). Classification and Regression Trees. Wadsworth International Group:…
0 runs - 2 likes - 6 downloads - 8 reach - 11 impact
5000 instances - 22 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12171, and it has 344 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
344 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 116, and it has 794 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
794 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103106, and it has 664 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
664 instances - 1026 features - 0 classes - 0 missing values
One of the biggest challenges for an auto dealership purchasing a used car at an auto auction is the risk that the vehicle might have serious issues that prevent it from being sold to customers. The…
3 runs - 0 likes - 3 downloads - 3 reach - 13 impact
72983 instances - 33 features - 2 classes - 149271 missing values
Modified version for the AutoML benchmark. Gathers information on about 7800 different US colleges, including geographical information, stats about the population attending and post graduation…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7063 instances - 45 features - 0 classes - 104249 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-300-drift-au7-cpd1-800 * Abstract: AutoUniv is an advanced data generator for classification tasks. The aim is to reflect the nuances and…
7130 runs - 0 likes - 12 downloads - 12 reach - 35 impact
1100 instances - 13 features - 5 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11891, and it has 121 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
121 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11499, and it has 12 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
12 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17024, and it has 373 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
373 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101058, and it has 594 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
594 instances - 1026 features - 0 classes - 0 missing values
kaggle 30day ml
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
300000 instances - 25 features - 0 classes - 0 missing values
Human Activity Recognition (HAR) database built from the recordings of 30 subjects performing activities of daily living (ADL) while carrying a waist-mounted smartphone with embedded inertial sensors.…
24379 runs - 1 likes - 26 downloads - 27 reach - 42 impact
10299 instances - 562 features - 6 classes - 0 missing values
1. Summary: This database was generated by the Laboratory of Image Processing and Pattern Recognition (INPG-LTIRF) in the development of the Esprit project ELENA No. 6891 and the Esprit working…
20234 runs - 0 likes - 13 downloads - 13 reach - 18 impact
5500 instances - 41 features - 11 classes - 0 missing values
Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative learning. The…
9545 runs - 0 likes - 0 downloads - 0 reach - 21 impact
1080 instances - 82 features - 8 classes - 1396 missing values
__Changes w.r.t. version 1: included one target factor with 7 levels as target variable for the classification. Also deleted the previous 7 binary target variables.__ A dataset of steel plates'…
9051 runs - 1 likes - 3 downloads - 4 reach - 15 impact
1941 instances - 28 features - 7 classes - 0 missing values
#### Abstract: MADELON is an artificial dataset, which was part of the NIPS 2003 feature selection challenge. This is a two-class classification problem with continuous input variables. The difficulty…
100900 runs - 0 likes - 20 downloads - 20 reach - 26 impact
2600 instances - 501 features - 2 classes - 0 missing values
Forecasting skewed biased stochastic ozone days: analyses, solutions and beyond, Knowledge and Information Systems, Vol. 14, No. 3, 2008. 1. Abstract: Two ground ozone level data sets are included in…
187959 runs - 1 likes - 20 downloads - 21 reach - 28 impact
2534 instances - 73 features - 2 classes - 0 missing values
Source: James P Bridge, Sean B Holden and Lawrence C Paulson University of Cambridge Computer Laboratory William Gates Building 15 JJ Thomson Avenue Cambridge CB3 0FD UK +44 (0)1223 763500…
26339 runs - 1 likes - 21 downloads - 22 reach - 43 impact
6118 instances - 52 features - 6 classes - 0 missing values
Author: Volker Lohweg (University of Applied Sciences, Ostwestfalen-Lippe) Source: [UCI](https://archive.ics.uci.edu/ml/datasets/banknote+authentication) - 2012 Please cite:…
137717 runs - 5 likes - 39 downloads - 44 reach - 30 impact
1372 instances - 5 features - 2 classes - 0 missing values
Predict a biological response of molecules from their chemical properties. Each row in this data set represents a molecule. The first column contains experimental data describing an actual biological…
48340 runs - 2 likes - 38 downloads - 40 reach - 34 impact
3751 instances - 1777 features - 2 classes - 0 missing values
The data were collected as the SCITOS G5 robot navigates through the room following the wall in a clockwise direction, for 4 rounds, using 24 ultrasound sensors arranged circularly around its 'waist'.…
25211 runs - 0 likes - 22 downloads - 22 reach - 34 impact
5456 instances - 25 features - 4 classes - 0 missing values
0. airplane 1. automobile 2. bird 3. cat 4. deer 5. dog 6. frog 7. horse 8. ship 9. truck CIFAR-10 contains 6000 images per class. The original train-test split randomly divided these into 5000 train…
160 runs - 0 likes - 6 downloads - 6 reach - 21 impact
60000 instances - 3073 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. The maps were scanned in 8-bit grey value at a density of 400 dpi,…
11351 runs - 1 likes - 2 downloads - 3 reach - 22 impact
2000 instances - 241 features - 10 classes - 0 missing values
__Changes w.r.t. version 1: renamed variables such that they match description.__ ### Dataset: Wilt Data Set ### Abstract: High-resolution Remote Sensing data set (Quickbird). Small number of training…
10966 runs - 0 likes - 2 downloads - 2 reach - 21 impact
4839 instances - 6 features - 2 classes - 0 missing values
### Description __Changes to version 1:__ all categorical features transformed as such. This dataset represents a set of possible advertisements on Internet pages. ### Sources (a) Creator and donor:…
1432 runs - 0 likes - 5 downloads - 5 reach - 23 impact
3279 instances - 1559 features - 2 classes - 0 missing values
Over 92 thousand images (32x32 pixels) of 46 characters from Devanagari script. Includes the alphabet as well as the numbers. Devanagari is an Indic script and forms a basis for over 100 languages…
43 runs - 2 likes - 8 downloads - 10 reach - 14 impact
92000 instances - 1025 features - 46 classes - 0 missing values
### Description This is a data set containing 1080 documents of free text business descriptions of Brazilian companies categorized into a subset of 9 categories. ### Source ``` Patrick Marques…
34164 runs - 0 likes - 17 downloads - 17 reach - 56 impact
1080 instances - 857 features - 9 classes - 0 missing values
This is the original version of the famous covertype dataset in ARFF format. Predicting forest cover type from cartographic variables only (no remotely sensed data). The actual forest cover type for a…
9 runs - 1 likes - 14 downloads - 15 reach - 24 impact
581012 instances - 55 features - 7 classes - 0 missing values
A dataset relating characteristics of telephony account features and usage and whether or not the customer churned. Originally used in [Discovering Knowledge in Data: An Introduction to Data…
7512 runs - 2 likes - 9 downloads - 11 reach - 25 impact
5000 instances - 21 features - 2 classes - 0 missing values
This database was derived from a simple hierarchical decision model originally developed for the demonstration of DEX (M. Bohanec, V. Rajkovic: Expert system for decision making. Sistemica 1(1), pp.…
7180 runs - 0 likes - 11 downloads - 11 reach - 24 impact
1728 instances - 7 features - 4 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
3 runs - 0 likes - 2 downloads - 2 reach - 18 impact
10000 instances - 2001 features - 5 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
3 runs - 0 likes - 0 downloads - 0 reach - 18 impact
8237 instances - 801 features - 7 classes - 0 missing values
1. Data set title: Nomao Data Set 2. Abstract: Nomao collects data about places (name, phone, localization...) from many sources. Deduplication consists in detecting what data refer to the same place.…
67399 runs - 0 likes - 16 downloads - 16 reach - 28 impact
34465 instances - 119 features - 2 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
4 runs - 0 likes - 2 downloads - 2 reach - 18 impact
2984 instances - 145 features - 2 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
4 runs - 0 likes - 2 downloads - 2 reach - 19 impact
5124 instances - 21 features - 2 classes - 0 missing values
shuttle-pmlb
10 runs - 0 likes - 4 downloads - 4 reach - 24 impact
58000 instances - 10 features - 7 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
10 runs - 0 likes - 1 downloads - 1 reach - 20 impact
20000 instances - 4297 features - 2 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
11 runs - 1 likes - 1 downloads - 2 reach - 20 impact
20000 instances - 4297 features - 2 classes - 0 missing values
Fashion-MNIST is a dataset of Zalando's article images, consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a…
453 runs - 0 likes - 12 downloads - 12 reach - 26 impact
70000 instances - 785 features - 10 classes - 0 missing values
This is the dataset used for the 2016 IDA Industrial Challenge, courtesy of Scania. For a full description, see http://archive.ics.uci.edu/ml/datasets/IDA2016Challenge . This dataset contains both the…
7 runs - 0 likes - 2 downloads - 2 reach - 18 impact
76000 instances - 171 features - 2 classes - 1078695 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
13 runs - 0 likes - 1 downloads - 1 reach - 19 impact
58310 instances - 181 features - 10 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12965, and it has 74 rows and 1026 features (including…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
74 instances - 1026 features - 0 classes - 0 missing values
simple engine data
52 runs - 0 likes - 6 downloads - 6 reach - 12 impact
383 instances - 6 features - 3 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
12 runs - 0 likes - 1 downloads - 1 reach - 19 impact
83733 instances - 55 features - 4 classes - 0 missing values
This dataset is taken from the MiniBooNE experiment and is used to distinguish electron neutrinos (signal) from muon neutrinos (background). This dataset is ordered. It first contains all signal…
12 runs - 0 likes - 4 downloads - 4 reach - 13 impact
130064 instances - 51 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100862, and it has 177 rows and 1026 features…
1 runs - 0 likes - 1 downloads - 1 reach - 11 impact
177 instances - 1026 features - 0 classes - 0 missing values