Data
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
475 instances - 4 features - 0 classes - 0 missing values
Information about the dataset CLASSTYPE: numeric CLASSINDEX: last
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
559 instances - 5 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
80 instances - 113 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
34 instances - 1143 features - 0 classes - 0 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
145 instances - 95 features - 0 classes - 0 missing values
Data Used in "A BAYESIAN APPROACH TO DATA DISCLOSURE: OPTIMAL INTRUDER BEHAVIOR FOR CONTINUOUS DATA" by Stephen E. Fienberg, Udi E. Makov, and Ashish P. Sanil. Background: In this paper we…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
662 instances - 4 features - 0 classes - 0 missing values
Datasets from the Data and Story Library, a project illustrating the use of basic statistical methods, converted to ARFF format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
2 runs - 0 likes - 0 downloads - 0 reach - 14 impact
59 instances - 16 features - 0 classes - 0 missing values
source: http://www.cs.ubc.ca/labs/beta/Projects/SATzilla/ authors: L. Xu, F. Hutter, H. Hoos, K. Leyton-Brown translator in coseal format: M. Lindauer with the help of Alexandre Frechette the data do…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
296 instances - 116 features - 14 classes - 1810 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
0 runs - 0 likes - 0 downloads - 0 reach - 10 impact
51839 instances - 1569 features - 43 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
0 runs - 0 likes - 0 downloads - 0 reach - 10 impact
51839 instances - 1569 features - 43 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
1 runs - 0 likes - 0 downloads - 0 reach - 12 impact
51839 instances - 2917 features - 43 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
1 runs - 0 likes - 0 downloads - 0 reach - 10 impact
51839 instances - 257 features - 43 classes - 0 missing values
sdwd dede
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
44 instances - 2 features - classes - 0 missing values
At Santander our mission is to help people and businesses prosper. We are always looking for ways to help our customers understand their financial health and identify which products and services might…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
200000 instances - 202 features - 2 classes - 0 missing values
Titanic survival prediction
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
891 instances - 8 features - classes - 0 missing values
swd dced
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
589 instances - 3 features - classes - 0 missing values
sdsw frfr
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
1556 instances - 3 features - classes - 0 missing values
efe rgrg
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
e fvr
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 11 features - classes - 0 missing values
dd fgrfg
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 3 features - classes - 0 missing values
efef ffrf
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
9 instances - 3 features - classes - 0 missing values
Identify jets of particles from the LHC, created for the study of ultra low latency inference with hls4ml. Use 16 high level features to identify the 5 jet classes: quark (q), gluon (g), W boson (w),…
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
830000 instances - 17 features - 5 classes - 0 missing values
ssc vdv
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
1556 instances - 2 features - classes - 0 missing values
Experiment data obtained by running random configurations of the hnsw kNN through mlr on 116 different classification tasks from openml.
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
111753 instances - 13 features - classes - 0 missing values
Experiment data obtained by running random configurations of glmnet through mlr on 114 different classification tasks from openml.
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
104820 instances - 10 features - classes - 0 missing values
Experiment data obtained by running random configurations of an SVM through mlr on 106 different classification tasks from openml.
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
540576 instances - 15 features - classes - 658962 missing values
Experiment data obtained by running random configurations of rpart through mlr on 115 different classification tasks from openml.
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
92067 instances - 12 features - classes - 0 missing values
Experiment data obtained by running random configurations of ranger through mlr on 119 different classification tasks from openml.
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
278863 instances - 16 features - classes - 138965 missing values
Experiment data obtained by running random configurations of xgboost through mlr on 118 different classification tasks from openml. Parameter descriptions:…
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2955210 instances - 21 features - classes - 7051006 missing values
dataset for bme
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
63 instances - 12 features - classes - 52 missing values
Titanic survival prediction
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
891 instances - 8 features - classes - 0 missing values
Titanic survival prediction
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
891 instances - 8 features - classes - 0 missing values
Titanic survival prediction
6 runs - 0 likes - 0 downloads - 0 reach - 8 impact
891 instances - 8 features - classes - 0 missing values
swd cdef
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
sxd cde
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
xscdc frfgrg
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 1 features - classes - 0 missing values
wded def
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
sds dcdcc
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
Download test
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
3 instances - 2 features - classes - 0 missing values
xsxs cdf
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
6 instances - 2 features - classes - 0 missing values
Upload test
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
958 instances - 10 features - classes - 0 missing values
ddfef fvdf
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
8 instances - 1 features - classes - 0 missing values
wdwd cd
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
efef fdfef
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
zaxa xcdc
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
dedfef
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 1 features - classes - 0 missing values
scs
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
wdede
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
wdwd
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 1 features - classes - 0 missing values
qsqs
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 1 features - classes - 0 missing values
swdw
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
werr
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
ssf
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
swd
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
ddef
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
2 instances - 2 features - classes - 0 missing values
frf r
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 3 features - classes - 0 missing values
e eded
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 4 features - classes - 0 missing values
e3r4vr t4r
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 5 features - classes - 0 missing values
f fr
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 5 features - classes - 0 missing values
testing temperature and ph
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
26 instances - 8 features - classes - 0 missing values
This data is used to test water contamination
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
26 instances - 8 features - classes - 0 missing values
AutoML challenge 2014. Original task: regression. Test and validation sets can be obtained on the ChaLearn website: https://automl.chalearn.org/data
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
99 instances - 200001 features - 0 classes - 0 missing values
Title: Flora. Source: https://automl.chalearn.org/data. Dataset from the first ChaLearn AutoML challenge (2014). Only the training data is included, as there were no labels for validation and…
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
15000 instances - 200001 features - 0 classes - 0 missing values
A subset of the 3D dataset from Princeton's COS 429 Computer Vision course. The dataset consists of 40 models organised into 4 classes of 10 objects each.
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
16000 instances - 4 features - classes - 0 missing values
Version with corrected feature types. 'PrivacySuppressed' values are converted to None. Gathers information on about 7800 different US colleges, including geographical information, stats about the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7063 instances - 47 features - 0 classes - 104305 missing values
Airlines Departure Delay Prediction (Regression). Original data can be found at: http://www.transtats.bts.gov This is a processed version of the original data, designed to predict departure delay (in…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
10000000 instances - 10 features - 0 classes - 0 missing values
User profile data for San Francisco OkCupid users published in [Kim, A. Y., & Escobedo-Land, A. (2015). OKCupid data for introductory statistics and data science courses. Journal of Statistics…
0 runs - 0 likes - 0 downloads - 0 reach - 10 impact
50789 instances - 20 features - 3 classes - 154107 missing values
Bike sharing systems are a new generation of traditional bike rentals in which the whole process, from membership to rental and return, has become automatic. Through these systems, a user is able to easily rent…
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
17379 instances - 13 features - 0 classes - 0 missing values
Bike sharing systems are a new generation of traditional bike rentals in which the whole process, from membership to rental and return, has become automatic. Through these systems, a user is able to easily rent…
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
17379 instances - 13 features - 0 classes - 0 missing values
This is a preprocessed version of the anneal dataset (version 1). All missing values are treated as a nominal value with label '?'. (Quotes for clarity). Because this is not good…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
898 instances - 39 features - 5 classes - 0 missing values
The Inpatient Utilization and Payment Public Use File (Inpatient PUF) provides information on inpatient discharges for Medicare fee-for-service beneficiaries. The Inpatient PUF includes information on…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
163065 instances - 12 features - 0 classes - 0 missing values
sample
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
14 instances - 5 features - classes - 0 missing values
this is test data
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
5 instances - 5 features - classes - 0 missing values
newtest3
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
2 instances - 6 features - classes - 0 missing values
test3
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
2 instances - 8 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
150 instances - 5 features - 3 classes - 0 missing values
This is the same data as version 5 (OpenML ID = 1220) with '_id' features coded as nominal factor variables.
0 runs - 0 likes - 0 downloads - 0 reach - 10 impact
39948 instances - 12 features - 2 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
32 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
19 instances - 10 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
26 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
13 instances - 1143 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets). Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Gasoline consumption is being treated as…
2 runs - 0 likes - 0 downloads - 0 reach - 9 impact
27 instances - 5 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
37 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
15 instances - 10 features - 0 classes - 0 missing values
Dataset listing all-time NFL passers through 1994 by the NFL passing efficiency rating. Associated passing statistics from which this rating is computed are included. The dataset lists statistics for…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
26 instances - 6 features - 0 classes - 0 missing values
This S dump contains 22 data sets from the book Visualizing Data published by Hobart Press (books@hobart.com). The dump was created by data.dump() and can be read back into S by data.restore(). The…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
73 instances - 6 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
4052 instances - 8 features - 0 classes - 0 missing values
chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
50 instances - 3 features - classes - 0 missing values
DATA-SETS FROM DIGGLE, P.J. (1990). TIME SERIES: A BIOSTATISTICAL INTRODUCTION. Oxford University Press. Table: Table A1 Luteinizing hormone. Information about the dataset CLASSTYPE: numeric…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
48 instances - 5 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
22 instances - 40 features - 0 classes - 0 missing values
This file contains data from Regression Analysis By Example, 2nd Edition, by Samprit Chatterjee and Bertram Price, John Wiley, 1991. Data sets have names of the form 'rabe.xxx' where xxx is the page…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
66 instances - 6 features - 0 classes - 0 missing values
Attributes 2, 4, and 6 deleted. Midrange price treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M.…
0 runs - 0 likes - 0 downloads - 0 reach - 18 impact
93 instances - 23 features - 0 classes - 14 missing values
This file is a text file giving details about the time series analysed in 'The Analysis of Time Series' by Chris Chatfield. The 5th edn was published in 1996 and the 6th edn in 2003. The series are…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
235 instances - 13 features - 0 classes - 0 missing values
This file contains data from Regression Analysis By Example, 2nd Edition, by Samprit Chatterjee and Bertram Price, John Wiley, 1991. Data sets have names of the form 'rabe.xxx' where xxx is the page…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
51 instances - 7 features - 0 classes - 0 missing values
This file contains data from Regression Analysis By Example, 2nd Edition, by Samprit Chatterjee and Bertram Price, John Wiley, 1991. Data sets have names of the form 'rabe.xxx' where xxx is the page…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
46 instances - 4 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
34 instances - 9 features - 0 classes - 0 missing values
Datasets from the Data and Story Library, a project illustrating the use of basic statistical methods, converted to ARFF format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
40 instances - 7 features - 0 classes - 3 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
62 instances - 6 features - 0 classes - 0 missing values
It has 3 attributes (ID, tweet, label) and 91299 tweets: 39998 non-sarcastic tweets and 51300 sarcastic tweets.
0 runs - 0 likes - 0 downloads - 0 reach - 9 impact
91298 instances - 2 features - 0 classes - 0 missing values