OpenML
The data set shows information about participants of a math conference. isPresent is the target column for the classification task.
0 runs - 0 likes - 0 downloads - 0 reach - 9 impact
246 instances - 7 features - 2 classes - 0 missing values
This dataset was gathered to detect whether a person is running or walking based on deep neural networks and sensor data collected from iOS devices. The dataset represents 88588 sensor data samples…
1 runs - 0 likes - 4 downloads - 4 reach - 14 impact
88588 instances - 7 features - 2 classes - 0 missing values
Uniform issuance figures with temperatures, hires, and dismissals
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
6277 instances - 7 features - 0 classes - 0 missing values
dsd efe
1 runs - 0 likes - 0 downloads - 0 reach - 7 impact
601 instances - 7 features - classes - 0 missing values
b gtrg
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
4 instances - 7 features - classes - 0 missing values
palmerpenguins. Description: The goal of palmerpenguins is to provide a great dataset for data exploration &…
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
344 instances - 7 features - 3 classes - 18 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) The infamous Longley data, "An appraisal of least-squares programs from the point of view of the user", JASA, 62(1967) p819-841. Variables are: Number of…
3 runs - 0 likes - 1 downloads - 1 reach - 9 impact
16 instances - 7 features - 0 classes - 0 missing values
The data consist of annual observations on the level of strike volume (days lost due to industrial disputes per 1000 wage salary earners), and their covariates in 18 OECD countries from 1951-1985. The…
0 runs - 0 likes - 2 downloads - 2 reach - 13 impact
625 instances - 7 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
8 runs - 0 likes - 2 downloads - 2 reach - 13 impact
50 instances - 7 features - 0 classes - 0 missing values
Estimated article influence scores in 2015
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
3615 instances - 7 features - 3169 classes - 48 missing values
Data from Yahoo Finance
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
1259 instances - 7 features - classes - 0 missing values
MY Dataset
0 runs - 0 likes - 0 downloads - 0 reach - 4 impact
120 instances - 7 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
2580 instances - 7 features - classes - 2541 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
2580 instances - 7 features - classes - 2541 missing values
Geographical Analysis Spatial Data This georeferenced data set was used in: Pace, R. Kelley, and Ronald Barry, Quick Computation of Regressions with a Spatially Autoregressive Dependent Variable,…
4 runs - 1 likes - 1 downloads - 2 reach - 14 impact
3107 instances - 7 features - 0 classes - 0 missing values
The problem concerns Relative CPU Performance Data. More information can be obtained in the UCI Machine Learning repository (http://www.ics.uci.edu/~mlearn/MLSummary.html). The attributes used are:…
2 runs - 0 likes - 2 downloads - 2 reach - 12 impact
209 instances - 7 features - 0 classes - 0 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
30 instances - 7 features - 0 classes - 6 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
147 instances - 7 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
93 instances - 7 features - 0 classes - 0 missing values
Data on the recurrence times to infection, at the point of insertion of the catheter, for kidney patients using portable dialysis equipment. Catheters may be removed for reasons other than infection,…
2 runs - 0 likes - 1 downloads - 1 reach - 13 impact
76 instances - 7 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) SUMMARY: Data from an experiment on the effects of machine adjustments on the time to count bolts. Data appear as the STATS (Issue 10) Challenge. DATA:…
4 runs - 0 likes - 0 downloads - 0 reach - 9 impact
40 instances - 7 features - 0 classes - 0 missing values
This data set is also obtained from the task of controlling the ailerons of an F16 aircraft, although the target variable and attributes are different from the ailerons domain. The target variable here…
2 runs - 0 likes - 3 downloads - 3 reach - 9 impact
9517 instances - 7 features - 0 classes - 0 missing values
Incident reports from the San Francisco Police Department between January 2003 and May 2018, provided by the City and County of San Francisco. The dataset was downloaded on 05.11.2018. from…
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
538638 instances - 7 features - 2 classes - 0 missing values
Prediction of residuary resistance of sailing yachts at the initial design stage is of great value for evaluating the ship’s performance and for estimating the required propulsive…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
308 instances - 7 features - 0 classes - 0 missing values
Daily electric energy dataset The dee problem involves predicting the daily average price of TkWhe electricity energy in Spain. The data set contains real values from 2003 about the daily consumption…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
365 instances - 7 features - 0 classes - 0 missing values
This file contains data from Regression Analysis By Example, 2nd Edition, by Samprit Chatterjee and Bertram Price, John Wiley, 1991. Data sets have names of the form 'rabe.xxx' where xxx is the page…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
51 instances - 7 features - 0 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistical methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
40 instances - 7 features - 0 classes - 3 missing values
These weekly averages are ultimately based on measurements of 4 air samples per hour taken atop intake lines on several towers during steady periods of CO2 concentration of not less than 6 hours per…
0 runs - 1 likes - 1 downloads - 2 reach - 9 impact
2225 instances - 7 features - 0 classes - 0 missing values
Abstract: A chess endgame data set representing the positions on the board of the white king, the white rook, and the black king. The task is to determine the optimum number of turns required for white…
25 runs - 0 likes - 5 downloads - 5 reach - 14 impact
28056 instances - 7 features - 18 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
745 runs - 0 likes - 11 downloads - 11 reach - 15 impact
3107 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
856 runs - 0 likes - 11 downloads - 11 reach - 15 impact
209 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
780 runs - 0 likes - 10 downloads - 10 reach - 15 impact
625 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
740 runs - 0 likes - 5 downloads - 5 reach - 14 impact
51 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
774 runs - 0 likes - 14 downloads - 14 reach - 15 impact
9517 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
767 runs - 0 likes - 9 downloads - 9 reach - 14 impact
76 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
731 runs - 0 likes - 5 downloads - 5 reach - 14 impact
93 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
120 runs - 0 likes - 5 downloads - 5 reach - 14 impact
50 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
765 runs - 0 likes - 13 downloads - 13 reach - 15 impact
1728 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
817 runs - 0 likes - 8 downloads - 8 reach - 15 impact
400 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1038 runs - 0 likes - 9 downloads - 9 reach - 14 impact
147 instances - 7 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
773 runs - 0 likes - 9 downloads - 9 reach - 15 impact
2000 instances - 7 features - 2 classes - 0 missing values
File README: chscase. A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
400 instances - 7 features - 0 classes - 0 missing values
* Abstract: The data was created by a medical expert as a data set to test the expert system, which will perform the presumptive diagnosis of two diseases of the urinary system. * Source: Jacek…
391 runs - 0 likes - 11 downloads - 11 reach - 13 impact
120 instances - 7 features - 2 classes - 0 missing values
1. Title: Space Shuttle Autolanding Domain 2. Sources: (a) Original source: unknown -- NASA: Mr. Roger Burke's autolander design team (b) Donor: Bojan Cestnik Jozef Stefan Institute Jamova 39 61000…
1466 runs - 0 likes - 9 downloads - 9 reach - 9 impact
15 instances - 7 features - 2 classes - 26 missing values
Classify a chess game based on the position of the white king, the white rook and the black king.
1777 runs - 0 likes - 15 downloads - 15 reach - 9 impact
28056 instances - 7 features - 18 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
35343 runs - 0 likes - 17 downloads - 17 reach - 11 impact
2000 instances - 7 features - 10 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
394293 runs - 2 likes - 27 downloads - 29 reach - 37 impact
601 instances - 7 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
108666 runs - 1 likes - 14 downloads - 15 reach - 34 impact
554 instances - 7 features - 2 classes - 0 missing values
Once upon a time, in July 1991, the monks of Corsendonk Priory were faced with a school held in their priory, namely the 2nd European Summer School on Machine Learning. After listening more than one…
358450 runs - 2 likes - 18 downloads - 20 reach - 39 impact
556 instances - 7 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1034 runs - 0 likes - 10 downloads - 10 reach - 14 impact
100 instances - 7 features - 2 classes - 0 missing values
Mammography dataset Past Usage: 1. Woods, K., Doss, C., Bowyer, K., Solka, J., Priebe, C.,
218 runs - 5 likes - 48 downloads - 53 reach - 24 impact
11183 instances - 7 features - 2 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
731 runs - 0 likes - 5 downloads - 5 reach - 22 impact
151 instances - 7 features - 3 classes - 0 missing values
File README: chscase. A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
22 runs - 0 likes - 2 downloads - 2 reach - 14 impact
400 instances - 8 features - 0 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistical methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs - 0 likes - 2 downloads - 2 reach - 13 impact
48 instances - 8 features - 0 classes - 0 missing values
* Title: seeds Data Set * Abstract: Measurements of geometrical properties of kernels belonging to three different varieties of wheat. A soft X-ray technique and GRAINS package were used to construct…
190 runs - 0 likes - 5 downloads - 5 reach - 13 impact
210 instances - 8 features - 3 classes - 0 missing values
* Title: seismic-bumps Data Set * Abstract: The data describe the problem of high energy (higher than 10^4 J) seismic bumps forecasting in a coal mine. Data come from two of longwalls located in a…
152 runs - 0 likes - 37 downloads - 37 reach - 13 impact
210 instances - 8 features - 3 classes - 0 missing values
This database has been artificially generated. It describes the structure of the capital letters A, C, D, E, F, G, H, L, P, R, indicated by a number 1-10, in that order (A=1,C=2,...). Each letter's…
24309 runs - 0 likes - 10 downloads - 10 reach - 57 impact
10218 instances - 8 features - 10 classes - 0 missing values
"The debutanizer column is part of a desulfuring and naphtha splitter plant." u1 Top temperature u2 Top pressure u3 Reflux flow u4 Flow to next process u5 6th tray temperature u6 Bottom…
0 runs - 0 likes - 1 downloads - 1 reach - 11 impact
2394 instances - 8 features - 0 classes - 0 missing values
cars1-pmlb
31 runs - 0 likes - 3 downloads - 3 reach - 20 impact
392 instances - 8 features - 3 classes - 0 missing values
cleveland-nominal-pmlb
31 runs - 0 likes - 1 downloads - 1 reach - 20 impact
303 instances - 8 features - 5 classes - 0 missing values
ecoli-pmlb
31 runs - 0 likes - 1 downloads - 1 reach - 20 impact
327 instances - 8 features - 5 classes - 0 missing values
led7-pmlb
31 runs - 0 likes - 0 downloads - 0 reach - 21 impact
3200 instances - 8 features - 10 classes - 0 missing values
The goal is to predict the Fare. Variable description: pclass: A proxy for socio-economic status (SES) 1st = Upper 2nd = Middle 3rd = Lower age: Age is fractional if less than 1. If the age is…
0 runs - 0 likes - 4 downloads - 4 reach - 10 impact
1307 instances - 8 features - 0 classes - 0 missing values
Data
0 runs - 0 likes - 1 downloads - 1 reach - 10 impact
539383 instances - 8 features - 2 classes - 0 missing values
Test
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
6330 instances - 8 features - classes - 0 missing values
This data represents crime reported to the Seattle Police Department (SPD). Each row contains the record of a unique event where at least one criminal offense was reported by a member of the community…
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
52358 instances - 8 features - 0 classes - 650 missing values
Airlines Dataset Inspired by the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given the information of the scheduled departure. For this…
0 runs - 0 likes - 1 downloads - 1 reach - 6 impact
26969 instances - 8 features - 2 classes - 0 missing values
This is the hip measurement data from Table B.13 in Chatfield's Problem Solving (1995, 2nd edn, Chapman and Hall). It is given in 8 columns. First 4 columns are for Control Group. Last 4 columns are…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
54 instances - 8 features - classes - 120 missing values
Graeme D. Hutcheson and Nick Sofroniou 1999 The Multivariate Social Scientist: Introductory Statistics Using Generalized Linear Models. SAGE Publications. Copyright: Graeme D. Hutcheson & Nick…
2 runs - 0 likes - 0 downloads - 0 reach - 13 impact
70 instances - 8 features - 0 classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
336 instances - 8 features - classes - 0 missing values
xxx
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 8 features - classes - 689 missing values
xxx
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
891 instances - 8 features - 2 classes - 689 missing values
Identifier attribute deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
2 runs - 0 likes - 2 downloads - 2 reach - 12 impact
398 instances - 8 features - 0 classes - 6 missing values
The data are a subsample of 500 observations from a data set that originates in a study where air pollution at a road is related to traffic volume and meteorological variables, collected by the…
2 runs - 0 likes - 1 downloads - 1 reach - 13 impact
500 instances - 8 features - 0 classes - 0 missing values
Airlines Dataset Inspired by the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given the information of the scheduled departure.
291 runs - 0 likes - 29 downloads - 29 reach - 16 impact
539383 instances - 8 features - 2 classes - 0 missing values
1. Title: Echocardiogram Data 2. Source Information: -- Donor: Steven Salzberg (salzberg@cs.jhu.edu) -- Collector: -- Dr. Evlin Kinney -- The Reed Institute -- P.O. Box 402603 -- Miami, FL 33140-0603…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
132 instances - 8 features - 4 classes - 103 missing values
File README: chscase. A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
400 instances - 8 features - 0 classes - 0 missing values
The data are a subsample of 500 observations from a data set that originates in a study where air pollution at a road is related to traffic volume and meteorological variables, collected by the…
2 runs - 0 likes - 1 downloads - 1 reach - 13 impact
500 instances - 8 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs - 0 likes - 1 downloads - 1 reach - 11 impact
228 instances - 8 features - classes - 20 missing values
This analysis describes and summarizes the relationships between 1987 salaries of major league baseball players and the player's performance. The salary data were taken from Sports Illustrated, April…
0 runs - 1 likes - 1 downloads - 2 reach - 13 impact
26 instances - 8 features - 0 classes - 0 missing values
This data represents crime reported to the Seattle Police Department (SPD). Each row contains the record of a unique event where at least one criminal offense was reported by a member of the community…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
523590 instances - 8 features - 144 classes - 6916 missing values
exercises
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
15000 instances - 8 features - classes - 0 missing values
exercises
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
15000 instances - 8 features - classes - 0 missing values
Datasets for 'Pattern Recognition and Neural Networks' by B.D. Ripley, Cambridge University Press (1996), ISBN 0-521-46086-7. The…
743 runs - 0 likes - 8 downloads - 8 reach - 15 impact
200 instances - 8 features - 2 classes - 0 missing values
Weight treated as the class attribute. Identifier deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric…
10 runs - 0 likes - 2 downloads - 2 reach - 12 impact
158 instances - 8 features - 0 classes - 87 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by Allison, T. and Cicchetti, D.…
0 runs - 0 likes - 1 downloads - 1 reach - 9 impact
62 instances - 8 features - 0 classes - 12 missing values
File README: chscase. A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
400 instances - 8 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
0 runs - 0 likes - 0 downloads - 0 reach - 13 impact
47 instances - 8 features - 0 classes - 0 missing values
Attributes 2 and 8 deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
2 runs - 0 likes - 2 downloads - 2 reach - 19 impact
209 instances - 8 features - 0 classes - 0 missing values
This is Titanic survival prediction
0 runs - 0 likes - 2 downloads - 2 reach - 7 impact
891 instances - 8 features - 0 classes - 0 missing values
Titanic survival prediction
0 runs - 0 likes - 2 downloads - 2 reach - 7 impact
891 instances - 8 features - 0 classes - 0 missing values
This is Titanic survival prediction
0 runs - 0 likes - 3 downloads - 3 reach - 7 impact
891 instances - 8 features - 0 classes - 0 missing values
Titanic survival prediction
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
891 instances - 8 features - 0 classes - 0 missing values
Titanic survival prediction
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
891 instances - 8 features - classes - 0 missing values
Titanic survival prediction
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
891 instances - 8 features - classes - 0 missing values
Titanic survival prediction
0 runs - 0 likes - 0 downloads - 0 reach - 7 impact
891 instances - 8 features - classes - 0 missing values
Titanic survival prediction
6 runs - 0 likes - 0 downloads - 0 reach - 8 impact
891 instances - 8 features - classes - 0 missing values
Titanic survival prediction
0 runs - 0 likes - 1 downloads - 1 reach - 7 impact
891 instances - 8 features - 0 classes - 0 missing values
Titanic survival prediction
0 runs - 0 likes - 1 downloads - 1 reach - 7 impact
891 instances - 8 features - 0 classes - 0 missing values
r fgtgt
0 runs - 0 likes - 1 downloads - 1 reach - 7 impact
2 instances - 8 features - classes - 0 missing values