OpenML
Filter results by:
f fr
0 runs0 likes0 downloads0 reach6 impact
2 instances - 5 features - classes - 0 missing values
testing temperature and ph
0 runs0 likes0 downloads0 reach3 impact
26 instances - 8 features - classes - 0 missing values
This data is used to test water contamination
0 runs0 likes0 downloads0 reach7 impact
26 instances - 8 features - classes - 0 missing values
A subset of the 3D dataset from Princeton\'s COS 429 Computer Vision course. The dataset consists of 40 models organised into 4 classes of 10 objects each.
0 runs0 likes0 downloads0 reach2 impact
16000 instances - 4 features - classes - 0 missing values
sample
0 runs0 likes0 downloads0 reach2 impact
14 instances - 5 features - classes - 0 missing values
this is test data
0 runs0 likes0 downloads0 reach2 impact
5 instances - 5 features - classes - 0 missing values
newtest3
0 runs0 likes0 downloads0 reach3 impact
2 instances - 6 features - classes - 0 missing values
test3
0 runs0 likes0 downloads0 reach2 impact
2 instances - 8 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach2 impact
150 instances - 5 features - classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach11 impact
50 instances - 3 features - classes - 0 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
0 runs0 likes1 downloads1 reach11 impact
31 instances - 16 features - classes - 150 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes2 downloads2 reach11 impact
100 instances - 10 features - classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes2 downloads2 reach11 impact
366 instances - 5 features - classes - 2 missing values
Multi-label dataset. The birds dataset consists of 327 audio recordings of 12 different vocalizing bird species. Each sound can be assigned to various bird species.
0 runs0 likes0 downloads0 reach9 impact
645 instances - 279 features - classes - 0 missing values
Multi-label dataset. Audio dataset (emotions) consists of 593 musical files with 6 clustered emotional labels and 72 predictors. Each song can be labeled with one or more of the labels…
0 runs0 likes0 downloads0 reach9 impact
593 instances - 78 features - classes - 0 missing values
Multi-label dataset. The UC Berkeley enron4 dataset represents a subset of the original enron5 dataset and consists of 1684 cases of emails with 21 labels and 1001 predictor variables.
0 runs0 likes0 downloads0 reach9 impact
1702 instances - 1054 features - classes - 0 missing values
Multi-label dataset. The genbase dataset contains protein sequences that can be assigned to several classes of protein families.
0 runs0 likes0 downloads0 reach9 impact
662 instances - 1212 features - classes - 0 missing values
Multi-label dataset. The image benchmark dataset consists of 2000 natural scene images. Zhou and Zhang (2007) extracted 135 features for each image and made it publicly available as processed image…
0 runs1 likes1 downloads2 reach9 impact
2000 instances - 140 features - classes - 0 missing values
The langLog dataset includes 1004 textual predictors and was originally compiled in the doctorial thesis of Read (2010). It consists of 956 text samples that can be assigned to one or more topics such…
0 runs0 likes0 downloads0 reach9 impact
1460 instances - 1079 features - classes - 0 missing values
Multi-label dataset. A subset of the reuters dataset includes 2000 observations for text classification.
0 runs0 likes0 downloads0 reach9 impact
2000 instances - 250 features - classes - 0 missing values
Multi-label dataset. The scene dataset is an image classification task where labels like Beach, Mountain, Field, Urban are assigned to each image.
0 runs0 likes0 downloads0 reach9 impact
2407 instances - 300 features - classes - 0 missing values
Multi-label dataset for text-classification. It consists of article titles and partial blurbs. Blurbs can be assigned to several categories (e.g. Science, News, Games) based on word predictors.
0 runs0 likes2 downloads2 reach13 impact
3782 instances - 1101 features - classes - 0 missing values
Multi-label dataset. The yeast dataset (Elisseeff and Weston, 2002) consists of micro-array expression data, as well as phylogenetic profiles of yeast, and includes 2417 genes and 103 predictors. In…
0 runs0 likes0 downloads0 reach9 impact
2417 instances - 117 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Andromeda dataset (Hatzikos et al. 2008) concerns the prediction of future values for six…
0 runs0 likes0 downloads0 reach9 impact
49 instances - 36 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Airline Ticket Price dataset concerns the prediction of airline ticket prices. The rows are a…
0 runs0 likes0 downloads0 reach9 impact
337 instances - 417 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Airline Ticket Price dataset concerns the prediction of airline ticket prices. The rows are a…
0 runs0 likes0 downloads0 reach9 impact
296 instances - 417 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Electrical Discharge Machining dataset (Karalic and Bratko 1997) represents a two-target…
0 runs0 likes0 downloads0 reach9 impact
154 instances - 18 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Energy Building dataset (Tsanas and Xifara 2012) concerns the prediction of the heating load…
0 runs0 likes0 downloads0 reach9 impact
768 instances - 10 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Jura (Goovaerts 1997) dataset consists of measurements of concentrations of seven heavy…
0 runs0 likes0 downloads0 reach9 impact
359 instances - 18 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : This is a pre-processed version of the dataset used in Kaggles Online Product Sales competition…
0 runs0 likes0 downloads0 reach9 impact
639 instances - 413 features - classes - 10012 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The river flow datasets concern the prediction of river network flows for 48 h in the future at…
0 runs0 likes0 downloads0 reach9 impact
9125 instances - 72 features - classes - 3264 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The river flow datasets concern the prediction of river network flows for 48 h in the future at…
0 runs0 likes0 downloads0 reach9 impact
9125 instances - 584 features - classes - 356160 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Supply Chain Management datasets are derived from the Trading Agent Competition in Supply…
0 runs0 likes0 downloads0 reach9 impact
9803 instances - 296 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Supply Chain Management datasets are derived from the Trading Agent Competition in Supply…
0 runs0 likes0 downloads0 reach9 impact
8966 instances - 77 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : This is a pre-processed version of the dataset used in Kaggles See Click Predict Fix competition…
0 runs0 likes0 downloads0 reach9 impact
1137 instances - 26 features - classes - 9255 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Solar Flare dataset (Lichman 2013) has 3 target variables that correspond to the number of…
0 runs0 likes0 downloads0 reach9 impact
323 instances - 13 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Solar Flare dataset (Lichman 2013) has 3 target variables that correspond to the number of…
0 runs0 likes0 downloads0 reach9 impact
1066 instances - 13 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Concrete Slump dataset (Yeh 2007) concerns the prediction of three properties of concrete…
0 runs1 likes0 downloads1 reach10 impact
103 instances - 10 features - classes - 0 missing values
Sensor data measurements of one Boiler, containing WaterInput/SteamOutput (flow, temperature, pressure) for one month, which is measured every minute.
0 runs0 likes1 downloads1 reach9 impact
44643 instances - 8 features - classes - 44643 missing values
Los Angeles ozone pollution data, 1976
0 runs0 likes0 downloads0 reach9 impact
A copy of the data set proposed in: S. M. Weiss, and C. A. Kulikowski, Computer Systems That Learn (1991).
30 runs0 likes3 downloads3 reach13 impact
106 instances - 8 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach17 impact
209 instances - 8 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach11 impact
24 instances - 5 features - classes - 0 missing values
Coal mining requires working in hazardous conditions. Miners in an underground coal mine can face several threats, such as, e.g. methane explosions or rock-burst. To provide protection for people…
0 runs0 likes0 downloads0 reach2 impact
9199930 instances - 34 features - classes - 0 missing values
The YouTube personality dataset consists of a collection of behavorial features, speech transcriptions, and personality impression scores for a set of 404 YouTube vloggers that explicitly show…
0 runs0 likes1 downloads1 reach9 impact
404 instances - 31 features - classes - 0 missing values
SK daily COVID19
0 runs0 likes0 downloads0 reach0 impact
280 instances - 7 features - classes - 0 missing values
Israeli lottery
0 runs0 likes1 downloads1 reach8 impact
1153 instances - 11 features - classes - 0 missing values
50 Danish words with their pronunciation from Dansk Ordbog
0 runs0 likes0 downloads0 reach0 impact
51 instances - 2 features - classes - 2 missing values
Abstract: CART book's waveform domains Source: Original Owners: Breiman,L., Friedman,J.H., Olshen,R.A., & Stone,C.J. (1984). Classification and Regression Trees. Wadsworth International Group:…
0 runs2 likes6 downloads8 reach11 impact
5000 instances - 22 features - classes - 0 missing values
Data reported to the police about the circumstances of personal injury road accidents in Great Britain from 1979, and the maker and model information of vehicles involved in the respective accident
0 runs0 likes2 downloads2 reach0 impact
Data from https://doi.org/10.5281/zenodo.269636
0 runs0 likes5 downloads5 reach14 impact
4758 instances - 39 features - classes - 0 missing values
testing
0 runs0 likes0 downloads0 reach0 impact
3279 instances - 1559 features - classes - 0 missing values
service data
0 runs0 likes0 downloads0 reach0 impact
34 instances - 8 features - classes - 0 missing values
tesl dataset about L
0 runs0 likes0 downloads0 reach0 impact
150000 instances - 8 features - classes - 0 missing values
Multi-label dataset. The image benchmark dataset consists of 2000 natural scene images. Zhou and Zhang (2007) extracted 135 features for each image and made it publicly available as processed image…
0 runs0 likes3 downloads3 reach9 impact
2000 instances - 140 features - classes - 0 missing values
test
0 runs0 likes1 downloads1 reach6 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes1 downloads1 reach6 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes1 downloads1 reach6 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes1 downloads1 reach3 impact
1000 instances - 21 features - classes - 0 missing values
#test data for mlp
0 runs0 likes0 downloads0 reach0 impact
200 instances - 12 features - classes - 0 missing values
PM 2.5 datasetd
0 runs0 likes0 downloads0 reach0 impact
43800 instances - 10 features - classes - 0 missing values
The proposed forecasting approach is tested by using the database from UCI machine learning repository. Using a Deep Learning Model Based on 1D Convnets and Bidirectional GRU
0 runs0 likes1 downloads1 reach9 impact
43800 instances - 10 features - classes - 0 missing values
Source: Original Owner: U.S. Census Bureau http://www.census.gov/ United States Department of Commerce Donor: Terran Lane and Ronny Kohavi Data Mining and Visualization Silicon Graphics. terran '@'…
0 runs1 likes8 downloads9 reach15 impact
299285 instances - 42 features - classes - 0 missing values
https://www.kaggle.com/dansbecker/nba-shot-logs
0 runs0 likes0 downloads0 reach0 impact
128069 instances - 21 features - classes - 5567 missing values
test data test
0 runs0 likes1 downloads1 reach2 impact
2 instances - 5 features - classes - 0 missing values
Generated data from c algorithm to break the composition of primes.Into a unique 4 lined 2D object.
0 runs0 likes0 downloads0 reach0 impact
26 instances - 5 features - classes - 0 missing values