OpenML
Data reported to the police about the circumstances of personal injury road accidents in Great Britain from 1979, and the make and model information of vehicles involved in the respective accidents.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
363206 instances - 66 features - 0 classes - 876555 missing values
This is a smaller version of the original dataset, containing 1M rows. ### Attribute Information * The first column is the class label (1 for signal, 0 for background) * 21 low-level features…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
1000000 instances - 29 features - 2 classes - 0 missing values
This is the full version of the KDD Cup 2009 dataset. Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
50000 instances - 15001 features - 2 classes - 14616450 missing values
This is the full version of the KDD Cup 2009 dataset. Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
50000 instances - 15001 features - 2 classes - 25108569 missing values
This is the full version of the KDD Cup 2009 dataset. Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
50000 instances - 15001 features - 2 classes - 25108569 missing values
tesl dataset about L
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
150000 instances - 8 features - classes - 0 missing values
Make target (age) numeric. **Author**: 1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
4177 instances - 9 features - 0 classes - 0 missing values
String datetime information extracted to numeric columns. Trip Record Data provided by the New York City Taxi and Limousine Commission (TLC)…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
581835 instances - 19 features - 0 classes - 0 missing values
Training dataset of the 'Porto Seguro's Safe Driver Prediction' Kaggle challenge [https://www.kaggle.com/c/porto-seguro-safe-driver-prediction]. The goal was to predict whether a driver will file an…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
595212 instances - 58 features - 2 classes - 846458 missing values
INTRUSION DETECTOR LEARNING Software to detect network intrusions protects a computer network from unauthorized users, including perhaps insiders. The intrusion detector learning task is to build a…
0 runs - 1 likes - 0 downloads - 1 reach - 1 impact
4898431 instances - 42 features - 23 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
4 runs - 0 likes - 0 downloads - 0 reach - 1 impact
45918 instances - 22 features - 0 classes - 0 missing values
This is the same data as version 5 (OpenML ID = 1220) with '_id' features coded as nominal factor variables.
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
39948 instances - 12 features - 2 classes - 0 missing values
User profile data for San Francisco OkCupid users published in [Kim, A. Y., & Escobedo-Land, A. (2015). OKCupid data for introductory statistics and data science courses. Journal of Statistics…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
50789 instances - 20 features - 3 classes - 154107 missing values
# Achieved Frames per Second (FPS) in video games This dataset contains FPS measurement of video games executed on computers. Each row of the dataset describes the outcome of FPS measurement (outcome…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
425833 instances - 45 features - 0 classes - 1299988 missing values
Date converted to year/mo/day numerics. This dataset contains house sale prices for King County, which includes Seattle. It includes homes sold between May 2014 and May 2015. It contains 19 house…
0 runs - 0 likes - 3 downloads - 3 reach - 1 impact
21613 instances - 22 features - 0 classes - 0 missing values
Phishing website 1
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
11055 instances - 31 features - 0 classes - 0 missing values
Email dataset 2
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
11507 instances - 4 features - 0 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
26 instances - 5 features - classes - 0 missing values
testing
0 runs - 0 likes - 1 downloads - 1 reach - 2 impact
3279 instances - 1559 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
150 instances - 5 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
336 instances - 8 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
2178 instances - 4 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
8124 instances - 23 features - classes - 2480 missing values
DESCRIPTIVE ABSTRACT: The data set contains the oral, written and combined test scores for 2003 New Haven Fire Department promotion exams. The Race and Position for each test taker are also given.…
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
118 instances - 6 features - 2 classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
101 instances - 18 features - classes - 0 missing values
URL dataset
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
121001 instances - 501 features - 0 classes - 0 missing values
URL dataset 2
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
95911 instances - 13 features - 0 classes - 0 missing values
URL dataset 3
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
18982 instances - 80 features - 5 classes - 0 missing values
This is a preprocessed version of the anneal dataset (version 1). All missing values are treated as a nominal value with label '?'. (Quotes for clarity). Because this is not good…
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
898 instances - 39 features - 5 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
37 instances - 19 features - classes - 0 missing values
This is weather data in arff format
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
14 instances - 5 features - classes - 0 missing values
sample
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
14 instances - 5 features - classes - 0 missing values
test data test
0 runs - 0 likes - 1 downloads - 1 reach - 2 impact
2 instances - 5 features - classes - 0 missing values
this is test data
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
5 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
150 instances - 5 features - 3 classes - 0 missing values
Salary Emp
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
31 instances - 2 features - classes - 0 missing values
A subset of the 3D dataset from Princeton's COS 429 Computer Vision course. The dataset consists of 40 models organised into 4 classes of 10 objects each.
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
16000 instances - 4 features - classes - 0 missing values
Coal mining requires working in hazardous conditions. Miners in an underground coal mine can face several threats, such as methane explosions or rock bursts. To provide protection for people…
0 runs - 0 likes - 2 downloads - 2 reach - 2 impact
9199930 instances - 34 features - classes - 0 missing values
Airlines Departure Delay Prediction (Regression). Original data can be found at: http://www.transtats.bts.gov This is a processed version of the original data, designed to predict departure delay (in…
0 runs - 2 likes - 1 downloads - 3 reach - 2 impact
10000000 instances - 10 features - 0 classes - 0 missing values
Airlines Departure Delay Prediction (Regression). Original data can be found at: http://www.transtats.bts.gov This is a processed version of the original data, designed to predict departure delay (in…
0 runs - 0 likes - 2 downloads - 2 reach - 2 impact
1000000 instances - 10 features - 0 classes - 0 missing values
AutoML challenge 2014. Original task: regression. Test and validation sets can be obtained on the ChaLearn website: https://automl.chalearn.org/data
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
99 instances - 200001 features - 0 classes - 0 missing values
data from yahoo finance
0 runs - 0 likes - 1 downloads - 1 reach - 2 impact
1259 instances - 7 features - classes - 0 missing values
Testing dataset
0 runs - 0 likes - 1 downloads - 1 reach - 3 impact
134731 instances - 31 features - 2 classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs - 0 likes - 1 downloads - 1 reach - 3 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
4324 instances - 9 features - classes - 3360 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
2580 instances - 7 features - classes - 2541 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
8553 instances - 10 features - classes - 18454 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
2580 instances - 7 features - classes - 2541 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
20058 instances - 16 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
16598 instances - 11 features - classes - 329 missing values
Bike sharing systems are a new generation of traditional bike rentals where the whole process of membership, rental, and return has become automatic. Through these systems, a user is able to easily rent…
0 runs - 0 likes - 2 downloads - 2 reach - 3 impact
17379 instances - 13 features - 0 classes - 0 missing values
Bike sharing systems are a new generation of traditional bike rentals where the whole process of membership, rental, and return has become automatic. Through these systems, a user is able to easily rent…
0 runs - 0 likes - 1 downloads - 1 reach - 3 impact
17379 instances - 13 features - 0 classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
60197 instances - 6 features - classes - 42138 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
60197 instances - 6 features - classes - 42138 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
60197 instances - 6 features - classes - 42138 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
60197 instances - 6 features - classes - 128136 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
60197 instances - 6 features - classes - 128136 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 3 impact
60197 instances - 6 features - classes - 42138 missing values
Email dataset 1a
0 runs - 0 likes - 0 downloads - 0 reach - 4 impact
4585 instances - 4 features - 0 classes - 0 missing values
Email dataset 1b
0 runs - 0 likes - 0 downloads - 0 reach - 4 impact
4585 instances - 24 features - 0 classes - 161 missing values
Email dataset 1c
0 runs - 0 likes - 0 downloads - 0 reach - 4 impact
4585 instances - 792 features - 0 classes - 0 missing values
Email dataset 1d
0 runs - 0 likes - 0 downloads - 0 reach - 4 impact
4585 instances - 11 features - 0 classes - 0 missing values
Email dataset 1e
0 runs - 0 likes - 0 downloads - 0 reach - 4 impact
4585 instances - 580 features - 0 classes - 0 missing values
No data.
7 runs - 0 likes - 0 downloads - 0 reach - 4 impact
45918 instances - 22 features - 0 classes - 0 missing values
MY Dataset
0 runs - 0 likes - 0 downloads - 0 reach - 4 impact
120 instances - 7 features - classes - 0 missing values
test3
0 runs - 0 likes - 0 downloads - 0 reach - 4 impact
2 instances - 8 features - classes - 0 missing values
Rows with NaN and inf values removed. Data converted from CSV to ARFF.
0 runs - 0 likes - 0 downloads - 0 reach - 4 impact
18982 instances - 80 features - classes - 0 missing values
AutoML challenge 2014. Original task: regression. Test and validation sets can be obtained on the ChaLearn website: https://automl.chalearn.org/data
0 runs - 0 likes - 0 downloads - 0 reach - 4 impact
400000 instances - 101 features - 0 classes - 0 missing values
% Title: Flora % Source: https://automl.chalearn.org/data % % Dataset from the first ChaLearn AutoML challenge (2014). % Only the training data is included, as there were no labels for validation and…
0 runs - 0 likes - 0 downloads - 0 reach - 4 impact
15000 instances - 200001 features - 0 classes - 0 missing values
https://archive.ics.uci.edu/ml/datasets/Diabetes
0 runs - 0 likes - 2 downloads - 2 reach - 4 impact
768 instances - 9 features - classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
41188 instances - 21 features - classes - 0 missing values
This dataset contains 10962 houses to rent with 13 different features. Some values in the dataset can be considered as outliers for further analyses. Bear in mind that the Web Crawler was used only to…
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
10692 instances - 13 features - 0 classes - 0 missing values
newtest3
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
2 instances - 6 features - classes - 0 missing values
testing temperature and ph
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
26 instances - 8 features - classes - 0 missing values
BitcoinHeist Ransomware Dataset Akcora, C.G., Li, Y., Gel, Y.R. and Kantarcioglu, M., 2019. BitcoinHeist. Topological Data Analysis for Ransomware Detection on the Bitcoin Blockchain. IJCAI-PRICAI…
0 runs - 1 likes - 0 downloads - 1 reach - 6 impact
2916697 instances - 10 features - 29 classes - 0 missing values
Airlines dataset, inspired by the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given the information of the scheduled departure. For this…
0 runs - 0 likes - 2 downloads - 2 reach - 6 impact
26969 instances - 8 features - 2 classes - 0 missing values
ede wey
0 runs - 0 likes - 1 downloads - 1 reach - 6 impact
589 instances - 2909 features - classes - 0 missing values
Touch Signals
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
265 instances - 11 features - classes - 0 missing values
The midwest survey dataset contains individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2778 instances - 28 features - 10 classes - 1744 missing values
swd dced
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
589 instances - 3 features - classes - 0 missing values
sdsw frfr
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
1556 instances - 3 features - classes - 0 missing values
efe rgrg
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
e fvr
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 11 features - classes - 0 missing values
efef ffrf
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
9 instances - 3 features - classes - 0 missing values
ssc vdv
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
1556 instances - 2 features - classes - 0 missing values
The midwest survey dataset contains individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2778 instances - 28 features - 10 classes - 1744 missing values
frf r
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 3 features - classes - 0 missing values
e eded
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 4 features - classes - 0 missing values
e3r4vr t4r
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 5 features - classes - 0 missing values
f fr
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
2 instances - 5 features - classes - 0 missing values
student performance analysis 1
0 runs - 0 likes - 1 downloads - 1 reach - 6 impact
3892 instances - 36 features - classes - 0 missing values
mydata
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
3892 instances - 36 features - classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
1000 instances - 21 features - classes - 0 missing values