OpenML
test
0 runs0 likes0 downloads0 reach7 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach6 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach6 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach6 impact
1000 instances - 21 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
diabetes
0 runs0 likes0 downloads0 reach6 impact
768 instances - 9 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
test
0 runs0 likes0 downloads0 reach6 impact
891 instances - 12 features - classes - 866 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 19 features - 4 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes1 downloads1 reach13 impact
8 instances - 1143 features - 0 classes - 0 missing values
Information about customers consists of 86 variables and includes product usage data and socio-demographic data derived from zip area codes. The data was supplied by the Dutch data mining company…
0 runs0 likes3 downloads3 reach13 impact
9822 instances - 86 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
14 instances - 1143 features - 0 classes - 0 missing values
analcatdata: A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes1 downloads1 reach13 impact
365 instances - 4 features - 0 classes - 30 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
30 instances - 1143 features - 0 classes - 0 missing values
Primary Biliary Cirrhosis. This data set is a follow-up to the original PBC data set, as discussed in appendix D of Fleming and Harrington, Counting Processes and Survival Analysis, Wiley, 1991. An…
0 runs0 likes5 downloads5 reach13 impact
1945 instances - 19 features - 0 classes - 1133 missing values
One of two multivariate regression data sets from paper industry, from an experiment at the paper plant Saugbruksforeningen, Norway. They have been described and analysed in: Aldrin, M. (1996),…
0 runs0 likes0 downloads0 reach13 impact
30 instances - 41 features - 0 classes - 0 missing values
Short description: Data on tree growth used in…
0 runs0 likes2 downloads2 reach11 impact
2796 instances - 35 features - 6 classes - 68100 missing values
Data description: COIL 1999 Competition Data. Data type: multivariate. Abstract: This data set is from the 1999 Computational Intelligence and Learning (COIL)…
0 runs0 likes0 downloads0 reach13 impact
316 instances - 12 features - 0 classes - 56 missing values
This file contains data from Regression Analysis By Example, 2nd Edition, by Samprit Chatterjee and Bertram Price, John Wiley, 1991. Data sets have names of the form 'rabe.xxx' where xxx is the page…
0 runs0 likes0 downloads0 reach13 impact
70 instances - 4 features - 0 classes - 0 missing values
Following are data on the shooting of Vinnie Johnson of the Detroit Pistons during the 1985-1986 through 1988-1989 seasons. Source was the New York Times. The data are analyzed in the Carnegie Mellon…
0 runs0 likes0 downloads0 reach13 impact
380 instances - 3 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach13 impact
34 instances - 1143 features - 0 classes - 0 missing values
This S dump contains 22 data sets from the book Visualizing Data published by Hobart Press (books@hobart.com). The dump was created by data.dump() and can be read back into S by data.restore(). The…
0 runs0 likes1 downloads1 reach15 impact
323 instances - 5 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
0 runs0 likes0 downloads0 reach11 impact
2796 instances - 35 features - 2 classes - 68100 missing values
1. Title: Assessing the Reliability of a Human Estimator…
0 runs0 likes0 downloads0 reach13 impact
75 instances - 15 features - 0 classes - 0 missing values
This data set addresses a control problem, namely flying an F16 aircraft. The attributes describe the status of the aeroplane, while the goal is to predict the control action on the ailerons of the…
0 runs0 likes6 downloads6 reach13 impact
13750 instances - 41 features - 0 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/SEK from Dukascopy. One instance (row) is one candlestick of one minute. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes0 downloads0 reach8 impact
375840 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX AUD/SGD from Dukascopy. One instance (row) is one candlestick of one hour. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
43825 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX AUD/CAD from Dukascopy. One instance (row) is one candlestick of one hour. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
43825 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX GBP/USD from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
1834 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/TRY from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
1832 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/GBP from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
1835 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX CAD/JPY from Dukascopy. One instance (row) is one candlestick of one hour. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes2 downloads2 reach8 impact
43825 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/NOK from Dukascopy. One instance (row) is one candlestick of one hour. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
43825 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX AUD/CAD from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
1834 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX AUD/JPY from Dukascopy. One instance (row) is one candlestick of one minute. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes0 downloads0 reach8 impact
375840 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/SEK from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
1837 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX USD/JPY from Dukascopy. One instance (row) is one candlestick of one minute. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes0 downloads0 reach8 impact
375840 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/HKD from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
1832 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/PLN from Dukascopy. One instance (row) is one candlestick of one hour. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes0 downloads0 reach8 impact
43825 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/CAD from Dukascopy. One instance (row) is one candlestick of one minute. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes0 downloads0 reach8 impact
375840 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/HUF from Dukascopy. One instance (row) is one candlestick of one hour. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
43825 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX AUD/USD from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
1834 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX CHF/SGD from Dukascopy. One instance (row) is one candlestick of one minute. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes0 downloads0 reach8 impact
375840 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX EUR/HKD from Dukascopy. One instance (row) is one candlestick of one hour. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
43825 instances - 12 features - 2 classes - 0 missing values
# Data Description This is the historical price data of the FOREX AUD/CHF from Dukascopy. One instance (row) is one candlestick of one day. The whole dataset has the data range from 1-1-2018 to…
0 runs0 likes1 downloads1 reach8 impact
1833 instances - 12 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach8 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach8 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach8 impact
14 instances - 5 features - 2 classes - 0 missing values
analysis of stocks
0 runs0 likes0 downloads0 reach8 impact
245 instances - 15 features - classes - 0 missing values
This dataset is an artificial simulation of the Duffing system with random changes from the chaotic to the non-chaotic regime at different noise levels.
0 runs0 likes0 downloads0 reach8 impact
2493200 instances - 26 features - classes - 0 missing values
This dataset is an artificial simulation of the Duffing system with one phase transition to the chaotic regime.
0 runs0 likes0 downloads0 reach8 impact
9983 instances - 4 features - classes - 0 missing values
punch sound
0 runs0 likes1 downloads1 reach8 impact
221 instances - 1 features - classes - 0 missing values
Hourly particulate matter air pollution data of Great Britain for the year 2017, provided by Ricardo Energy and Environment on behalf of the UK Department for Environment, Food and Rural Affairs…
0 runs0 likes0 downloads0 reach8 impact
394299 instances - 10 features - 0 classes - 0 missing values
Trip Record Data provided by the New York City Taxi and Limousine Commission (TLC) [http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml]. The dataset includes TLC trips of the green line in…
0 runs0 likes0 downloads0 reach9 impact
581835 instances - 15 features - 0 classes - 0 missing values
Context: It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged for items that they did not purchase. Content: The…
0 runs0 likes7 downloads7 reach8 impact
284807 instances - 31 features - 0 classes - 0 missing values
Source: C. Okan Sakar a, Gorkem Serbes b, Aysegul Gunduz c, Hunkar C. Tunc a, Hatice Nizam d, Betul Erdogdu Sakar e, Melih Tutuncu c, Tarkan Aydin a, M. Erdem Isenkul d, Hulya Apaydin c a Department…
0 runs0 likes0 downloads0 reach12 impact
756 instances - 754 features - 0 classes - 0 missing values
1. Title: Echocardiogram Data 2. Source Information: -- Donor: Steven Salzberg (salzberg@cs.jhu.edu) -- Collector: -- Dr. Evlin Kinney -- The Reed Institute -- P.O. Box 402603 -- Miami, FL 33140-0603…
0 runs0 likes0 downloads0 reach8 impact
132 instances - 8 features - 4 classes - 103 missing values
Context: "Predict behavior to retain customers. You can analyze all relevant customer data and develop focused customer retention programs." [IBM Sample Data Sets] Content: Each row represents a…
0 runs1 likes2 downloads3 reach8 impact
7043 instances - 20 features - 2 classes - 0 missing values
The Inpatient Utilization and Payment Public Use File (Inpatient PUF) provides information on inpatient discharges for Medicare fee-for-service beneficiaries. The Inpatient PUF includes information on…
0 runs0 likes2 downloads2 reach8 impact
163065 instances - 12 features - 0 classes - 0 missing values
This dataset contains traffic violation information from all electronic traffic violations issued in the County. Any information that can be used to uniquely identify the vehicle, the vehicle owner or…
0 runs0 likes1 downloads1 reach8 impact
1578154 instances - 43 features - 4 classes - 8006541 missing values
Chocolate Bar Ratings. Expert ratings of over 1,700 chocolate bars. Each chocolate is evaluated from a combination of both objective qualities and subjective interpretation. A rating here only…
0 runs0 likes1 downloads1 reach8 impact
1795 instances - 9 features - 42 classes - 1 missing values
Information on about 7,800 US colleges, including geographical information, statistics about the attending population, and post-graduation career earnings.
0 runs0 likes0 downloads0 reach8 impact
This dataset reflects incidents of crime in the City of Los Angeles dating back to 2010. This data is transcribed from original crime reports that are typed on paper and therefore there may be some…
0 runs0 likes0 downloads0 reach8 impact
Public procurement data for the European Economic Area, Switzerland, and Macedonia, 2015.
0 runs0 likes0 downloads0 reach8 impact
10% stratified subsample of the original SVHN data
0 runs0 likes0 downloads0 reach9 impact
9927 instances - 3073 features - 10 classes - 0 missing values
Public procurement data for the European Economic Area, Switzerland, and Macedonia, 2015.
0 runs0 likes1 downloads1 reach8 impact
565163 instances - 75 features - 0 classes - 15247061 missing values
Anonymized data of dating profiles from OkCupid
0 runs0 likes3 downloads3 reach8 impact
59946 instances - 31 features - 0 classes - 273249 missing values
Chocolate Bar Ratings. Expert ratings of over 1,700 chocolate bars. Each chocolate is evaluated from a combination of both objective qualities and subjective interpretation. A rating here only…
0 runs0 likes1 downloads1 reach8 impact
1794 instances - 9 features - 41 classes - 0 missing values
50% stratified subsample of the original SVHN data
0 runs0 likes0 downloads0 reach9 impact
49644 instances - 3073 features - 10 classes - 0 missing values
nfl_games
0 runs0 likes0 downloads0 reach8 impact
16274 instances - 12 features - classes - 0 missing values
nominal features and target for COMPAS
0 runs0 likes1 downloads1 reach9 impact
5278 instances - 14 features - 2 classes - 0 missing values
Original data from https://github.com/propublica/compas-analysis/ by ProPublica. The data was subsequently preprocessed and reduced to relevant features for classification. The target variable is…
0 runs0 likes1 downloads1 reach10 impact
5278 instances - 14 features - 2 classes - 0 missing values
The dataset contains all the statistics for each player from 2008 to 2016.
0 runs0 likes1 downloads1 reach8 impact
183978 instances - 42 features - classes - 47301 missing values
The dataset contains the Premier League matches for the 2014-2015 season.
0 runs0 likes1 downloads1 reach8 impact
380 instances - 38 features - classes - 9 missing values
The dataset contains the Serie A matches for the 2015-2016 season.
0 runs0 likes0 downloads0 reach8 impact
379 instances - 38 features - classes - 44 missing values
This dataset contains all Premier League matches from 2008 to 2016, with player statistics taken from Sofifa.
0 runs0 likes0 downloads0 reach8 impact
2961 instances - 17 features - classes - 0 missing values
This dataset contains, for each Premier League match of the 2014-2015 season, the probabilities generated with the L2F models, as well as the match odds.
0 runs0 likes0 downloads0 reach8 impact
323 instances - 11 features - classes - 0 missing values
This dataset contains all the player names and player IDs, taken from Sofifa.
0 runs0 likes0 downloads0 reach8 impact
11009 instances - 3 features - classes - 0 missing values
dataset for feature extraction
0 runs0 likes0 downloads0 reach8 impact
69 instances - 37 features - classes - 0 missing values
Information on about 7,800 US colleges, including geographical information, statistics about the attending population, and post-graduation career earnings.
0 runs0 likes1 downloads1 reach9 impact
7063 instances - 50 features - 0 classes - 125494 missing values
This dataset reflects incidents of crime in the City of Los Angeles dating back to 2010. This data is transcribed from original crime reports that are typed on paper and therefore there may be some…
0 runs0 likes0 downloads0 reach8 impact
1468825 instances - 26 features - 0 classes - 7881776 missing values
This dataset contains a simulation of the Lorenz attractor with the parameter $\rho$ varying in time. The stable and chaotic regimes alternate.
0 runs0 likes0 downloads0 reach8 impact
4942 instances - 4 features - classes - 0 missing values