Data
Abstract: This dataset contains time series of mel-frequency cepstral coefficients (MFCCs) corresponding to spoken Arabic digits. Includes data from 44 male and 44 female native Arabic speakers.…
0 runs - 0 likes - 3 downloads - 3 reach - 3 impact
178526 instances - 13 features - classes - 57200 missing values
Two-colour spotted cDNA array data set from a series of experiments to identify which genes in yeast are cell-cycle regulated.
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
6178 instances - 82 features - classes - 59017 missing values
This file holds global land temperatures by country
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
577462 instances - 4 features - classes - 64563 missing values
Holds information on average temperature per country.
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
577462 instances - 4 features - classes - 64563 missing values
Data on tree growth used in the Case Study published in the September, 1995 issue of the Canadian Journal of Statistics. This data set was provided by Dr. Fernando Camacho, Ontario Hydro…
18456 runs - 1 like - 15 downloads - 16 reach - 31 impact
2796 instances - 35 features - 6 classes - 68100 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
Groups information on about 7800 US colleges, including geographical information, statistics about the attending population, and post-graduation career earnings.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7063 instances - 50 features - 0 classes - 125494 missing values
Zurich public transport delay data 2016-10-30 03:30:00 CET - 2016-11-27 01:20:00 CET cleaned and prepared at Open Data Day 2017.
0 runs - 0 likes - 2 downloads - 2 reach - 4 impact
5465575 instances - 15 features - 0 classes - 132617 missing values
One of the biggest challenges of an auto dealership purchasing a used car at an auto auction is the risk that the vehicle might have serious issues that prevent it from being sold to customers. The…
0 runs - 0 likes - 2 downloads - 2 reach - 4 impact
72983 instances - 33 features - 2 classes - 149271 missing values
Wine data gathered by https://www.kaggle.com/zynicide The data was scraped from WineEnthusiast during the week of June 15th, 2017. The code for the scraper can be found at…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
150930 instances - 10 features - classes - 174477 missing values
uci
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
101766 instances - 52 features - classes - 192849 missing values
Author: Marius Lindauer. Date: 27.02.2014. This data set was generated for a publication about claspfolio 2.0, i.e., an algorithm selector for ASP. The algorithm portfolio of clasp (2.1.4)…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
14234 instances - 143 features - 0 classes - 200838 missing values
Anonymized data of dating profiles from OkCupid
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
59946 instances - 31 features - 0 classes - 273249 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The river flow datasets concern the prediction of river network flows for 48 h in the future at…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
9125 instances - 584 features - classes - 356160 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
0 runs - 0 likes - 2 downloads - 2 reach - 2 impact
798964 instances - 12 features - 3 classes - 399482 missing values
This is the dataset used for the 2016 IDA Industrial Challenge, courtesy of Scania. For a full description, see http://archive.ics.uci.edu/ml/datasets/IDA2016Challenge . This dataset contains both the…
7 runs - 0 likes - 0 downloads - 0 reach - 7 impact
76000 instances - 171 features - 2 classes - 1078695 missing values
UserID
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1974675 instances - 10 features - classes - 1974675 missing values
web services evaluations in this table
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
1974675 instances - 10 features - classes - 1974675 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
425240 instances - 79 features - 2 classes - 2734000 missing values
Dataset KDD98 challenge: https://kdd.ics.uci.edu/databases/kddcup98/kddcup98.html The goal is to estimate the return from a direct mailing in order to maximize donation profits. This dataset…
0 runs - 0 likes - 5 downloads - 5 reach - 4 impact
191260 instances - 479 features - 0 classes - 5587563 missing values
This dataset reflects incidents of crime in the City of Los Angeles dating back to 2010. This data is transcribed from original crime reports that are typed on paper and therefore there may be some…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1468825 instances - 26 features - 0 classes - 7881776 missing values
This dataset contains traffic violation information from all electronic traffic violations issued in the County. Any information that can be used to uniquely identify the vehicle, the vehicle owner or…
0 runs - 0 likes - 1 downloads - 1 reach - 0 impact
1578154 instances - 43 features - 4 classes - 8006541 missing values
The KDD Cup 2009 offers the opportunity to work on large marketing databases from the French Telecom company Orange to predict the propensity of customers to switch provider (churn). Churn (wikipedia…
10982 runs - 0 likes - 15 downloads - 15 reach - 17 impact
50000 instances - 231 features - 2 classes - 8024152 missing values
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) KDD Cup 2009 http://www.kddcup-orange.com Converted to ARFF format by TunedIT Customer Relationship Management (CRM) is a key element…
11301 runs - 0 likes - 12 downloads - 12 reach - 17 impact
50000 instances - 231 features - 2 classes - 8024152 missing values
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) KDD Cup 2009 http://www.kddcup-orange.com Converted to ARFF format by TunedIT Customer Relationship Management (CRM) is a key element…
223 runs - 0 likes - 17 downloads - 17 reach - 9 impact
50000 instances - 231 features - 2 classes - 8024152 missing values
General Description 2015-current: greater than $200.00. The Commission categorizes contributions from individuals using the calendar year-to-date amount for political action committee (PAC) and party…
0 runs - 0 likes - 1 downloads - 1 reach - 0 impact
3348209 instances - 21 features - 0 classes - 10786577 missing values
Domain dataset
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
1637 instances - 9839 features - 3 classes - 13231887 missing values
Public procurement data for the European Economic Area, Switzerland, and Macedonia, 2015.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
565163 instances - 75 features - 0 classes - 15247061 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
El Nino Data. Data Type: spatio-temporal. Abstract: The data set contains oceanographic and surface meteorological readings taken from a series of buoys positioned throughout the equatorial Pacific. The…
0 runs - 1 like - 3 downloads - 4 reach - 5 impact
1. Title: Faults in an urban waste water treatment plant 2. Source Information: -- Creators: Manel Poch (igte2@cc.uab.es) Unitat d'Enginyeria Quimica Universitat Autonoma de Barcelona. Bellaterra.…
0 runs - 0 likes - 1 downloads - 1 reach - 5 impact
File README: chscase. A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
File README: chscase. A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs - 0 likes - 1 downloads - 1 reach - 5 impact
We consider the following problem: You are running a cloud computing service, where customers contract to run computing services (tasks). Each task has a duration, an earliest start and latest end,…
0 runs - 0 likes - 7 downloads - 7 reach - 5 impact
Datasets of the Data and Story Library (DASL), a project illustrating the use of basic statistical methods, converted to ARFF format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
One of two multivariate regression data sets from paper industry, from an experiment at the paper plant Norske Skog, Skogn, Norway. They have been described and analysed in: Aldrin, M. (1996),…
0 runs - 0 likes - 2 downloads - 2 reach - 5 impact
This analysis describes and summarizes the relationships between 1987 salaries of major league baseball players and the player's performance. The salary data were taken from Sports Illustrated, April…
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
The USNEWS dataset for the ASA Statistical Graphics Section's 1995 Data Analysis Exposition contains information on over 1300 American colleges and universities. The data may be obtained in either of…
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
This analysis describes and summarizes the relationships between 1987 salaries of major league baseball players and the player's performance. The salary data were taken from Sports Illustrated, April…
0 runs - 0 likes - 2 downloads - 2 reach - 5 impact
USDA, NRCS. 2008. The PLANTS Database ([Web Link], 31 December 2008). National Plant Data Center, Baton Rouge, LA 70874-4490 USA. Abstract: Data has been extracted from the USDA plants database. It…
0 runs - 0 likes - 4 downloads - 4 reach - 3 impact
Abstract: This data contains general demographic information on internet users in 1997. Source: Original Owner: Graphics, Visualization, & Usability Center College of Computing Georgia Institute of…
0 runs - 0 likes - 2 downloads - 2 reach - 3 impact
Gestures from Rest Positions. In: Symposium on Applied Computing (SAC), 2013, Coimbra. Proceedings of the 28th Annual ACM Symposium on Applied Computing (SAC), 2013. p. 46-52. Data Set Information:…
74 runs - 0 likes - 7 downloads - 7 reach - 6 impact
Horsepower treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
0 runs - 0 likes - 1 downloads - 1 reach - 1 impact
Public procurement data for the European Economic Area, Switzerland, and Macedonia, 2015.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Groups information on about 7800 US colleges, including geographical information, statistics about the attending population, and post-graduation career earnings.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Public procurement data for the European Economic Area, Switzerland, and Macedonia, 2015.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Employee remuneration and expenses (earning over 75,000 CAD per year). This data set includes remuneration and expenses from employees earning over 75,000 CAD per year. Attributes: NAME: Name of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
The BoT-IoT dataset was created by designing a realistic network environment in the Cyber Range Lab of The center of UNSW Canberra Cyber. The environment incorporates a combination of normal and…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
Daily air quality measurements in New York, May to September 1973. This data is taken from R.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Groups information on about 7800 US colleges, including geographical information, statistics about the attending population, and post-graduation career earnings.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
This dataset reflects incidents of crime in the City of Los Angeles dating back to 2010. This data is transcribed from original crime reports that are typed on paper and therefore there may be some…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Public procurement data for the European Economic Area, Switzerland, and Macedonia, 2015.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Los Angeles ozone pollution data, 1976
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
The database covers all the international short track games in the last 5 years. Currently it contains only men's 500m. Detailed lap data including personal time and ranking in each game from seasons…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
This dataset is just like the CIFAR-10, except it has 100 classes containing 600 images each. There are 500 training images and 100 testing images per class. The 100 classes in the CIFAR-100 are…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact