Data
No data.
143 runs - 0 likes - 4 downloads - 4 reach - 2 impact
1000000 instances - 39 features - 6 classes - 0 missing values
Automated file upload of BNG(credit-g)
99 runs - 0 likes - 3 downloads - 3 reach - 2 impact
1000000 instances - 21 features - 2 classes - 0 missing values
Automated file upload of BNG(spambase)
98 runs - 0 likes - 3 downloads - 3 reach - 2 impact
1000000 instances - 58 features - 2 classes - 0 missing values
Automated file upload of BNG(optdigits)
100 runs - 1 likes - 1 downloads - 2 reach - 2 impact
1000000 instances - 65 features - 10 classes - 0 missing values
Automated file upload of BNG(ionosphere)
99 runs - 1 likes - 4 downloads - 5 reach - 3 impact
1000000 instances - 35 features - 2 classes - 0 missing values
Automated file upload of BNG(segment)
99 runs - 0 likes - 1 downloads - 1 reach - 2 impact
1000000 instances - 20 features - 7 classes - 0 missing values
Automated file upload of BNG(anneal)
100 runs - 0 likes - 3 downloads - 3 reach - 3 impact
1000000 instances - 39 features - 6 classes - 0 missing values
* Abstract: 9-class version of the poker-hand dataset, with the minority class removed.
1 runs - 0 likes - 2 downloads - 2 reach - 6 impact
1025000 instances - 11 features - 9 classes - 0 missing values
* Abstract: Purpose is to predict poker hands * Source - Creators: Robert Cattral (cattral '@' gmail.com) Franz Oppacher (oppacher '@' scs.carleton.ca) Carleton University, Department of Computer…
1 runs - 0 likes - 4 downloads - 4 reach - 6 impact
1025009 instances - 11 features - 10 classes - 0 missing values
libSVM, AAD group. Dataset from the LIBSVM data repository.
1 runs - 0 likes - 2 downloads - 2 reach - 5 impact
1025010 instances - 11 features - 0 classes - 0 missing values
This is the poker dataset, retrieved 2013-11-14 from the libSVM site. In addition to the preprocessing done there (see LibSVM site for details), this dataset was created as follows: -join test and…
23 runs - 0 likes - 17 downloads - 17 reach - 7 impact
1025010 instances - 11 features - 2 classes - 0 missing values
No data.
253 runs - 0 likes - 7 downloads - 7 reach - 1 impact
1076790 instances - 30 features - 2 classes - 7275 missing values
Dataset created to study concept drift in stream mining. It is constructed by combining the Covertype, Poker-Hand, and Electricity datasets. More details can be found in: Albert Bifet, Geoff Holmes,…
332 runs - 0 likes - 26 downloads - 26 reach - 3 impact
1455525 instances - 73 features - 10 classes - 0 missing values
This dataset reflects incidents of crime in the City of Los Angeles dating back to 2010. This data is transcribed from original crime reports that are typed on paper and therefore there may be some…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1468825 instances - 26 features - 0 classes - 7881776 missing values
Data on predicting clicks on ads in a search engine.
0 runs - 0 likes - 7 downloads - 7 reach - 5 impact
1496391 instances - 12 features - 2 classes - 0 missing values
This dataset contains traffic violation information from all electronic traffic violations issued in the County. Any information that can be used to uniquely identify the vehicle, the vehicle owner or…
0 runs - 0 likes - 1 downloads - 1 reach - 0 impact
1578154 instances - 43 features - 4 classes - 8006541 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
UserID
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1974675 instances - 10 features - classes - 1974675 missing values
web services evaluations in this table
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
1974675 instances - 10 features - classes - 1974675 missing values
Balanced version of click prediction data
36 runs - 0 likes - 13 downloads - 13 reach - 5 impact
1997410 instances - 12 features - 2 classes - 0 missing values
DBpedia with top-474 most frequent YAGO types HMC dataset for type prediction. Ingoing and outgoing properties as features
0 runs - 0 likes - 3 downloads - 3 reach - 3 impact
2886305 instances - 2401 features - classes - 0 missing values
General Description 2015-current: greater than $200.00. The Commission categorizes contributions from individuals using the calendar year-to-date amount for political action committee (PAC) and party…
0 runs - 0 likes - 1 downloads - 1 reach - 0 impact
3348209 instances - 21 features - 0 classes - 10786577 missing values
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) Data set for KDD Cup 1999 Modified by TunedIT (converted to ARFF format)…
4 runs - 1 likes - 19 downloads - 20 reach - 6 impact
4898431 instances - 42 features - 23 classes - 0 missing values
Zurich public transport delay data 2016-10-30 03:30:00 CET - 2016-11-27 01:20:00 CET cleaned and prepared at Open Data Day 2017.
0 runs - 0 likes - 2 downloads - 2 reach - 4 impact
5465575 instances - 15 features - 0 classes - 132617 missing values
## Guess which points belong to signal track [COMET](http://comet.kek.jp/Introduction.html) is an experiment being constructed at the J-PARC proton beam laboratory in Japan. It will search for…
0 runs - 0 likes - 0 downloads - 0 reach - 4 impact
7619400 instances - 6 features - 0 classes - 0 missing values
## Guess which points belong to signal track [COMET](http://comet.kek.jp/Introduction.html) is an experiment being constructed at the J-PARC proton beam laboratory in Japan. It will search for…
0 runs - 0 likes - 1 downloads - 1 reach - 3 impact
7619400 instances - 6 features - 0 classes - 0 missing values
## Guess which points belong to signal track [COMET](http://comet.kek.jp/Introduction.html) is an experiment being constructed at the J-PARC proton beam laboratory in Japan. It will search for…
0 runs - 0 likes - 1 downloads - 1 reach - 3 impact
7619400 instances - 6 features - 0 classes - 0 missing values
Wikidata with top-474 most frequent types and ingoing/outgoing properties as features
0 runs - 0 likes - 14 downloads - 14 reach - 3 impact
19254100 instances - 2331 features - classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
El Nino Data Data Type spatio-temporal Abstract The data set contains oceanographic and surface meteorological readings taken from a series of buoys positioned throughout the equatorial Pacific. The…
0 runs - 1 likes - 3 downloads - 4 reach - 5 impact
1. Title: Faults in an urban waste water treatment plant 2. Source Information: -- Creators: Manel Poch (igte2@cc.uab.es) Unitat d'Enginyeria Quimica Universitat Autonoma de Barcelona. Bellaterra.…
0 runs - 0 likes - 1 downloads - 1 reach - 5 impact
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs - 0 likes - 1 downloads - 1 reach - 5 impact
We consider the following problem: You are running a cloud computing service, where customers contract to run computing services (tasks). Each task has a duration, an earliest start and latest end,…
0 runs - 0 likes - 7 downloads - 7 reach - 5 impact
Datasets of Data And Story Library, a project illustrating the use of basic statistical methods, converted to ARFF format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs - 0 likes - 0 downloads - 0 reach - 6 impact
One of two multivariate regression data sets from paper industry, from an experiment at the paper plant Norske Skog, Skogn, Norway. They have been described and analysed in: Aldrin, M. (1996),…
0 runs - 0 likes - 2 downloads - 2 reach - 5 impact
This analysis describes and summarizes the relationships between 1987 salaries of major league baseball players and the player's performance. The salary data were taken from Sports Illustrated, April…
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
The USNEWS dataset for the ASA Statistical Graphics Section's 1995 Data Analysis Exposition contains information on over 1300 American colleges and universities. The data may be obtained in either of…
0 runs - 0 likes - 0 downloads - 0 reach - 5 impact
This analysis describes and summarizes the relationships between 1987 salaries of major league baseball players and the player's performance. The salary data were taken from Sports Illustrated, April…
0 runs - 0 likes - 2 downloads - 2 reach - 5 impact
USDA, NRCS. 2008. The PLANTS Database ([Web Link], 31 December 2008). National Plant Data Center, Baton Rouge, LA 70874-4490 USA. Abstract: Data has been extracted from the USDA plants database. It…
0 runs - 0 likes - 4 downloads - 4 reach - 3 impact
Abstract: This data contains general demographic information on internet users in 1997. Source: Original Owner: Graphics, Visualization, & Usability Center College of Computing Georgia Institute of…
0 runs - 0 likes - 2 downloads - 2 reach - 3 impact
Gestures from Rest Positions. In: Symposium on Applied Computing (SAC), 2013, Coimbra. Proceedings of the 28th Annual ACM Symposium on Applied Computing (SAC), 2013. p. 46-52. Data Set Information:…
74 runs - 0 likes - 7 downloads - 7 reach - 6 impact
Horsepower treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
0 runs - 0 likes - 1 downloads - 1 reach - 1 impact
Public procurement data for the European Economic Area, Switzerland, and Macedonia, 2015.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Gathers information on about 7800 different US colleges, including geographical information, statistics about the attending population, and post-graduation career earnings.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Public procurement data for the European Economic Area, Switzerland, and Macedonia, 2015.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Employee remuneration and expenses (earning over 75,000 CAD per year). This data set includes remuneration and expenses from employees earning over 75,000 CAD per year. Attributes: NAME: Name of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
The BoT-IoT dataset was created by designing a realistic network environment in the Cyber Range Lab of The center of UNSW Canberra Cyber. The environment incorporates a combination of normal and…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
Daily air quality measurements in New York, May to September 1973. This data is taken from R.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Daily air quality measurements in New York, May to September 1973. This data is taken from R.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Daily air quality measurements in New York, May to September 1973. This data is taken from R.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Daily air quality measurements in New York, May to September 1973. This data is taken from R.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Gathers information on about 7800 different US colleges, including geographical information, statistics about the attending population, and post-graduation career earnings.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
This dataset reflects incidents of crime in the City of Los Angeles dating back to 2010. This data is transcribed from original crime reports that are typed on paper and therefore there may be some…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Public procurement data for the European Economic Area, Switzerland, and Macedonia, 2015.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Los Angeles ozone pollution data, 1976
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
The database covers all the international short track games in the last 5 years. Currently it contains only men's 500m. Detailed lap data including personal time and ranking in each game from seasons…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
This dataset is just like the CIFAR-10, except it has 100 classes containing 600 images each. There are 500 training images and 100 testing images per class. The 100 classes in the CIFAR-100 are…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact