Study
classify
0 datasets, 0 tasks, 0 flows, 0 runs
stanford stuff
0 datasets, 0 tasks, 0 flows, 0 runs
No data.
0 datasets, 0 tasks, 0 flows, 0 runs
Selected regression problems for aggregate model analysis
30 datasets, 0 tasks, 0 flows, 0 runs
a
a
0 datasets, 0 tasks, 0 flows, 0 runs
AFH
55 datasets, 0 tasks, 0 flows, 0 runs
Comparison of linear and non-linear models. [Jupyter Notebook](https://github.com/janvanrijn/linear-vs-non-linear/blob/master/notebook/Linear-vs-Non-Linear.ipynb)
299 datasets, 299 tasks, 5 flows, 1693 runs
text
0 datasets, 0 tasks, 0 flows, 0 runs
Recoil estimation from drifting position data.
0 datasets, 0 tasks, 0 flows, 0 runs
Show how one-hot-encoding impacts the performance of decision trees. See also https://roamanalytics.com/2016/10/28/are-categorical-variables-getting-lost-in-your-random-forests/
0 datasets, 0 tasks, 2 flows, 159 runs
My first test on the platform
0 datasets, 0 tasks, 0 flows, 0 runs
Test
0 datasets, 0 tasks, 0 flows, 0 runs
Dependency parser for news data
0 datasets, 0 tasks, 0 flows, 0 runs
We want to predict the type of a DBpedia resource from its structure in the Knowledge graph. our preliminary study concludes that we can achieve it with accuracy above 90%. Paper submitted to ICWE…
6 datasets, 1 tasks, 0 flows, 0 runs
Studying Weather with machine learning
0 datasets, 0 tasks, 0 flows, 0 runs
Runs made for constructing a meta-dataset in a study on the effects of sparsity on the meta-level.
0 datasets, 803 tasks, 31 flows, 24622 runs
Test Of Random
0 datasets, 0 tasks, 0 flows, 0 runs
We advocate the use of curated, comprehensive benchmark suites of machine learning datasets, backed by standardized OpenML-based interfaces and complementary software toolkits written in Python, Java…
73 datasets, 73 tasks, 0 flows, 0 runs
Benchmark study, using 73 datasets from OpenML-CC18, on the importance of hyperparameter tuning: which parameters are important to tune and which might be set to a default value instead? For each…
59 datasets, 59 tasks, 6 flows, 281063 runs
No data.
0 datasets, 0 tasks, 0 flows, 0 runs
First analysis of CML survey results from over a year
0 datasets, 0 tasks, 0 flows, 0 runs
No data.
0 datasets, 0 tasks, 0 flows, 0 runs
[Sport Data Valley](https://www.sportinnovator.nl/sport-data-valley) is a Dutch initiative to collect, share and analyse datasets on sports and exercise.…
11 datasets, 0 tasks, 0 flows, 0 runs
Data prefetching is a standard technique used to accelerate the access to data stores. In the context of SPARQL endpoints, previous approaches have been based on two main techniques: (1) query…
3 datasets, 3 tasks, 0 flows, 5 runs
Paper submitted to ESWC 2018
0 datasets, 0 tasks, 0 flows, 0 runs
Datasets
218 datasets, 0 tasks, 0 flows, 0 runs
project
0 datasets, 0 tasks, 0 flows, 0 runs
Classifiers in R
0 datasets, 0 tasks, 0 flows, 0 runs
1
0 datasets, 0 tasks, 0 flows, 0 runs
1
0 datasets, 0 tasks, 0 flows, 0 runs
The library contains different multi-class datasets.
27 datasets, 0 tasks, 0 flows, 0 runs
just messing around
0 datasets, 0 tasks, 0 flows, 0 runs
Workflow recomendation experiment using runs considered "human-made"
0 datasets, 60 tasks, 44 flows, 60 runs
A small study of algorithms on datasets provided by the students.
4 datasets, 0 tasks, 0 flows, 0 runs
With the advent of automated machine learning, automated hyperparameter optimization methods are by now routinely used. However, this progress is not yet matched by equal progress on automatic…
0 datasets, 0 tasks, 0 flows, 164911 runs
This collection of datasets and runs was used in the study included in the dissertation, prepared by Miguel Viana Cachada, for the Master in Data Analytics from _Faculdade de Economia do Porto_…
37 datasets, 37 tasks, 0 flows, 13616 runs
Datasets used to evaluate Layered TPOT against 'vanilla' TPOT. Comprises a selection of large datasets, with between 100k and 1m instances each, contains pseudo-synthetic datasets.
18 datasets, 0 tasks, 0 flows, 0 runs
Run experiments on study 14
0 datasets, 0 tasks, 0 flows, 0 runs
A simple study created for a talk at CENISBS
0 datasets, 0 tasks, 1 flows, 60 runs
This study is intented for exploring the platform. Most things will be deleted.
0 datasets, 0 tasks, 0 flows, 0 runs
Here is description in the form of a tutorial: https://medium.com/@alexrachnog/neural-networks-for-algorithmic-trading-multimodal-and-multitask-deep-learning-5498e0098caf; a link to the Github repo is…
0 datasets, 0 tasks, 0 flows, 0 runs
No data.
0 datasets, 0 tasks, 0 flows, 0 runs
Identify best ML for predicting the churn
0 datasets, 0 tasks, 0 flows, 0 runs
This was an study started by Nandana and Mariano in 2016. We started with unsupervised methods, but we could not find good clusters. En 2017 we started with annotated data and here we are. ## Summary…
10 datasets, 4 tasks, 0 flows, 3 runs
This study lists all the experiments described in the paper ...
157 datasets, 0 tasks, 0 flows, 0 runs
ensemble test on diabetes
0 datasets, 0 tasks, 0 flows, 0 runs
No data.
95 datasets, 94 tasks, 6 flows, 2790 runs
testing ball
0 datasets, 0 tasks, 0 flows, 0 runs
test ball
0 datasets, 0 tasks, 0 flows, 0 runs
testing ball
0 datasets, 0 tasks, 0 flows, 0 runs
Containing all datasets, tasks, flows and runs used in the ASLib OpenML Scenario.
442 datasets, 441 tasks, 63 flows, 0 runs
No data.
0 datasets, 0 tasks, 0 flows, 0 runs
This is just to test the new ctree implementation on various problems to check if there is anything where it fails.
0 datasets, 0 tasks, 6 flows, 1458 runs
Authors: Salisu Mamman Abdulrahman, Pavel Brazdil, Jan N. van Rijn, Joaquin Vanschoren Abstract: Algorithm selection methods can be speeded-up substantially by incorporating multi-objective measures…
39 datasets, 39 tasks, 53 flows, 9627 runs
All datasets, tasks, flows and setups used for Chapter 6 in the PhD Thesis "Massively Collaborative Machine Learning"
105 datasets, 105 tasks, 27 flows, 0 runs
this study joins multiple data stream studies
0 datasets, 0 tasks, 0 flows, 0 runs
Iris dataset
0 datasets, 0 tasks, 0 flows, 0 runs
Compare several trees, bagged trees and the random forest.
6 datasets, 6 tasks, 6 flows, 36 runs
See if it's cool.
0 datasets, 0 tasks, 0 flows, 0 runs
Based on three different tasks we want to compare three versions of ksvm - C-svc C classification - spoc-svc Crammer, Singer native multi-class - kbb-svc Weston, Watkins native multi-class
0 datasets, 0 tasks, 2 flows, 19 runs
none
2 datasets, 1 tasks, 2 flows, 15 runs
Paper on OpenML R library. Includes a case study on bagging vs forests
0 datasets, 0 tasks, 0 flows, 80 runs
Study
0 datasets, 0 tasks, 0 flows, 0 runs
An increase in undergraduate registered students in universities largely grown last years. However, the number of graduates remains low. The main cause of this issue is the evasion and / or retention…
0 datasets, 0 tasks, 0 flows, 0 runs
Data mining researchers and practitioners often use general rules of thumb or common data mining wisdom, those are so called data-mining myths. Even though, these myths are not always proven or…
394 datasets, 394 tasks, 25 flows, 11780 runs
a test study. has no value!
0 datasets, 0 tasks, 0 flows, 0 runs
numerai
0 datasets, 0 tasks, 0 flows, 0 runs
A subgroup discovery study.
0 datasets, 3600 tasks, 4 flows, 0 runs
Ensembles of classifiers are among the best performing classifiers available in many data mining applications. Rather than training one classifier, multiple classifiers are trained, and their…
60 datasets, 60 tasks, 6 flows, 389 runs
Feature selection can be of value to classification for a variety of reasons. Real world data sets can be rife with irrelevant features, especially if the data was not gather specifically for the…
394 datasets, 394 tasks, 24 flows, 9454 runs
Benchmarking in Machine Learning is often much more difficult than it seems, and hard to reproduce. This study is a new approach to do a collaborative, in-depth benchmarking of algorithms, and allows…
101 datasets, 101 tasks, 373 flows, 1290 runs
Almost every form of statistical and machine learning method has been applied to learning QSARs at one time or another: linear regression, decision trees, neural networks, nearest-neighbour methods,…
16941 datasets, 0 tasks, 0 flows, 24034 runs
The work will be submitted to ECML-PKDD2016
0 datasets, 0 tasks, 0 flows, 0 runs
Ensembles of classifiers are among the best performing classifiers available in many data mining applications. However, most ensembles developed specifically for the dynamic data stream setting rely…
0 datasets, 62 tasks, 13 flows, 805 runs
Example of collaborative research conducted by means of OpenML NB:
2 datasets, 0 tasks, 0 flows, 0 runs
how should I proceed? [![run at everware](https://img.shields.io/badge/run…
0 datasets, 0 tasks, 0 flows, 0 runs
No data.
0 datasets, 0 tasks, 0 flows, 0 runs
In this study, we investigate and summarize the performance of a wide range of ML algorithms (using its default hyper-parameter values) on a wide range of OpenML classifications tasks. This will yield…
414 datasets, 425 tasks, 104 flows, 17652 runs
To see what a study can do
0 datasets, 0 tasks, 0 flows, 0 runs
This study compares the local and global feature selection strategy on multilabel classification transformation methods
0 datasets, 0 tasks, 0 flows, 0 runs
The task of Quantitative Structure Activity Relationship (QSAR) Learning is to learn a function that, given the structure of a small molecule (a potential drug), outputs the predicted activity of the…
1086 datasets, 1081 tasks, 1 flows, 0 runs
One of the challenges in Machine Learning to find a classifier and parameter settings that work well on a given dataset. Evaluating all possible combinations typically takes too much time, hence many…
0 datasets, 39 tasks, 53 flows, 9627 runs
We investigate the performance of a wide range of classification algorithms on a wide range of datasets to better understand when they perform well and when they don't. This will yield a meta-dataset…
512 datasets, 514 tasks, 63 flows, 91425 runs