Data
Click_prediction_small

Click_prediction_small

active ARFF Publicly available Visibility: public Uploaded 07-01-2019 by Florian Pargent
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
This is the same data as version 5 (OpenML ID = 1220) with '_id' features coded as nominal factor variables.

12 features

click (target)nominal2 unique values
0 missing
impressionnumeric99 unique values
0 missing
url_hashnumeric6941 unique values
0 missing
ad_idnominal19228 unique values
0 missing
advertiser_idnominal6064 unique values
0 missing
depthnumeric3 unique values
0 missing
positionnumeric3 unique values
0 missing
query_idnumeric30748 unique values
0 missing
keyword_idnominal19803 unique values
0 missing
title_idnominal25321 unique values
0 missing
description_idnominal22381 unique values
0 missing
user_idnominal30114 unique values
0 missing

19 properties

39948
Number of instances (rows) of the dataset.
12
Number of attributes (columns) of the dataset.
2
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
5
Number of numeric attributes.
7
Number of nominal attributes.
0
Percentage of instances having missing values.
0.72
Average class difference between consecutive instances.
0
Percentage of missing values.
0
Number of attributes divided by the number of instances.
41.67
Percentage of numeric attributes.
83.16
Percentage of instances belonging to the most frequent class.
58.33
Percentage of nominal attributes.
33220
Number of instances belonging to the most frequent class.
16.84
Percentage of instances belonging to the least frequent class.
6728
Number of instances belonging to the least frequent class.
1
Number of binary attributes.
8.33
Percentage of binary attributes.

10 tasks

0 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: click
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
Define a new task