Data
youtube-spam-shakira

youtube-spam-shakira

active ARFF CC-BY Visibility: public Uploaded 19-05-2021 by Meilina Reksoprodjo
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Author: Unknown Source: [UCI](https://archive.ics.uci.edu/ml/datasets/YouTube+Spam+Collection) - 2017 Please cite*: [Paper](http://dcomp.sor.ufscar.br/talmeida/youtubespamcollection/) YouTube Spam Collection Shakira dataset It is a public set of comments collected for spam research. It has five datasets composed by 1,956 real messages extracted from five videos that were among the 10 most viewed on the collection period. This dataset only contains information about Shakira. It consists of 174 spam entries and 196 ham entries, leading to a grand total of 370samples. ### Attribute information The collection is composed by one CSV file per dataset, where each line has the following attributes: COMMENT_ID,AUTHOR,DATE,CONTENT,TAG

5 features

COMMENT_IDstring369 unique values
0 missing
AUTHORstring319 unique values
0 missing
DATEstring369 unique values
0 missing
CONTENTstring331 unique values
0 missing
CLASSnumeric2 unique values
0 missing

19 properties

370
Number of instances (rows) of the dataset.
5
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
1
Number of numeric attributes.
0
Number of nominal attributes.
0
Percentage of instances having missing values.
Average class difference between consecutive instances.
0
Percentage of missing values.
0.01
Number of attributes divided by the number of instances.
20
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.

0 tasks

Define a new task