Data
Midwest_Survey

Midwest_Survey

active ARFF Publicly available Visibility: public Uploaded 18-06-2020 by Marcos de Paula Bueno
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
The midwest survey dataset contain individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.

28 features

Location_(Census_Region) (target)nominal9 unique values
284 missing
In_your_own_words,_what_would_you_call_the_part_of_?string1008 unique values
1 missing
Personally_identification_as_a_Midwesterner?nominal4 unique values
0 missing
Illinois_in_MW?nominal2 unique values
0 missing
Indiana_in_MW?nominal2 unique values
0 missing
Iowa_in_MW?nominal2 unique values
0 missing
Kansas_in_MW?nominal2 unique values
0 missing
Michigan_in_MW?nominal2 unique values
0 missing
Minnesota_in_MW?nominal2 unique values
0 missing
Missouri_in_MW?nominal2 unique values
0 missing
Nebraska_in_MW?nominal2 unique values
0 missing
North_Dakota_in_MW?nominal2 unique values
0 missing
Ohio_in_MW?nominal2 unique values
0 missing
South_Dakota_in_MW?nominal2 unique values
0 missing
Wisconsin_in_MW?nominal2 unique values
0 missing
Arkansas_in_MW?nominal2 unique values
0 missing
Colorado_in_MW?nominal2 unique values
0 missing
Kentucky_in_MW?nominal2 unique values
0 missing
Oklahoma_in_MW?nominal2 unique values
0 missing
Pennsylvania_in_MW?nominal2 unique values
0 missing
West_Virginia_in_MW?nominal2 unique values
0 missing
Montana_in_MW?nominal2 unique values
0 missing
Wyoming_in_MW?nominal2 unique values
0 missing
ZIP_Codestring2089 unique values
261 missing
Gendernominal2 unique values
275 missing
Agenominal4 unique values
275 missing
Household_Incomenominal5 unique values
343 missing
Educationnominal5 unique values
305 missing

19 properties

2778
Number of instances (rows) of the dataset.
28
Number of attributes (columns) of the dataset.
10
Number of distinct values of the target attribute (if it is nominal).
1744
Number of missing values in the dataset.
355
Number of instances with at least one value missing.
0
Number of numeric attributes.
26
Number of nominal attributes.
92.86
Percentage of nominal attributes.
27.29
Percentage of instances belonging to the most frequent class.
758
Number of instances belonging to the most frequent class.
3.38
Percentage of instances belonging to the least frequent class.
94
Number of instances belonging to the least frequent class.
21
Number of binary attributes.
75
Percentage of binary attributes.
12.78
Percentage of instances having missing values.
2.24
Percentage of missing values.
0.89
Average class difference between consecutive instances.
0
Percentage of numeric attributes.
0.01
Number of attributes divided by the number of instances.

4 tasks

0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
Define a new task